Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size.

picard MarkIlluminaAdapters for Non-Illumina Adapters Error

8 comments

  • SkyWarrior

    Let me give you a small bit of advice. Unfortunately, the MarkIlluminaAdapters tool with this command line does not mark all possible adapter remnants. MGI adapters tend to appear in different orientations. I would strongly suggest using cutadapt to remove all adapters from MGI data before any other preprocessing steps.

  • SkyWarrior

    Just use the cutadapt tool to trim all adapters from the reads. For FastQC, you need to add the MGI adapters to its known adapters list or provide them as parameters on the FastQC command line.

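
The cutadapt suggestion above can be sketched for paired-end FASTQ input as follows. The adapter sequences are the two quoted elsewhere in this thread; assigning the first to read 1 (`-a`) and the second to read 2 (`-A`) is an assumption to verify against your library prep, and the file names and the helper `trim_mgi_cmd` are hypothetical. The sketch assumes FASTQ input, only prints the command so it can be reviewed before running, and does not cover the other adapter orientations mentioned above.

```shell
# Adapter sequences copied from the commands posted in this thread.
# Which one belongs to which read is an ASSUMPTION; check your library prep.
READ1_ADAPTER='AAGTCGGAGGCCAAGCGGTCTTAGGAAGACAA'
READ2_ADAPTER='AAGTCGGATCGTAGCCATGTCGTTCTGTGAGCCAAGGAGTTG'

# Hypothetical helper: print a paired-end cutadapt command.
# Arguments: input R1, input R2, output R1, output R2.
#   -a / -A : 3' adapter for read 1 / read 2
#   -o / -p : trimmed output for read 1 / read 2
trim_mgi_cmd() {
    printf '%s ' cutadapt \
        -a "$READ1_ADAPTER" -A "$READ2_ADAPTER" \
        -o "$3" -p "$4" \
        "$1" "$2"
    printf '\n'
}

# Placeholder file names; replace with your own before running the printed command.
trim_mgi_cmd sample_1.fq.gz sample_2.fq.gz trimmed_1.fq.gz trimmed_2.fq.gz
```

Once the printed command looks right, it can be pasted into the shell to do the actual trimming.
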
  • Genevieve Brandt (she/her)

    Hi Adi Abuali,

    I think there is a typo in your command line: the THREE_PRIME_ADAPTER argument should be given as --THREE_PRIME_ADAPTER, with the hyphens. The same goes for the five prime adapter argument.

    You can read more here: https://gatk.broadinstitute.org/hc/en-us/articles/360057439471-MarkIlluminaAdapters-Picard-

    Best,

    Genevieve

  • Adi Abuali

    Hi Genevieve,

    Thanks, but I tried this before and it gave me a different error, shown below:

    picard MarkIlluminaAdapters -I /mnt/d/gen/uAdi.bam -O /mnt/d/gen/uAdi.bam_mark.bam -M /mnt/d/gen/metrics.txt --ADAPTERS null --THREE_PRIME_ADAPTER:AAGTCGGAGGCCAAGCGGTCTTAGGAAGACAA --FIVE_PRIME_ADAPTER:AAGTCGGATCGTAGCCATGTCGTTCTGTGAGCCAAGGAGTTG

    No argument value found for tagged argument: THREE_PRIME_ADAPTER:AAGTCGGAGGCCAAGCGGTCTTAGGAAGACAA

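
The error above mentions a "tagged argument", which suggests the parser read the text after the colon as a tag rather than as the adapter sequence. In Picard's newer command-line syntax, an argument name and its value are separated by a space. A minimal sketch of that form follows, reusing the paths and adapter sequences from this thread; `--ADAPTERS null` is carried over unchanged from the command above, and the helper name `mark_adapters_cmd` is hypothetical. It only prints the command so it can be reviewed first, and it has not been run against this data.

```shell
# Adapter sequences copied from the commands posted in this thread.
THREE_PRIME='AAGTCGGAGGCCAAGCGGTCTTAGGAAGACAA'
FIVE_PRIME='AAGTCGGATCGTAGCCATGTCGTTCTGTGAGCCAAGGAGTTG'

# Hypothetical helper: print a new-style MarkIlluminaAdapters command.
# Arguments: input BAM, output BAM, metrics file.
# Each argument name is followed by a space and then its value (no ':' or '=').
mark_adapters_cmd() {
    printf '%s ' picard MarkIlluminaAdapters \
        -I "$1" -O "$2" -M "$3" \
        --ADAPTERS null \
        --THREE_PRIME_ADAPTER "$THREE_PRIME" \
        --FIVE_PRIME_ADAPTER "$FIVE_PRIME"
    printf '\n'
}

# Paths taken from the thread.
mark_adapters_cmd /mnt/d/gen/uAdi.bam /mnt/d/gen/uAdi.bam_mark.bam /mnt/d/gen/metrics.txt
```

The legacy KEY=VALUE form posted in the next reply is the one confirmed to work in this thread.
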
  • Adi Abuali

    It finally works with the syntax below:

    gatk MarkIlluminaAdapters I=/mnt/d/gen/uAdi.bam O=/mnt/d/gen/uAdi.bam_mark.bam M=/mnt/d/gen/metrics.txt ADAPTERS=null FIVE_PRIME_ADAPTER=AAGTCGGATCGTAGCCATGTCGTTCTGTGAGCCAAGGAGTTG THREE_PRIME_ADAPTER=AAGTCGGAGGCCAAGCGGTCTTAGGAAGACAA TMP_DIR=/mnt/d/tmp

     

    Thanks,

  • Genevieve Brandt (she/her)

    Thanks for the update and posting your solution Adi Abuali, glad you were able to solve the problem!

  • Adi Abuali

    Thank you, SkyWarrior. I know it's not a GATK tool either, but would you also recommend the fastp tool for MGI data, using the same approach?

  • Adi Abuali

    Thank you for the advice. Appreciated.

