Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

GATK process banner

Need Help?

Search our documentation

Community Forum

Hi, How can we help?

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size. Learn more

How do I use MarkIlluminaAdapters properly in combination with its metrics for new adapters other than Illumina standards?

0

5 comments

  • Avatar
    Genevieve Brandt (she/her)

    Hi NawarDalila,

    The GATK support team is focused on resolving questions about GATK tool-specific errors and abnormal results from the tools. For all other questions, such as this one, we are building a backlog to work through when we have the capacity.

    Please continue to post your questions because we will be mining them for improvements to documentation, resources, and tools.

    We cannot guarantee a reply, however, we ask other community members to help out if you know the answer.

    For context, check out our support policy.

     

    0
    Comment actions Permalink
  • Avatar
    NawarDalila

    Dear Genevieve Brandt,

    Thanks a lot for your reply. My question was indeed about MarkIlluminaAdapters' abnormal results. Maybe I was not clear enough in that so here I try again:

    Are those those options enough for adding new adapters to be marked by the tool? The options are:

    "--ADAPTERS null \
    --FIVE_PRIME_ADAPTER "AGATCGGAAGAGCACACGTCTGAACTCCAGTCA" \
    --THREE_PRIME_ADAPTER "AGATCGGAAGAGCGTCGTGTAGGGAAAGAGTGT"
    "

    Otherwise, I will be waiting for the backlog :)

    Best,

    Nawar

    0
    Comment actions Permalink
  • Avatar
    Genevieve Brandt (she/her)

    Hi NawarDalila, I will look into this request further to find out if we have any advice.

    1
    Comment actions Permalink
  • Avatar
    Yossi Farjoun

    Hi NawarDalila

     

    The adapters you list seem to be very short, and they seem to only have the illumina-adapter part of the oligo rather than including the entire adapter with the sample-barcode and the primer. For example, the dual indexed Illumina adapter that is included in Picard is:

     

    AATGATACGGCGACCACCGAGATCTACACNNNNNNNNACACTCTTTCCCTACACGACGCTCTTCCGATCT

    and 

    AGATCGGAAGAGCACACGTCTGAACTCCAGTCACNNNNNNNNATCTCGTATGCCGTCTTCTGCTTG

     

    (where the N's are indicating the location and length of the sample-index.)

     

    So I'm wondering if you have provided the complete adapter. The amount of long ends the were clipped indicates that too many bases are being clipped.

    1
    Comment actions Permalink
  • Avatar
    NawarDalila

    Hi Yossi Farjoun,

    Thanks a lot for your reply. If you take a look at the original website for the company (https://www.twistbioscience.com/resources/protocol/unique-dual-index-sequences-protocol-reference-document-spreadsheet-and-sample), then press "Download the Sample Sheet Templates". You can see that they specified those exact adapters in the Sample Sheet for Novaseq6000 standard (The short one that I posted). Usually we don't specify those when using the kit from Illumina (I anticipate it is automatically done as standard). 

    But from your answer, I know now that it will not matter because it is the same from each end. The sequence of the DNA regions will always start after the Ns in the second stage.

    Thanks a lot again, your answer was very helpful and insightful.

    Best,

    Nawar

    0
    Comment actions Permalink

Please sign in to leave a comment.

Powered by Zendesk