Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

GATK process banner

Need Help?

Search our documentation

Community Forum

Hi, How can we help?

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size. Learn more

Alt Contig tutorial question

Answered
0

7 comments

  • Official comment
    Avatar
    Genevieve Brandt (she/her)

    Hi Linda Do,

    I was looking into this tutorial and I found the summary at the top to have some information about section 6.2:

    The third workflow takes the alt-aware alignments from the first workflow and performs a postalt-processing step as well as the same adjustment from the second workflow. Postalt-processing uses the bwa-postalt.js javascript program that Heng Li provides as a companion to BWA. This allows for variant calling on all alternate contigs including HLA alternate contigs.

    It looks like the altalt.sam is the alt aware alignment from the first workflow, which would be the output from step 3. It is not before or after steps 4, 5, or 6.1, but it is an alternative method. You can read more at the top of the tutorial for more information about the three different variant calling methods in the tutorial.

    Does this help to answer your question?

    Best,

    Genevieve

     

    Comment actions Permalink
  • Avatar
    Genevieve Brandt (she/her)

    Hi,

    The GATK support team is focused on resolving questions about GATK tool-specific errors and abnormal results from the tools. For all other questions, such as this one, we are building a backlog to work through when we have the capacity.

    Please continue to post your questions because we will be mining them for improvements to documentation, resources, and tools.

    We cannot guarantee a reply, however, we ask other community members to help out if you know the answer.

    For context, check out our support policy.

    0
    Comment actions Permalink
  • Avatar
    Linda Do

    Thank you for your answer. So for clarification, does this mean that the correct workflow 3 would be:

    bwa mem chr19chr19KI270866v1alt.fasta 8017read1.fq 8017read2.fq > 8017bwamem.sam

    k8 bwa-postalt.js chr19_chr19_KI270866v1_alt.fasta.alt \ 8017bwamem.sam > altalt_postalt.sam

    and since you said it does not come before steps 4,5, and 6.1, this would be it? just 2 commands?

    So when the tutorial said: 

    The resulting postalt-processed SAM, altalt_postalt.sam, undergoes the same processing as the first workflow (commands 4.1 through 4.7) except that (i) we omit --max_alternate_alleles 3 and --read_filter OverclippedRead options for the HaplotypeCaller command like we did in section 6.1 and (ii) we perform the 0x1 flag removal step from section 6.1.

    I should ignore this paragraph?

     

    Thank you in advance

    0
    Comment actions Permalink
  • Avatar
    Genevieve Brandt (she/her)

    Hi Linda Do,

    Ah, I see! No, I was incorrect before, follow the tutorial and do the processing it recommends to call variants.

    Sorry about the confusion.

    Best,

    Genevieve

    0
    Comment actions Permalink
  • Avatar
    Linda Do

    So for clarification, so altalt.sam is indeed the same as 8017bwamem.sam? And these 2 commands should indeed come before steps 4 through 6.1?

    Thank you in advance!

     

    0
    Comment actions Permalink
  • Avatar
    Genevieve Brandt (she/her)

    Yes, that looks right! Though I am not the original author of the tutorial, but it looks to be the correct way to interpret it.

    0
    Comment actions Permalink
  • Avatar
    Linda Do

    Thank you for your help!

    0
    Comment actions Permalink

Please sign in to leave a comment.

Powered by Zendesk