Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

GATK process banner

Need Help?

Search our documentation

Community Forum

Hi, How can we help?

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size. Learn more

Structural Variants Follow

3 comments

  • Avatar
    Siyuan Feng

    Hi Derek,

    While I appreciate very much your definitions and illustrations of SV-related concepts, there definition for "insert size" seem to differ from other sources.

    Here, 'insert size' is defined as:

    the entire length of the DNA fragment including the adapters is called the insert size

    But it is mostly defined as the length of the DNA fragment between the adapters by many other online resources, such as Illumina and a plausible post.

    While it is fine to adopt either definition (as long as the definition is clearly made), I wonder which one applies to the Picard tool "CollectInsertSizeMetrics", as it will affect the way I interpret its output. I ask this question here because this Picard tool refer to the GATK dictionary as a reference (although not to this specific post).

    2
    Comment actions Permalink
  • Avatar
    Jun Hui Ho

    Hi, Derek.

    Thank you for creating this post which helps me in understanding the structural variant. However, I do have a question regarding the category of insertion (INS). 

    Based on the image below that I snipped from this post, insertion (INS) is categorized as balanced structural variant.

    May I know the reason behind this? I thought insertion is an unbalanced SV as a new sequence is inserted into the sample genome but is not present in the reference genome.

    Thank you.

     

    0
    Comment actions Permalink
  • Avatar
    Gökalp Çelik

    Hi Jun Hui Ho

    It is the copy number that does matter when classifying SVs. Apparently insertion of novel DNA sequences in to the original DNA does not change the copy number of the original one therefore it is classified as balanced. 

    Regards. 

    0
    Comment actions Permalink

Please sign in to leave a comment.

Powered by Zendesk