Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size.

(How to) Generate an unmapped BAM from FASTQ or aligned BAM

3 comments

  • bam

    Dear GATK Team,

    Thanks for the nice tutorial.

    However, I think part of the command for listing the alignment tags was left out. It should look like this:

    samtools view 6484_snippet.bam | cut -f 12- | tr '\t' '\n' | cut -d ':' -f1 | sort | uniq

    Greetings, Barbara
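
The pipeline in the comment above lists each distinct tag once. As a minimal sketch of a variant, the function below also counts how many records carry each tag and skips header lines, so it still works on `samtools view -h` output. The name `count_sam_tags` is hypothetical; the function only assumes tab-separated SAM text on standard input, where optional fields start at column 12.

```shell
# Hypothetical helper: read SAM records on stdin and print each optional tag
# followed by the number of records that carry it.
count_sam_tags() {
    awk -F '\t' '
        /^@/ { next }                      # skip header lines (present with samtools view -h)
        {
            for (i = 12; i <= NF; i++) {   # optional fields start at column 12
                split($i, parts, ":")      # TAG:TYPE:VALUE, keep only TAG
                counts[parts[1]]++
            }
        }
        END { for (tag in counts) print tag "\t" counts[tag] }
    ' | sort
}

# Usage, assuming samtools is installed and 6484_snippet.bam is available:
# samtools view 6484_snippet.bam | count_sam_tags
```

Counting rather than only listing shows whether a tag is present on every record or only on some of them.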

  • Priyadarshini Thirunavukkarasu

    Hello

    I created an unmapped BAM file from a FASTQ file (sample 1). When I tried to search the BAM file by query name, I got an 'Exec format error':

    #1_ucheck.bam: unmapped bam file from Sample 1 fastq file

    samtools view 1_ucheck.bam | grep 'A00684:110:H2TYCDMXY:1:1101:2790:1000'
    [E::hts_hopen] Failed to open file 1_ucheck.bam
    [E::hts_open_format] Failed to open file "1_ucheck.bam" : Exec format error
    samtools view: failed to open "1_ucheck.bam" for reading: Exec format error

    I do not understand this error. From the same FASTQ file (sample 1), I could also align the reads to the reference sequence and convert the result into a BAM file. Searching that BAM file by query name did not produce any error. It would be helpful if someone could explain the error with the unmapped BAM file. Thanks.

    #alignedsample1.bam: mapped bam file from sample1 fastq file

    samtools view alignedsample1.bam | grep 'A00684:110:H2TYCDMXY:1:1101:2790:1000'
    A00684:110:H2TYCDMXY:1:1101:2790:1000 99 NC_000012.12 10166659 60 101M = 10166871 313 TGATTGGACAGAAAGCTTCCACTATGTCTCCTTACCCACCCTTCTCCCAATACCTGAACAATTTGCTACTCTCTTCAGTGTTTCTTGGAGATTCAGATTCT FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF:FFFFFFFFFFFFFFFFF:FFFFFFFF:FFFFFFFFFF:: NM:i:0 MD:Z:101 MC:Z:101M AS:i:101 XS:i:0
    A00684:110:H2TYCDMXY:1:1101:2790:1000 147 NC_000012.12 10166871 60 101M = 10166659 -313 GCCGGGCTGAGATCTGTCCCTCCAGTTTCTTTTTCTGGTGAGTTAGGTTTGCTTGCTCTTGTGTTAGGAGGTCAGACACCTGGGATACTGAATCACAGTTG FFFFFFFFFFFFFFFFFFFFFFF:FFFFF,FFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFFF NM:i:0 MD:Z:101 MC:Z:101M AS:i:101 XS:i:20
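
As far as I know, htslib tends to report 'Exec format error' when it cannot recognise a file's format, for example when the file is empty, truncated, or not actually BGZF-compressed. That reading is an assumption and is not confirmed anywhere in this thread. A minimal shell sketch for checking those cases follows; the function names `looks_gzipped` and `check_bam` are hypothetical, and `samtools quickcheck` is only run if samtools is found on the PATH.

```shell
# Hypothetical helper: succeed if the file starts with the gzip magic bytes
# (1f 8b), which every BGZF-compressed BAM file begins with.
looks_gzipped() {
    magic=$(head -c 2 "$1" | od -An -tx1 | tr -d ' \n')
    [ "$magic" = "1f8b" ]
}

# Hypothetical helper: run a few cheap sanity checks on a BAM file.
check_bam() {
    f=$1
    if [ ! -s "$f" ]; then
        echo "$f: missing or empty"
        return 1
    fi
    if ! looks_gzipped "$f"; then
        echo "$f: not gzip/BGZF compressed"
        return 1
    fi
    # When samtools is installed, quickcheck also inspects the header
    # and the end-of-file block.
    if command -v samtools >/dev/null 2>&1; then
        if ! samtools quickcheck "$f"; then
            echo "$f: failed samtools quickcheck"
            return 1
        fi
    fi
    echo "$f: passed basic checks"
}

# Usage: check_bam 1_ucheck.bam
```

If the file fails one of these checks, the likely next step is to look at how the unmapped BAM was written (for example, whether the conversion finished and wrote to the expected path) rather than at the `grep` step.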
  • Soerendip Pidneroes

    Why not provide the commands to download the files and to set up the environment?

