Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

Developed in the Data Sciences Platform at the Broad Institute, the toolkit offers a wide variety of tools with a primary focus on variant discovery and genotyping. Its powerful processing engine and high-performance computing features make it capable of taking on projects of any size.

MarkDuplicates (Picard)

1 comment

  • maisarasora

    Hi,

    I tried to run MarkDuplicates, but I get the following error:

    $ java -jar /home/maiunix/bin/picard.jar MarkDuplicates -I SRR10441201.sorted.bam -O SRR10441201.sorted.marked_duplicates.bam -M SRR10441201.sorted.marked_dup_metrics.txt
    ERROR: Invalid argument '-I'.

    So I checked the BAM file using ValidateSamFile:

    ValidateSamFile -I SRR10441201.sorted.bam -MODE SUMMARY

    And I get the following summary:

    ## HISTOGRAM java.lang.String
    Error Type                           Count
    ERROR:MISSING_READ_GROUP             1
    ERROR:POORLY_FORMATTED_HEADER_TAG    1
    WARNING:RECORD_MISSING_READ_GROUP    48896196

    I'm not sure whether my BAM file has a problem, since I already sorted it using samtools.

    Please comment on this. Thank you.
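
A possible way forward, sketched under assumptions because the Picard version is not stated in the post. Older Picard releases accept only the KEY=VALUE argument style (I=, O=, M=) and reject -I, which may explain the "Invalid argument '-I'" error. The MISSING_READ_GROUP and RECORD_MISSING_READ_GROUP entries suggest the BAM header has no @RG line and the reads carry no RG tag; Picard's AddOrReplaceReadGroups can add both. The read-group values (lib1, unit1, ILLUMINA) and the .rg.bam output name below are placeholders, not values from the post, and the `run` helper with its DRY_RUN switch is only an illustration device so the commands can be inspected before anything is executed.

```shell
#!/bin/sh
# Sketch only: the jar path and sample name are taken from the post above;
# the read-group values are placeholders and must be replaced with real ones.
set -eu

PICARD_JAR="${PICARD_JAR:-/home/maiunix/bin/picard.jar}"
SAMPLE="SRR10441201"
DRY_RUN="${DRY_RUN:-1}"   # 1 = only print the commands; 0 = execute them

run() {
    # Print the command line, and execute it only when DRY_RUN=0.
    printf '+ %s\n' "$*"
    if [ "$DRY_RUN" = "0" ]; then
        "$@"
    fi
}

add_read_groups() {
    # Write a new BAM with an @RG header line and an RG tag on every read.
    # Legacy KEY=VALUE syntax, assumed here because '-I' was rejected.
    run java -jar "$PICARD_JAR" AddOrReplaceReadGroups \
        I="$SAMPLE.sorted.bam" \
        O="$SAMPLE.sorted.rg.bam" \
        RGID="$SAMPLE" RGLB=lib1 RGPL=ILLUMINA RGPU=unit1 RGSM="$SAMPLE"
}

mark_duplicates() {
    # Same MarkDuplicates call as in the post, in KEY=VALUE syntax,
    # reading the BAM produced by add_read_groups.
    run java -jar "$PICARD_JAR" MarkDuplicates \
        I="$SAMPLE.sorted.rg.bam" \
        O="$SAMPLE.sorted.marked_duplicates.bam" \
        M="$SAMPLE.sorted.marked_dup_metrics.txt"
}

add_read_groups
mark_duplicates
```

With DRY_RUN=1 (the default in this sketch) the script only prints the two commands; DRY_RUN=0 runs them. Re-running ValidateSamFile on the new BAM should show whether the read-group errors are gone. The POORLY_FORMATTED_HEADER_TAG error may have a separate cause, so the header is worth inspecting with `samtools view -H`.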