Genome Analysis Toolkit

Variant Discovery in High-Throughput Sequencing Data

GenomeSTRiP analysis error

8 comments

  • Bob Handsaker

    Can you post the gender map file?

     

  • Thandeka

    The issue is with the highlighted sample (ERR1955423). I have attached an image of the gender map file and also posted part of the file below. I'm analyzing over 200 samples.

    190618_FD09251561 M
    190618_FD09251568 F
    190618_FD09251569 F
    ERR1955423 F
    ERR1955529 M
    ERR1955524 M
    ERR1955438 F
    ERR1955419 F
    ERR1955398 M
    ERR1955487 M
    ERR1955462 F
    ERR1955431 F
    ERR1955515 M
    ERR1955528 F
    ERR1955413 M
    ERR1955481 F
    ERR1955427 M
    ERR1955461 F
    ERR1955397 M
    ERR1955482 M
    ERR1955432 F
    ERR1955464 M
    ERR1955457 F
    ERR1955477 M
    ERR1955470 F
    ERR1955538 M
    ERR1955469 F
    ERR1955430 M
    ERR1955493 F
    ERR1955460 M
    ERR1955534 F
    ERR1955420 M
    ERR1955443 F
    ERR1955406 M
    ERR1955527 F
    ERR1955429 M
    ERR1955425 F
    ERR1955463 M
    ERR1955516 M
    ERR1955451 M
    ERR1955526 F
    ERR1955476 M
    ERR1955512 F
    ERR1955513 M
    ERR1955488 F
    ERR1955412 M 

  • Bob Handsaker

    Is there some reason you are not using the file produced by preprocessing (sample_gender.report.txt)?  This is what will be used by default if you do not supply your own gender file.

    Does your file have a header (which for a two-column file must be SAMPLE GENDER)?

    Are the line terminators correct (Unix style)?

     

     

  • Bob Handsaker

    Do you have two tabs on that line?

     

  • Thandeka

    I did not have any specific reason; I thought it was a requirement to supply a gender map file. I think the best way would be to use sample_gender.report.txt.

    It does not have a header, it uses Unix-style line endings, and it has two tabs: one with the sample and another with the gender.

  • Bob Handsaker

    If you are going to supply your own, it needs to have a header and it should be tab-delimited.

    If it is a two column file, each line should have one tab. I suspect that the line where you are getting an error has two tabs (i.e. three columns) and the second column is empty.

  • Thandeka

    Bob Handsaker, thank you very much. I think using the sample_gender.report.txt produced by preprocessing is the best option.

     

  • Bob Handsaker

    It's usually best, unless you think it is wrong for some reason (due to a really unusual aneuploidy on the sex chromosomes or something like that).
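
For anyone who does supply their own two-column gender map, the checks raised in this thread (a SAMPLE GENDER header, exactly one tab per line, Unix line terminators) can be sketched in Python roughly as follows. This is only an illustrative sketch, not part of Genome STRiP: the function name check_gender_map is hypothetical, and it deliberately does not validate the gender values themselves because the thread does not spell out which values are accepted.

```python
def check_gender_map(data: bytes) -> list:
    """Return a list of problems found in a two-column gender map.

    Hypothetical helper: it only checks the points raised in the thread
    (header, one tab per line, Unix line terminators).
    """
    problems = []

    # Windows (CRLF) or old Mac (CR) terminators both contain a carriage return.
    if b"\r" in data:
        problems.append("carriage return found: line terminators are not Unix style")

    # Normalize CRLF so the remaining checks report on content, not terminators.
    text = data.decode("utf-8").replace("\r\n", "\n")
    lines = text.split("\n")
    if lines and lines[-1] == "":
        lines.pop()  # drop the empty string after the final newline
    if not lines:
        return problems + ["file is empty"]

    # A two-column file must start with the header SAMPLE<TAB>GENDER.
    start = 0
    if lines[0].split("\t") == ["SAMPLE", "GENDER"]:
        start = 1
    else:
        problems.append("line 1: missing header SAMPLE<TAB>GENDER")

    # Every data line needs exactly one tab and two non-empty fields.
    for number, line in enumerate(lines[start:], start=start + 1):
        tabs = line.count("\t")
        if tabs != 1:
            problems.append(f"line {number}: expected 1 tab, found {tabs}")
        elif "" in [field.strip() for field in line.split("\t")]:
            problems.append(f"line {number}: empty sample or gender field")

    return problems
```

To run it on a real file, read the file in binary mode so that the line terminators are preserved, for example check_gender_map(open("gender_map.txt", "rb").read()), where gender_map.txt is a placeholder path. A line with two tabs and an empty middle column, the case suspected above, would be reported as "expected 1 tab, found 2".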
