Errors about read group (RG) information

See the Dictionary entry on read groups for more information about what they represent and why they're very important.

Note that the command line examples in this article have not yet been updated for GATK4. However, the principles they illustrate are valid.

Errors about missing or undefined read groups

As detailed in the FAQs about input requirements, GATK expects all read groups appearing in the read data to be specified in the file header, and will fail with an error if it does not find that information (whether there is no read group information in the file, or a subset of reads do not have read groups).

Typically you should add read group information when you perform the original alignments (with e.g. BWA, which has an option to do so). So what do you do if you forgot to do that, and you don't want to have to rerun BWA all over again?

Solution

You can use a Picard tool called AddOrReplaceReadGroups to add the missing information to your input file.

Here's an example:

# throws an error
java -jar GenomeAnalysisTK.jar \
    -T HaplotypeCaller \
    -R reference.fasta \
    -I reads_without_RG.bam \
    -o output.vcf

# fix the read groups
java -jar picard.jar AddOrReplaceReadGroups \
    I= reads_without_RG.bam \
    O=  reads_with_RG.bam \
    SORT_ORDER=coordinate \
    RGID=foo \
    RGLB=bar \
    RGPL=illumina \
    RGSM=Sample1 \
    CREATE_INDEX=True

# runs without error
java -jar GenomeAnalysisTK.jar \
    -T HaplotypeCaller \
    -R reference.fasta \
    -I reads_with_RG.bam \
    -o output.vcf

Note that if you don't know what information to put in the read groups, you should ask whoever performed the sequencing or provided the BAM to give you the metadata you need.

Errors about read group (RG) information

Errors about missing or undefined read groups

Solution

Trending Articles

Notorious Naushad of Ippa gang nabbed

NCERT Solutions for Class 9th Sanskrit Chapter 3 पाथेयम्

मुख मैथुन से उठाएं सेक्स का भरपूर मज़ा, जानें क्या है इसका सही तरीकामुख मैथुन...

Kattangoor Mandal Sarpanch Wardmumber Mobile Numbers List Part II Nalgonda...

Philips VES15.1HE–LA CHASSIS Using VESTEL 17MB95M Main board – service mode,...

Chai Status, Funny Tea Quotes in Hindi, चाय पर शायरी

sunstar exam scanner pdf

Evanescence – Afterlife (From the Netflix Series “Devil May Cry”) – Single...

Shale Hill Secrets (Love-Joint) (ENG+RUS) [L] [10.56GB]

collect2: error: ld returned 1 exit status while compiling openssl

[GET] Dickie Bush and Nicholas Cole – Ghostwriter GPT ($350.00)

Benson Boone – Sorry I’m Here For Someone Else – Single [iTunes Plus M4A]

Brunei reaffirms healthcare commitment

Aaron Powers

Mp3 Download: Mr Raw - Adamma ft. Flavour & Harry B

GTA 5 PPSSPP Zip File Download For Android Mediafire 382 MB

Lawyer Alumna to Speak in WMSU Recognition Day

SOFT COPY ZA NGAIZA CHEMISTRY

Kusvirana neMurume weBusiness Partner Yangu – Beche rakakweshwa ipapo

Building Instruments With Logic Pro Samplers TUTORiAL-HiDERA