Skip to content

GATK indel realigner error in mtDNA variant calling pipeline #12

@asifzubair

Description

@asifzubair

GATK is throwing an error because it complains about ReadGroups. This BioStar might have some solutions on how to overcome this.

root@2f68d7ec1b12:/home/atacseeker_app# java -Xmx4g \
> -Djava.io.tmpdir=`pwd`/tmp \
> -jar ${externaltoolsfolder}GenomeAnalysisTK.jar \
> -T IndelRealigner \
> -R ./data/chr${ref}.fa \
> -I $DIR"/"${SORTED}".bam" \
> -o $DIR"/"${SORTED}".realigned.bam" \
> -targetIntervals ./data/intervals_file_${ref}.list  \
> -known ./data/MITOMAP_HMTDB_known_indels_${ref}.vcf \
> -compress 0;
INFO  03:03:23,347 HelpFormatter - -------------------------------------------------------------------------------- 
INFO  03:03:23,349 HelpFormatter - The Genome Analysis Toolkit (GATK) v2.7-4-g6f46d11, Compiled 2013/10/10 17:27:51 
INFO  03:03:23,350 HelpFormatter - Copyright (c) 2010 The Broad Institute 
INFO  03:03:23,351 HelpFormatter - For support and documentation go to http://www.broadinstitute.org/gatk 
INFO  03:03:23,355 HelpFormatter - Program Args: 
-T IndelRealigner 
-R ./data/chrRSRS.fa 
-I /data/chrM.CAP1727A4-041814-2-N.sorted.nodup.unique_q10minimum.chrRSRS.nodup.sorted.bam 
-o /data/chrM.CAP1727A4-041814-2-N.sorted.nodup.unique_q10minimum.chrRSRS.nodup.sorted.realigned.bam 
-targetIntervals ./data/intervals_file_RSRS.list 
-known ./data/MITOMAP_HMTDB_known_indels_RSRS.vcf 
-compress 0 
INFO  03:03:23,355 HelpFormatter - Date/Time: 2016/06/28 03:03:23 
INFO  03:03:23,356 HelpFormatter - -------------------------------------------------------------------------------- 
INFO  03:03:23,356 HelpFormatter - -------------------------------------------------------------------------------- 
INFO  03:03:23,374 ArgumentTypeDescriptor - Dynamically determined type of ./data/MITOMAP_HMTDB_known_indels_RSRS.vcf to be VCF 
INFO  03:03:24,254 GenomeAnalysisEngine - Strictness is SILENT 
INFO  03:03:24,409 GenomeAnalysisEngine - Downsampling Settings: No downsampling 
INFO  03:03:24,416 SAMDataSource$SAMReaders - Initializing SAMRecords in serial 
INFO  03:03:24,468 SAMDataSource$SAMReaders - Done initializing BAM readers: total time 0.05 
INFO  03:03:25,680 GATKRunReport - Uploaded run statistics report to AWS S3 
##### ERROR ------------------------------------------------------------------------------------------
##### ERROR A USER ERROR has occurred (version 2.7-4-g6f46d11): 
##### ERROR
##### ERROR MESSAGE: SAM/BAM file /data/chrM.CAP1727A4-041814-2-N.sorted.nodup.unique_q10minimum.chrRSRS.nodup.sorted.bam is malformed: 
SAM file doesn't have any read groups defined in the header.  
The GATK no longer supports SAM files without read groups
##### ERROR ------------------------------------------------------------------------------------------

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions