Quantcast
Browsing all 12345 articles
Browse latest View live

What's in the resource bundle and how can I get it?

NOTE: we are currently working on overhauling the bundle to 1) add support for Hg38 and 2) document the provenance of the resource files more fully. 1. Obtaining the bundle We provide these files (or...

View Article


BQSR for WES data generated by different exome-capture platforms

Hi everyone, We have over thousand WES samples generated by two different exome-capture platforms. Samples were multiplexed and sequenced on Illumina HiSeq, with each lane containing 3-10 samples....

View Article


I am getting this error in VariantFiltration --genotypeFilterExpression

$java -jar $gatk -T VariantFiltration -R /storage/sandesh/TargetSpot/genome/Corca1_AssemblyScaffolds.fa -o snpsDP10.vcf --variant snps.vcf --genotypeFilterExpression "DP < 9" --genotypeFilterName...

View Article

Creating an ubuntu snap package for gatk

I would like to create an ubuntu snap package (snapcraft.io) for gatk, to make it easy for myself and others to install and run gatk like any other program. However, the licence forbids re-distribution...

View Article

No varaint calling with Wgsim simulated reads

I have generated 1 million 600bp single ended reads using Wgsim read simulator from chr21 of hg19. All the bases have 40 base quality. The reads are aligned with BWA-MEM and then sorted and...

View Article


FastaAlternateReferenceMaker to consensus sequence?

How do I create a consensus seq with this command? By default, it fill the gaps with Reference seq but I don't want it, that's not a consensus sequence. I need the gaps be filled by Ns. How to do it?...

View Article

Going public with pipeline scripts

Folks, it really makes my day when I get to announce some good news that has been cooking for a long time. So this is going to be a very happy Humpday indeed. The good news (which I may have hinted at...

View Article

Fix chromosome namings in...

Hi, I would say this is my fault but would you mind bring in sync your file in the hg38 bundle on your FTP server? java -jar GATK/GenomeAnalysisTK-3.6/GenomeAnalysisTK.jar -T BaseRecalibrator...

View Article


problem with RnaSeqMetrics in Picard CollectMultipleMetrics

When I try to run CollectMultipleMetrics with PROGRAM=RnaSeqMetrics and I= R= O= set to something I get : Exception in thread "main" java.lang.NullPointerException at...

View Article


Correct understanding of BQSR

Hi, I just wanted to reach clarification on some issues related to BQSR. We work on bacterial genomes, with approximately 8000 in the collection and about 120 new every week. We asked ourself if BQSR...

View Article

Is there any way to deal with the absence of an actual reference genome ?

Hi everyone, I am just discovering GATK, therefore I strongly apologize in advance, as my question may sound a bit naive or missing the point ! I am currently working on RAD-seq data obtain from pools...

View Article

Haplotype caller BP_RESOLUTION :More AD values than alleles called for

My intention is to find different bases called in a particular chromosome location irrespective of it being assigned as SNP/badbase. I user the below command: java -jar 3.5/GenomeAnalysisTK.jar -T...

View Article

Hapmap raw variant to VCF conversion?

Hi I would like to obtain snps+genotypes in .vcf format for a number of lymphoblastoid cell lines, part of the hapmap project. Some of them have been sequenced by the 1000genomes project (so I have...

View Article


(howto) Run the genotype refinement workflow

Overview This tutorial describes step-by-step instruction for applying the Genotype Refinement workflow (described in this method article) to your data. Step 1: Derive posterior probabilities of...

View Article

Using depth of coverage metrics for variant evaluation

Overview This document describes the proper use of metrics associated with depth of coverage for the purpose of evaluating variants. The metrics involved are the following: DepthPerAlleleBySample (AD):...

View Article


Image may be NSFW.
Clik here to view.

(How to) Mark duplicates with MarkDuplicates or MarkDuplicatesWithMateCigar

This tutorial updates Tutorial#2799. Here we discuss two tools, MarkDuplicates and MarkDuplicatesWithMateCigar, that flag duplicates. We provide example data and example commands for you to follow...

View Article

Error while running HC on ChrX

Dear GATK, I have a set of 4 bam files that I was able to generate genotypes from using HaplotypeCaller for Chr20. I took the same data and tried to call genotypes for ChrX, and received the following...

View Article


Extracting variants for multiple read groups with VariantsToTable

I have a VCF file representing several related individuals and I want to extract the information into a table using VariantsToTable. All I really need is the genotype for each individual in the set at...

View Article

Very long run-time - GATK HaplotypeCaller GVCF with Ion proton reads

I analyzing the proton reads using GATK Haplotypecaller. but I have an very long run-time to analyze this data with below command. gatk -Xmx48g -Djava.io.tmpdir=./ -T HaplotypeCaller -nct 8 -R...

View Article

whole genome versus whole exome PON

Quick question: Do you recommend using a PON made of whole exome samples, if the cancer dataset is whole exome, and similarly use a PON made of whole genome samples, if the cancer dataset is whole...

View Article
Browsing all 12345 articles
Browse latest View live