Quantcast
Channel: Recent Discussions — GATK-Forum
Browsing all 12345 articles
Browse latest View live

Unknown index type error of SelectVariants

Good day to you! I ran a selectvariant command by nohup java -Xmx5g -jar /home/wuwei/software/GenomeAnalysisTK-3.8-0-ge9d806836/GenomeAnalysisTK.jar -R $referrence -T SelectVariants --variant:VCF...

View Article


(How to) Run Spark-enabled GATK tools on a local multi-core machine

This is a placeholder for a document in progress.

View Article


(How to) Run Spark-enabled GATK tools on a Spark cluster

This is a placeholder for documentation in progress.

View Article

Spark

In a nutshell, Spark is a piece of software that GATK4 uses to do multithreading, which is a form of parallelization that allows a computer (or cluster of computers) to finish executing a task sooner....

View Article

Image may be NSFW.
Clik here to view.

What are the smallest units I can break whole human genomes into, for...

Hi, and thank you so much for the wonderful tools and support! For our current project, we'd like to run 2000+ whole genomes from FASTQ to VCF using GATK best practices. I'd like to optimize the...

View Article


HaplotypeCaller/ Variantannotator no allele balance tag for all SNPs

Version 3.1.1. Human normal samples. I couldnt find AlleleBalance and AlleleBalanceBySample tags in my vcf outputs. Tags are not found even for single variant I tried HaplotypeCaller with -all or...

View Article

Estimation of the cross-individual contamination by chromosome

Dear all, We are trying to estimate the cross-individual contamination in our samples (whole genome) with ContEst and we were wondering if we could do it by chromosomes to speed up the process. We...

View Article

prebuilt gatk4 does not show --list on OSX HighSierra+java9

Hi there, Is it possible that mac and or java 9 are not happy with the gatk wrapper? Thanks in advance Stephane $ gatk --list Using GATK jar /opt/biotools/gatk-4.0.0.0/gatk-package-4.0.0.0-local.jar...

View Article


Open Tumor / Normal pair dataset for benchmarking and training

Hi, I am looking for a real tumor normal pair dataset to apply analysis and compare callers. Do you know of one I can get (bam would be quicker but even fastq is OK as I could map against 38 to be...

View Article


Information regarding the GATK Workshop in Taiwan

Hi, a colleague and I are quite interested in doing a GATK 4 training. Today I've seen the announcement in the event calendar that there is one of your larger trainings coming up in March in Taiwan. To...

View Article

CombineGVCFs : key isn't defined in the VCFHeader

Hi there, I am trying to combine gVCFs using CombineGVCFs and get the error: "Key END found in VariantContext field INFO at chr1:10439 but this key isn't defined in the VCFHeader". The gVCFs are...

View Article

Testing FPGA implementation of HaplotypeCaller (PairHMM)

Hi, We are two researchers from the Politecnico di Milano. We are trying to test the FPGA implementation of the HaplotypeCaller (PairHMM) on GATK 3.8-0-ge9d806836, using a Terasic DE5a-Net (Arria 10,...

View Article

GATK4: CollectAlleleCount output & Model ModelSegments

Hi everyone, I am trying to run the CNV discovery pipeline and I have noticed that the header of sample.allelicCounts.tsv (produced by CollectAlleleCount) gives problems when used to rum ModelSegments....

View Article


Dynamically pass multiple input to Picard's MarkDuplicates (multiplexed data)

To pass multiple BAM files to MarkDuplicates we use the following syntax: java -jar picard.jar MarkDuplicates \ INPUT=lane1.bam \ INPUT=lane2.bam \ OUTPUT=dedup.bam \ METRICS_FILE=dedub_metrics.txt...

View Article

merge bam files: dealing with alignment to a split reference database

Dear GATK team, I have a large reference database (NCBI nt) and it's to big to index. I split it to smaller partial dbs and run my alignment tool against each of the partial dbs separately. So for 1...

View Article


Is there a consensus minimum number of probes to use as cutoff for GATK CNV...

Hello, I was just wondering if there was a cutoff used for the minimum number of probes that constitutes a segment. For example, there are a few segment calls made up of 2-3 probes that occur at a...

View Article

Dangling heads and tails

Hello, The logic for merging dangling tails appears to be different from that for dangling heads. For example, it looks like tails can be merged to a non-reference path while heads cannot. Is there a...

View Article


SelectVariants error

I`m calling SNP by haplotype caller,and i got vcf document for each sample. After that , i use selectvariant to select variants of SNP . But i got this error: ##### ERROR...

View Article

How to call SNP without confidence SNP ?

Hello! I have WGS data of 100 samples. There's few people work on my species,so I didn't find confidence SNP set to use. I follow the best-practices of GATK. And my question is how to get convinced SNP...

View Article

Changing compression level in GATK 4.0.0.0

When running GATK 4.0.0.0, (in this case using Apply BQSR) the notice 11:36:10.430 INFO ApplyBQSR - HTSJDK Defaults.COMPRESSION_LEVEL : 1 appears. A bit of digging led me to the Python code in the...

View Article
Browsing all 12345 articles
Browse latest View live


<script src="https://jsc.adskeeper.com/r/s/rssing.com.1596347.js" async> </script>