I use MuTect2 from GATK 3.6 and GATK 3.7 to call variants. I know that MuTect2 downsamples reads and that this has an important influence on the result, so I changed the downsampling level. The default values are:
maxReadsInRegionPerSample = 1000;
minReadsPerAlignmentStart = 5;
I changed these parameters to larger values:
maxReadsInRegionPerSample = 2000;
minReadsPerAlignmentStart = 10;
Then I compiled the code, ran it, and got a result named downsample_2x.vcf. However, compared with the default result, original.vcf, the new result is very strange:
downsample_2x.vcf contains additional variants, which is easy to understand because many more reads are sampled. However, it is also missing variants: some variants in original.vcf do not appear in downsample_2x.vcf (around 200 out of a total of 900 variants). Since more reads are sampled, why do some variants disappear? This is difficult for me to understand. If the result from more reads is more accurate, how should I interpret these 200 missing variants? Any reply will be much appreciated!
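In case it helps anyone reproduce the comparison, here is a minimal sketch of how the two call sets can be diffed. It is not part of GATK; the class name VcfDiff and the methods siteKeys and onlyIn are hypothetical names chosen for illustration. It assumes plain-text (uncompressed) VCF files and treats two records as the same variant when CHROM, POS, REF and ALT match exactly, so it ignores the FILTER column and any differences in how alleles are represented.

```java
import java.io.IOException;
import java.nio.file.Files;
import java.nio.file.Paths;
import java.util.List;
import java.util.Set;
import java.util.TreeSet;

public class VcfDiff {

    // Build one "CHROM:POS:REF:ALT" key per record, skipping header lines.
    // Assumption: records are tab-separated with the standard VCF column order.
    static Set<String> siteKeys(List<String> vcfLines) {
        Set<String> keys = new TreeSet<>();
        for (String line : vcfLines) {
            if (line.isEmpty() || line.startsWith("#")) {
                continue; // meta-information or column header
            }
            String[] f = line.split("\t");
            if (f.length < 5) {
                continue; // not a complete record
            }
            keys.add(f[0] + ":" + f[1] + ":" + f[3] + ":" + f[4]);
        }
        return keys;
    }

    // Keys present in a but absent from b.
    static Set<String> onlyIn(Set<String> a, Set<String> b) {
        Set<String> diff = new TreeSet<>(a);
        diff.removeAll(b);
        return diff;
    }

    public static void main(String[] args) throws IOException {
        if (args.length < 2) {
            System.err.println("usage: java VcfDiff original.vcf downsample_2x.vcf");
            return;
        }
        Set<String> original = siteKeys(Files.readAllLines(Paths.get(args[0])));
        Set<String> changed = siteKeys(Files.readAllLines(Paths.get(args[1])));

        Set<String> lost = onlyIn(original, changed);   // in original.vcf only
        Set<String> gained = onlyIn(changed, original); // in downsample_2x.vcf only

        System.out.println("only in " + args[0] + ": " + lost.size());
        System.out.println("only in " + args[1] + ": " + gained.size());
        for (String key : lost) {
            System.out.println("lost\t" + key);
        }
    }
}
```

Run it as `java VcfDiff original.vcf downsample_2x.vcf`. Listing the lost sites makes it possible to inspect them individually instead of only comparing the totals.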