Version highlights for GATK version 3.5
The last GATK 3.x release of the year 2015 has arrived! The major feature in GATK 3.5 is the eagerly awaited MuTect2 (beta version), which brings somatic SNP and Indel calling to GATK. This is just the...
View ArticlePhasing information for use in downstream tools
Hi, I know that HaplotypeCaller outputs physical phasing information in PG and PGT fields, so as not to confuse physical phasing with pedigree based methods, however if I wanted to use the physical...
View ArticleRelease notes for GATK version 3.5
GATK 3.5 was released on November 25, 2015. Itemized changes are listed below. For more details, see the user-friendly version highlights. New tools MuTect2: somatic SNP and indel caller based on...
View ArticleHC step 2: Local re-assembly and haplotype determination
This document details the procedure used by HaplotypeCaller to re-assemble read data and determine candidate haplotypes as a prelude to variant calling. For more context information on how this fits...
View ArticleHaplotypeCaller/ Variantannotator no allele balance tag for all SNPs
Version 3.1.1. Human normal samples. I couldnt find AlleleBalance and AlleleBalanceBySample tags in my vcf outputs. Tags are not found even for single variant I tried HaplotypeCaller with -all or...
View ArticleHaplotypeCaller GVCF mode Qscript
I tried HaplotypeCaller on whole-genome sample in GVCF mode and it takes several days and ends up incomplete. I would like to use Queue script to parallelize the jobs. I am new to using Queue scripts...
View ArticleInconsistent handling of high base quality scores
It seems as if the handling of sam/bam/cram files with higher-than-usual base quality scores is inconsistent. In particular, a base quality score of 93 causes HaplotypeCaller to throw an exception if a...
View ArticleHaplotypeCaller Re-Alignment Step
Hello community, I am really new to the field of genome analysis (I've got a computer science background). At the moment I am dealing with the topic of variant calling and decided to use / understand...
View ArticleNumber of Haplotypecaller output files
Hello! On certain runs I get only .vcf file as an output and sometimes I see both .vcf and .vcf.idx as output files. Is there an issue with the runs that yield only .vcf file and not .vcf.idx?
View Articleparameter questions for haplotypecaller
Hi GATK, I'm runing the GATK germline variant calling pipeline. I have some questions about realignertargetcreator and haplotypecaller's parameters. Please correct me at any point if I went wrong. For...
View ArticleUsing HC GVCF mode for determining rate of heterozygosity
Hi GATK team, I was hoping I could get some insight on determining rate of heterozygosity from a gvcf file. We have three diploid lizard samples. Each was run through our GATK pipeline using HC in GVCF...
View Articlemany `LowQual` flags in output .vcf when using `HaplotypeCaller` with...
Hi GATK team, I am working on a pipeline for exome sequencing variant calling. And I am only interested in the genotype for some specific positions so I used GENOTYPE_GIVEN_ALLELES mode with given vcf...
View ArticleGeneration of haplotype or phase data for SNPs in the flanking region of STRs
Does anyone know of an effective way to determine haplotypes or phasing data for SNPs and STRs? I understand that STRs are inherently difficult for aligners; however, I'm trying to determine haplotypes...
View ArticleHow the HaplotypeCaller's reference confidence model works
This document describes the reference confidence model applied by HaplotypeCaller to generate genomic VCFs (gVCFS), invoked by -ERC GVCF or -ERC BP_RESOLUTION (see the FAQ on gVCFs for format details)....
View ArticleDifference between vcf directly generated by HC and vcf generated from...
Hi, I have three general questions about using HaplotypeCaller (I know I could have tested by myself, but I figured it might be reliable to get some answer from people who are developing the tool): For...
View ArticleGenotypeGVCFs number of
Hi all, I'm very new to GATK. I'm trying to map an EMS mutation in Arabidopsis. I have fastq files of a wt M3 bulk and a mut M3 bullk (both offspring of the same parent). The strategy is to call for...
View ArticleRough file size for a BP_RESOLUTION GVCF on a whole genome
Hello, Does anyone know a rough estimate of the file size of a gvcf produced at BP_RESOLUTION by the HaplotypeCallerfor a whole genome sequencing experiment. Perhaps a rather simple question, but i...
View ArticleQuestion regarding ReadBackedPhasing
Hi all, Can I use ReadBackedPhasing or some other computational tools to distinguish between paternal and maternal chromosomes/reads based on pair-end DNA sequence data (fastq files)? Thank you.
View ArticleDefault value of --output_mode in HaplotypeCaller
Hello, I am reading the documentation of HaplotypeCaller in order to use this tool for "Single-sample all-sites calling on DNAseq (for -ERC GVCF cohort analysis workflow)". The table in the middle of...
View ArticleLower GATK's HaplotypeCaller threshold for allele frequency as part of a...
Hi! As part of a variant calling pipeline i'm interested in lowering the threshold for allele frequency tolerance in GATK's HaplotypeCaller variant caller to 0.01 (1%), if possible. If not, is there...
View Article