Quantcast
Channel: haplotypecaller — GATK-Forum

Could you please let me know any tool, to concatenate the gvcf files? Or...

Could you please let me know any tool, to concatenate the gvcf files? Or there is any other solution to run the intermediate HaplotypeCaller in GVCF mode on parts of the chromosome to speed up the...

View Article


No variants found by HaplotypeCaller

I have human RNA-seq data: paired-end reads (2x150bp) aligned with STAR. I tried to run HaplotypeCaller with usual parameters: `gatk HaplotypeCaller -R GRCh38.primary_assembly.genome.fa -I...

View Article


How to create a Haplotype file from HaplotypeCaller

I am new to use GATK4. I would like to generate a Haplotype file (.bam or vcf file with haplotype information). I followed the Best Practice step till haplotypecaller. REF="refs/ucsc.hg19.fasta"...

View Article

Why a variant site is listed in a GVCF run on a single sample, with no reads...

I have read couple documents on GVCF but still can't understand how it works. Just one example from the GVCF file I got from HaplotypeCaller from a single bam file with `-ERC GVCF` option: ``` chr22...

View Article

How do we know the stretch of variants that have been phased using haplotype...

I am using the following command to run haplotype caller /opt/apps/gatk/4.2.1/gatk HaplotypeCaller --dbsnp /home/dhwani.dholakia/archive/files_required_for_exome_analysis/dbsnp/GRCH37.p17_refseq.vcf -R...

View Article


Regarding different genotype calls for same sample using Unified Genotyper &...

Hello Team, I am doing multi sample cohort exome analysis for which i have called SNPs using both callers (UG & HC).For one of my important SNP, i am observing different genotypes in few samples...

View Article

Mapping a locus to a chromosome in a genome

Hello I am currently working on a maize mutant and wild type data. I have the DNA-Seq data for these samples and I am currently using the GATK pipeline to analyze the data. I am trying to map the...

View Article

Haplotype caller active regions parameters not available in GATK4

I would like to know like GATK 3.6 there was an option to define active regions, is there any option in GATK v4.2.1 The Active region parameters can be seen from this link:-...

View Article


How to analyze separately called gVCFs?

I have VCFs from a WGS experiment on several thousand subjects. I did not process the data through the GATK pipeline myself and have no access to the original BAM files or intermediary files. Each...

View Article


HaplotypeCaller error: Badly formed genome unclippedLoc: Query interval is...

Hi there, I'm running the GATK best practice pipeline (locally on a Linux server without using docker) for Germline short variant discovery (SNPs + Indels) using the Generic germline short variant...

View Article

AD allele depth interpretation

Hello, I have a query on the interpretation of the AD variable in a vcf generated by calling about 800 samples together. The header defines it as: ##FORMAT= and the forum further elaborates: AD is the...

View Article

understand HaplotypeCaller output vcf format

Hi there, I am using GATK4.1.0.0 version on germline pair-end illumina WGS data with following command: ``` gatk4.1.0.0 --java-options '-Xmx5G' HaplotypeCaller -R...

View Article

Regarding optimal size of gvcf file generated by Haplotypecaller

Hi everyone, I am running Haplotypecaller to generate individual gvcf files of 50 samples of Cattle (Bos taurus) after removing Illumina adapters from 30x raw reads with ~ 120 bp read length....

View Article


Are there runtime measurements of the HaplotypeCaller with the PairHMM FPGA...

Hello. I would like to know if there are runtime measurements of the HaplotypeCaller with the "--pairHMM EXPERIMENTAL_FPGA_LOGLESS_CACHING" option against other options such as "--pairHMM...

View Article

Throwing Error: Invalid or corrupt jarfile while using GATK wrapper version...

Hello All, I am using GATK wrapper version: gatk-4.1.2.0, and while trying use it on HaplotypeCaller tool as following, ./gatk HaplotypeCaller --help it is throwing the following error message, Error:...

View Article


Image may be NSFW.
Clik here to view.

Can I disable local reassembly and realignment?

Hello, I want to dentify short variants in RNAseq data. But I find HaplotypeCaller introduces many unusual alignments (see the raw bam and bamout file). Can I disable local reassembly and realignment...

View Article

The bamout file results are inconsistent with the VCF file results

Hi, I use GATK4 were analyzed, and found that took place on a site of "bamout" file multiple mutations, respectively from G mutation is T, the number of reads supported mutation is 14, and from G...

View Article


GATK 3.8: genotype called as HOM_VAR while there is a large number of REF and...

Hi the GATK team, there is a mutation in my (haloplex) bam that was called (REF=C) as HOM_VAR in my bam while there are only a few reads carrying a ALT=A $ samtools mpileup -r "15:42171531-42171531"...

View Article

Force skip INDELs in GATK output

Dear GATK, I ma using GATK4, my syntax is: gatk HaplotypeCaller -R $reference -I in.bam -ERC BP_RESOLUTION --max-alternate-alleles 1 -O out.raw.snps.g.vcf -L $SNP_TARGET In some regions i have curious...

View Article

detail information for --minimum-mapping-quality option in HaplotypeCaller

Hi, I posted the same question by adding the comment to a different post, but I couldn't get the answer yet, so let me post the same one here. I'm using HaplotypeCaller to find variants, however, could...

View Article

HaplotypeCaller single sample mode error

Hello, I am trying to run GATK4 HaplotypeCaller on a bam file from a single individual on Ubuntu 18 with gatk HaplotypeCaller -R Tse_SBAPGDGG_D.fa -I AHP2746_sorted_dedup.bam -ERC GVCF -O AHP2746.gvcf...

View Article


BAM with hight depth : variant called as HOM_VAR but there are almost no ALT...

Hi GATK Team, I'm using 4.1.2.0 / HaplotypeCaller with the following command with a Haloplex BAM (high depth) gatk --java-options "-Xmx5g -Djava.io.tmpdir=." HaplotypeCaller --minimum-mapping-quality...

View Article


VQSR filtering and dbSNP

Hi, For VQSR filtering, it's assumed that all calls made by HaplotypeCaller or MuTect2 are put into VQSR? Polymorphic sites should only be culled AFTER the vcf is annotated with VQSLOD scores, and a...

View Article

How can I force HaplotypeCaller to call a genotype -- want to avoid gt=./.

In a single sample vcf I have homozygous snp that is not assigned a genotype, the genotype is missing (gt=./.). What knobs can I turn on HaplotypeCaller (in GATK 3.8 or 4.1.2) to assign a genotype to...

View Article

Beginner's question: Why can't I start the analysis run on Terra?

Hello. I am new in genomic analysis and GATK. I am trying to start Haplotypecaller workflow on Terra. I have uploaded aligned genomic reads (bam file and bam index) and used appropriate genome assembly...

View Article


Image may be NSFW.
Clik here to view.

Variant Caller and BAM file INFO not matching

Hello, I am using gatk-4.0.3.0 following the Best practices. BWA-> Picard Mark Dup -> GATK Base Recalib/Apply -> HC variant calling per sample For one of our samples, we have a variant called...

View Article

GVCF of amplicon sequencing

Hi, I'm trying to generate a GVCF with HaplotypeCaller based on amplicon (PCR) sequencing, and bases towards the end of the reads (~10 bases last bases) consistently get genotype quality of 0, even...

View Article

Gatk haplotype caller java error

Hello GATK team, I am trying to run gatk haplotypecaller (version 4.1.1.0) on a bam file and used the following command: "java -jar /ufrc/gatk-4.1.1.0/gatk-package-4.1.1.0-local.jar -T HaplotypeCaller...

View Article

Should I use GATK haplotype caller/unified genotyper for finding SNPs in 2...

I have two whole genome sequenced bacterial samples. I have their contigs file with me. I would like to pairwise comparison between isolate1 and isolate2 to identify SNPs between them. Should I use...

View Article



All ALT fields are the same after GenomicsDBImport and GenotypeGVCFs

Hi GATK team, I'm running GATK 4.1.3.0 to perform targeted single-cell genotyping, where each sample in the output VCF is a single cell (10-20k samples total per VCF). I've noticed something strange in...

View Article

Emitting variants with low mapping quality

Hello, I am validating a new exome sequencing analysis pipeline using in-house-generated data from sample NA12878 and the Genome in a Bottle high confidence calls for this sample. I am using GATK...

View Article

HaplotypeCaller generate incorrect heterozygous Calls

Hi, We have used GATK 3.6 and GATK 4.1 haplotype caller, to generate the variants, unfortunately, we are getting lots of incorrect heterozygous calls. Even only one read supports the variant it is...

View Article

How to get a smaller list of deNovo SNPs between 3 genotype

Hello. I am currently working on maize whole genome dataset and I have 3 samples- WT, MT and B73. I obtained the VCF files for all 3 datasets using the haplotype caller. However, the list of SNPs that...

View Article


GATK Exception in HaplotypeCaller

Hi, I am using gatk 4.0.8.1 HaplotypeCaller for making g.vcf. I am running follwing command gatk --java-options "-Xms24g -Xmx48g" HaplotypeCaller -R new_hg38.fa -I S11_.sorted.BQRC.bam -O S11.g.vcf -L...

View Article

What exactly is DP and how filtered is it?

I have read somewhat contradictory things across the documentation as to what DP actually measures, and I can't quite tell if I'm missing some distinction between the "different" DPs. I have read that...

View Article

How can a homozygous reference 0/0 genotype (GT) have a heterozygous phased...

Dear GATK support team, Multi-sample variant calling files made with GATK4 have both genotype, genotype phase and genotype phase block information....

View Article


Discovering singletons with GenotypeGVCFs?

Hi, I have several samples that I ran HaplotypeCaller (in normal mode) with that I am looking to discover germline variants from. I read that GenotypeGVCFs isn't good with discovering singletons, and...

View Article


non-reference allele didn't be called into vcf by HC

Hi GATK team I ran HC joint calling and found out that some non-reference alleles didn't appear in the vcf. Most of these allele seems to reside on the region of relatively poor alignments. My first...

View Article

How does MAPQ affect haplotypecaller during germline calling

I understand that, during germline calling, pairHMM calculates the likelihood of each haplotype by taking base quality into consideration. My question is: what role does MAPQ play during germline...

View Article

Exit code 3 for HaplotypeCaller?

I am getting an exit code 3 for HaplotypeCaller version 4.1.2.0. I checked the command line output and the only thing I can see is this: WARN InbreedingCoeff - Annotation will not be calculated, must...

View Article

Different HaplotypeCaller variant calls based on Java version?

Hi, we are running HaplotypeCaller on two different Linux servers and getting slightly different VCF calls (only with SNVs, not Indels) with identical BAM files despite using the same script and having...

View Article


Using HaplotypeCaller for human samples, should I set the ploidy for non...

I've been working on the basis that this complexity is automatically handled by the algorithm, but i'm not sure. My question specifically relates to HaplotypeCaller in GATK4.

View Article

I have tried creating a GATK project in the IntelliJ IDE as documented in...

It is giving error "error: package com.sun.javadoc does not exist final com.sun.javadoc.ClassDoc classDoc" everytime I run debug 'GATK debug'. Please help me

View Article


Running HaplotypeCaller on a multi-lane sample

Hello, I am interested in running HaplotypeCaller for a multi-lane sample. What is the best way to do this? I followed the protocol here...

View Article

Why does haplotype caller produce a vcf with per-lane output?

I'm using gatk4, I have input data for a single sample across four different lanes: v02_L1.fastq.gz v02_L2.fastq.gz v02_L3.fastq.gz v02_L4.fastq.gz I want to merge this into per sample results....

View Article


Number of Snps differ between individual GVCF files and final vcf file

I have total 96 gvcf file with respective number of variations 190 176 213 201 156 11 201 191 168 138 189 183 197 203 188 192 192 139 145 37 198 205 162 202 133 146 175 137 157 198 162 174 204 109 200...

View Article

What is the current recommended practice for interval padding?

In the GATK Best Practices WDL in github, the GenotypeGVCFs commandline does not specify any interval padding via -ip or --interval-padding parameter....

View Article

Help understanding genotype in spanning deletion notation

Hello, I am working with a collaborator who had sequencing and variant calling done at the Broad. The resulting multi-sample VCF has spanning deletion notation. I am trying to understand what the...

View Article

How to add sample names in VCF?

I am using GATK best practices for germline SNPs and Indels 4.1.2.0. After mapping and recalibration, I run haplotypecaller in GVCF mode. I am combining all vcf files (output from haplotypecaller)...

View Article


How can I debug and develop the algorithm in GATK ,such as haplotypecaller,...

Is there any paper or documents ? and those test data where i need download,such as NA12878.HiSeq.b37.chr20.10_11mb.bam.

View Article