GATK 3.2 was released on July 14, 2014. Highlights are listed below. Read the detailed version history overview here: http://www.broadinstitute.org/gatk/guide/version-history
We also want to take this opportunity to thank super-user Phillip Dexheimer for all of his excellent contributions to the codebase, especially for this release.
Haplotype Caller
- Various improvements were made to the assembly engine and likelihood calculation, leading to more accurate genotype likelihoods (and hence better genotypes).
- Reads are now realigned to the most likely haplotype before being used by the annotations, so AD and DP will now correspond directly to the reads that were used to generate the likelihoods.
- The caller is now more conservative in low complexity regions, which significantly reduces false positive indels at the expense of a little sensitivity; mostly relevant for whole genome calling.
- Small performance optimizations to the function to calculate the log of exponentials and to the Smith-Waterman code (thanks to Nigel Delaney).
- Fixed small bug where indel discovery was inconsistent based on the active-region size.
- Removed scary warning messages for "VectorPairHMM".
- Made VECTOR_LOGLESS_CACHING the default implementation for PairHMM.
- When we subset PLs because alleles are removed during genotyping, we now also subset the AD.
- Fixed bug where reference sample depth was dropped in the DP annotation.
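
To illustrate the PL/AD subsetting item above, here is a minimal Python sketch rather than GATK's actual Java code. It assumes diploid genotypes and the standard VCF PL ordering, where genotype j/k (j <= k) sits at index k*(k+1)/2 + j. The function names `pl_index` and `subset_pl_and_ad` are hypothetical, and re-zeroing the subset PLs is an assumption made here for readability.

```python
def pl_index(j, k):
    """Index of diploid genotype j/k in the VCF PL ordering (0/0, 0/1, 1/1, 0/2, ...)."""
    if j > k:
        j, k = k, j
    return k * (k + 1) // 2 + j


def subset_pl_and_ad(pl, ad, keep):
    """Restrict PL and AD to the alleles in `keep` (sorted allele indices, 0 = reference).

    Hypothetical helper: keeps only genotypes made of retained alleles and
    shifts the PLs so the best remaining genotype has PL 0 (an assumption).
    """
    n = len(keep)
    # Walk the new genotypes in VCF order and look up their old PL values.
    new_pl = [pl[pl_index(keep[j], keep[k])] for k in range(n) for j in range(k + 1)]
    floor = min(new_pl)
    new_pl = [value - floor for value in new_pl]
    # AD has one entry per allele, so subsetting is a direct index lookup.
    new_ad = [ad[a] for a in keep]
    return new_pl, new_ad


# Three alleles (ref, alt1, alt2); drop alt1 and keep ref + alt2.
pl = [50, 0, 30, 60, 40, 90]
ad = [10, 8, 1]
print(subset_pl_and_ad(pl, ad, keep=[0, 2]))
```

Subsetting both fields with the same `keep` list is what keeps AD consistent with the PLs that remain.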
Variant Recalibrator
- The -mode argument is now required.
- The plotting script now uses the theme instead of opt functions to work with recent versions of the ggplot2 R library.
AnalyzeCovariates
- The plotting script now uses the theme instead of opt functions to work with recent versions of the ggplot2 R library.
Variant Annotator
- SB tables are created even if the ref or alt columns have no counts (used in the FS and SOR annotations).
Genotype GVCFs
- Added missing arguments so that it now more closely matches what's available in the Haplotype Caller.
- Fixed recurring error about missing PLs.
- No longer pulls the headers from all input rods including dbSNP, rather just from the input variants.
- --includeNonVariantSites should now be working.
Select Variants
- The dreaded "Invalid JEXL expression detected" error is now a kinder user error.
Indel Realigner
- Now throws a user error when it encounters reads with I operators greater than the number of read bases.
- Fixed bug where reads that are all insertions (e.g. 50I) were causing it to fail.
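
The two CIGAR conditions mentioned above can be sketched in Python as follows. This is an illustrative check only, not the Indel Realigner's implementation. The function names `parse_cigar` and `classify_read` and the returned labels are hypothetical.

```python
import re

CIGAR_ELEMENT = re.compile(r"(\d+)([MIDNSHP=X])")
CIGAR_STRING = re.compile(r"(\d+[MIDNSHP=X])+")


def parse_cigar(cigar):
    """Split a CIGAR string such as '20M5I25M' into (length, operator) pairs."""
    if not CIGAR_STRING.fullmatch(cigar):
        raise ValueError(f"not a valid CIGAR string: {cigar!r}")
    return [(int(length), op) for length, op in CIGAR_ELEMENT.findall(cigar)]


def classify_read(cigar, read_length):
    """Return 'malformed', 'all_insertions', or 'ok' (hypothetical labels).

    'malformed': some I element is longer than the read itself.
    'all_insertions': every element is an insertion, e.g. 50I.
    """
    elements = parse_cigar(cigar)
    if any(op == "I" and length > read_length for length, op in elements):
        return "malformed"
    if all(op == "I" for _, op in elements):
        return "all_insertions"
    return "ok"


for cigar in ("20M5I25M", "50I", "10M60I10M"):
    print(cigar, classify_read(cigar, read_length=50))
```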
CalculateGenotypePosteriors
- Now computes posterior probabilities only for SNP sites with SNP priors (other sites have flat priors applied).
- Now computes genotype posteriors using likelihoods from all members of the trio.
- Added annotations for calling potential de novo mutations.
- Now uses PP tag instead of GP tag because posteriors are Phred-scaled.
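
As a rough illustration of why Phred-scaled posteriors get their own tag, the following Python sketch combines Phred-scaled likelihoods (PLs) with genotype priors and reports the result on the Phred scale. It is not the tool's algorithm: the function name `phred_posteriors` is hypothetical, priors are assumed to be strictly positive, and shifting the output so the best genotype scores 0 is an assumption borrowed from the PL convention.

```python
import math


def phred_posteriors(pl, priors):
    """Combine Phred-scaled likelihoods with priors; return Phred-scaled posteriors.

    Hypothetical helper. `priors` must be strictly positive and have one
    entry per genotype, in the same order as `pl`.
    """
    # Phred scale to linear scale: p = 10^(-Q/10).
    likelihoods = [10 ** (-q / 10) for q in pl]
    unnormalized = [lk * prior for lk, prior in zip(likelihoods, priors)]
    total = sum(unnormalized)
    posteriors = [u / total for u in unnormalized]
    # Back to the Phred scale, shifted so the most probable genotype is 0.
    phred = [-10 * math.log10(p) for p in posteriors]
    floor = min(phred)
    return [round(q - floor) for q in phred]


flat = [1 / 3, 1 / 3, 1 / 3]
print(phred_posteriors([0, 20, 50], flat))
print(phred_posteriors([0, 20, 50], [0.98, 0.01, 0.01]))
```

With flat priors the posteriors come out equal to the input PLs, which matches the note above that sites without informative priors are effectively left unchanged.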
Cat Variants
- Can now process .list files with -V.
- Can now handle BCF and Block-Compressed VCF files.
Validate Variants
- Now works with gVCF files.
- By default, all strict validations are performed; use --validationTypeToExclude to exclude specific tests.
FastaAlternateReferenceMaker
- Now use '--use_IUPAC_sample sample_name' to specify which sample's genotypes should be used for the IUPAC encoding with multi-sample VCF files.
Miscellaneous
- Refactored maven directories and java packages replacing "sting" with "gatk".
- Extended on-the-fly sample renaming feature to VCFs with the --sample_rename_mapping_file argument.
- Added a new read transformer that refactors NDN cigar elements to one N element.
- A Tabix index is now created for block-compressed output formats.
- Switched outputRoot in SplitSamFile to an empty string instead of null (thanks to Carlos Barroto).
- Enabled the AB annotation in the reference model pipeline (thanks to John Wallace).
- We now check that output files are specified in a writeable location.
- We now allow blank lines in a (non-BAM) list file.
- Added legibility improvements to the Progress Meter.
- Allow for non-tab whitespace in sample names when performing on-the-fly sample-renaming (thanks to Mike McCowan).
- Made IntervalSharder respect the IntervalMergingRule specified on the command line.
- Sam, tribble, and variant jars updated to version 1.109.1722; htsjdk updated to version 1.112.1452.
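
The NDN read transformer mentioned above can be pictured with a short Python sketch. This is not the GATK implementation. It assumes that an N-D-N run is replaced by a single N whose length is the sum of the three elements, so the reference span of the read stays the same. The function name `merge_ndn` is hypothetical.

```python
import re

CIGAR_ELEMENT = re.compile(r"(\d+)([MIDNSHP=X])")


def merge_ndn(cigar):
    """Collapse every N-D-N run in a CIGAR string into one N element.

    Hypothetical helper: the merged N takes the summed length of the run,
    which is an assumption made to preserve the reference span.
    """
    merged = []
    for length, op in CIGAR_ELEMENT.findall(cigar):
        merged.append((int(length), op))
        # After each append, check whether the last three elements form N, D, N.
        if [o for _, o in merged[-3:]] == ["N", "D", "N"]:
            total = sum(n for n, _ in merged[-3:])
            merged[-3:] = [(total, "N")]
    return "".join(f"{n}{o}" for n, o in merged)


print(merge_ndn("10M5N2D3N10M"))
```

Because the check runs after every appended element, chained runs such as N-D-N-D-N also collapse into a single N.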