
Genome Analysis Toolkit (GATK)

Definition
AI-generated

The Genome Analysis Toolkit (GATK) is an open-source software suite, developed at the Broad Institute, for analyzing high-throughput sequencing data, with a primary focus on variant discovery and genotyping.

Why it matters in GWAS

Many large sequencing studies that supply variants for association analysis, imputation panels, or rare-variant tests document GATK-based calling and QC; knowing the toolkit helps interpret methods sections, filter fields, and reproducibility choices.
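As a small illustration of reading filter fields, the sketch below keeps only VCF records whose FILTER column is `PASS`. The two records are made-up examples, and `passes_filters` is a hypothetical helper, not part of GATK; real pipelines would normally use a dedicated VCF parser.

```python
def passes_filters(vcf_line: str) -> bool:
    """Return True when the FILTER column (7th tab-separated field) is PASS."""
    fields = vcf_line.rstrip("\n").split("\t")
    # VCF data columns: CHROM POS ID REF ALT QUAL FILTER INFO ...
    # A "." in FILTER means no filters were applied; it is treated as not passing here.
    return fields[6] == "PASS"


# Hypothetical records for illustration only.
records = [
    "chr1\t10177\t.\tA\tAC\t812.5\tPASS\tAC=2;AN=4",
    "chr1\t10352\t.\tT\tTA\t45.1\tLowQual\tAC=1;AN=4",
]

kept = [r for r in records if passes_filters(r)]
print(f"{len(kept)} of {len(records)} records pass")
```

Whether to treat `.` as passing is a study-specific choice, which is one reason the methods section matters when reusing published call sets.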

Example usage

"Variants were called following the GATK Best Practices, including base quality score recalibration (BQSR) and joint genotyping across all samples."
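A sentence like this typically corresponds to a sequence of GATK4 tool invocations. The sketch below only builds the command lines, it does not run them. The file names and the `gatk_commands` helper are assumptions for illustration; a real pipeline would add further steps and options (for example duplicate marking, interval lists, and variant filtering).

```python
from typing import List


def gatk_commands(samples: List[str], reference: str, known_sites: str) -> List[List[str]]:
    """Build (but do not run) GATK4 command lines for BQSR and joint genotyping."""
    commands: List[List[str]] = []
    gvcfs: List[str] = []
    for sample in samples:
        table = f"{sample}.recal.table"
        recal_bam = f"{sample}.recal.bam"
        gvcf = f"{sample}.g.vcf.gz"
        # BQSR step 1: model systematic base-quality errors using known variant sites.
        commands.append(["gatk", "BaseRecalibrator", "-R", reference,
                         "-I", f"{sample}.bam", "--known-sites", known_sites, "-O", table])
        # BQSR step 2: write a BAM with recalibrated base qualities.
        commands.append(["gatk", "ApplyBQSR", "-R", reference,
                         "-I", f"{sample}.bam", "--bqsr-recal-file", table, "-O", recal_bam])
        # Per-sample calling in GVCF mode so samples can be genotyped together later.
        commands.append(["gatk", "HaplotypeCaller", "-R", reference,
                         "-I", recal_bam, "-O", gvcf, "-ERC", "GVCF"])
        gvcfs.append(gvcf)

    # Merge the per-sample GVCFs, then genotype the whole cohort at once.
    combine = ["gatk", "CombineGVCFs", "-R", reference]
    for gvcf in gvcfs:
        combine += ["--variant", gvcf]
    combine += ["-O", "cohort.g.vcf.gz"]
    commands.append(combine)
    commands.append(["gatk", "GenotypeGVCFs", "-R", reference,
                     "-V", "cohort.g.vcf.gz", "-O", "cohort.vcf.gz"])
    return commands


if __name__ == "__main__":
    for cmd in gatk_commands(["sampleA", "sampleB"], "ref.fasta", "known_sites.vcf.gz"):
        print(" ".join(cmd))
```

For large cohorts, `GenomicsDBImport` is commonly used in place of `CombineGVCFs` to consolidate GVCFs before `GenotypeGVCFs`; the simpler tool is shown here to keep the sketch short.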

References

  • McKenna A, et al. (2010). The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Research. https://doi.org/10.1101/gr.107524.110
  • DePristo MA, et al. (2011). A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nature Genetics. https://doi.org/10.1038/ng.806
