Skip to content

GENCODE

Definition
AI-generated

GENCODE is a reference gene annotation project for human and mouse that aims to catalog protein-coding genes, splice isoforms, and noncoding transcripts on the reference genome, combining automated pipelines with manual curation.

Topics

Why it matters in GWAS

VEP and many pipelines default to GENCODE (or Ensembl transcripts derived from the same effort); eQTL and TWAS studies typically report effects relative to specific GENCODE versions. Reproducible cross-cohort analysis requires noting the GENCODE (or Ensembl) release and transcript set.

Example usage

"We annotated variants with Ensembl VEP using GENCODE Comprehensive on GRCh38."

References

  • Mudge JM, et al. (2025). GENCODE 2025: reference gene annotation for human and mouse. Nucleic Acids Res.
  • Ji HJ, Pertea M, Salzberg SL. (2026). Annotating genomes at increased scale and resolution. Nat Rev Genet. https://doi.org/10.1038/s41576-026-00937-3

Last updated (UTC · Git history)