Skip to content

Canonical Transcript

Definition
AI-generated

A canonical transcript is the representative splice isoform a database or tool uses as the default for a gene—often for displaying consequences, longest coding sequence, or community-curated “principal” isoforms. The exact rule varies by resource (e.g. Ensembl, RefSeq, MANE), so “canonical” is not a single universal sequence.

Why it matters in GWAS

Coding annotations (“missense in gene X”) depend on which transcript is chosen; a variant can be intronic in one isoform and splice-altering in another. Harmonizing TWAS, colocalization, and VEP output requires knowing whether results use canonical, MANE, or all transcripts.

Example usage

"The lead SNP was synonymous in the canonical transcript but fell in an alternative exon used in brain isoforms."

References

  • Ji HJ, Pertea M, Salzberg SL. (2026). Annotating genomes at increased scale and resolution. Nat Rev Genet. https://doi.org/10.1038/s41576-026-00937-3

Last updated (UTC · Git history)