Skip to content

NCBI Reference Sequence (RefSeq)

Definition
AI-generated

RefSeq is NCBI’s curated, non-redundant set of reference sequences for genomes, transcripts, and proteins.

Topics

Why it matters in GWAS

Lead variants are often reported with rs IDs and gene symbols; harmonizing with RefSeq transcript IDs matters for clinical reporting, dbSNP cross-references, and resources that use RefSeq-centric models. MANE selects one RefSeq and one Ensembl transcript per gene to reduce ambiguity.

Example usage

"We mapped GWAS genes to RefSeq NM accessions for consistency with the clinical variant database."

References

  • Goldfarb T, et al. (2025). NCBI RefSeq: reference sequence standards through 25 years of curation and annotation. Nucleic Acids Res.
  • Ji HJ, Pertea M, Salzberg SL. (2026). Annotating genomes at increased scale and resolution. Nat Rev Genet. https://doi.org/10.1038/s41576-026-00937-3

Last updated (UTC · Git history)