PRS Clumping and Thresholding

Definition
AI-generated

Clumping and thresholding (C+T) is a strategy for constructing polygenic scores. Variants are first clumped to reduce redundancy caused by linkage disequilibrium (LD): within each clump, only the most significant “index” SNP is retained, and SNPs correlated with it above a chosen r² cutoff are discarded. The retained SNPs are then included according to a series of P-value thresholds from the base GWAS, and the threshold that maximizes prediction in a validation set is often the one selected.
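
The two steps can be illustrated with a minimal sketch. It assumes the GWAS P-values, effect sizes, and a pairwise LD r² matrix are already in memory as NumPy arrays. The function names (clump, ct_scores) and the default r² cutoff of 0.1 are hypothetical choices for illustration, not the interface of PLINK or PRSice-2. Production tools also restrict clumps to a physical distance window, which is omitted here.

```python
import numpy as np

def clump(pvals, r2, r2_max=0.1):
    """Greedy LD clumping: return the indices of the retained index SNPs.

    pvals  : (m,) GWAS P-values
    r2     : (m, m) pairwise LD r^2 matrix from a reference panel (diagonal = 1)
    r2_max : SNPs with r^2 above this to an index SNP are dropped (assumed cutoff)
    """
    order = np.argsort(pvals, kind="stable")  # most significant first
    removed = np.zeros(len(pvals), dtype=bool)
    index_snps = []
    for i in order:
        if removed[i]:
            continue
        index_snps.append(int(i))
        # Drop everything in LD with the index SNP (this also marks i itself).
        removed |= r2[i] > r2_max
    return index_snps

def ct_scores(genotypes, betas, pvals, r2, thresholds, r2_max=0.1):
    """Return one polygenic score vector per P-value threshold.

    genotypes : (n, m) allele dosages (0, 1, 2) for n individuals
    betas     : (m,) GWAS effect sizes
    """
    kept = clump(pvals, r2, r2_max)
    scores = {}
    for t in thresholds:
        selected = [j for j in kept if pvals[j] <= t]
        scores[t] = genotypes[:, selected] @ betas[selected]
    return scores

# Toy data: SNP 0 and SNP 1 are in strong LD, so SNP 1 is clumped away.
pvals = np.array([1e-8, 1e-6, 0.03, 0.2])
betas = np.array([0.5, 0.4, -0.2, 0.1])
r2 = np.eye(4)
r2[0, 1] = r2[1, 0] = 0.8
genotypes = np.array([[2.0, 2.0, 1.0, 0.0],
                      [0.0, 0.0, 2.0, 1.0]])

index_snps = clump(pvals, r2)
scores = ct_scores(genotypes, betas, pvals, r2, thresholds=[5e-8, 0.05, 1.0])
```

Each entry of scores is a candidate polygenic score; the series of thresholds yields one score vector per individual per threshold.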

Why it matters in GWAS

C+T is simple, interpretable, and widely implemented (e.g. PRSice-2, PLINK-based workflows). Understanding it clarifies why the choice of LD reference panel and the multiple testing incurred by trying many P-value thresholds both require careful reporting.
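
One way to see why testing many thresholds needs careful reporting: the threshold with the best validation fit has been selected on that data, so its validation performance is optimistic. The sketch below picks the threshold in a validation subset and reports the fit in a separate held-out subset. It is an illustration under stated assumptions: the helper names (r_squared, pick_threshold), the squared Pearson correlation as the performance metric, and the validation/test split are hypothetical, not the procedure of any particular tool.

```python
import numpy as np

def r_squared(score, phenotype):
    """Squared Pearson correlation between a score and a quantitative phenotype."""
    r = np.corrcoef(score, phenotype)[0, 1]
    return r * r

def pick_threshold(scores_by_threshold, phenotype, val_idx, test_idx):
    """Select the threshold with the highest validation R^2, then report
    the R^2 of that single threshold in the held-out test individuals."""
    val_r2 = {
        t: r_squared(s[val_idx], phenotype[val_idx])
        for t, s in scores_by_threshold.items()
    }
    best = max(val_r2, key=val_r2.get)
    test_r2 = r_squared(scores_by_threshold[best][test_idx], phenotype[test_idx])
    return best, val_r2, test_r2

# Toy data: eight individuals, two candidate thresholds.
phenotype = np.arange(8, dtype=float)
scores_by_threshold = {
    0.05: 2.0 * phenotype + 1.0,                              # tracks the phenotype exactly
    1.0: np.array([1.0, 0.0, 3.0, 2.0, 5.0, 4.0, 7.0, 6.0]),  # noisier score
}
val_idx = np.arange(0, 4)
test_idx = np.arange(4, 8)

best, val_r2, test_r2 = pick_threshold(scores_by_threshold, phenotype, val_idx, test_idx)
```

Reporting all of val_r2 alongside the held-out test_r2, rather than only the best validation value, makes the number of thresholds tried visible to the reader.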

Example usage

"The methods explicitly include PRS Clumping and Thresholding to support interpretation of the main findings."

References

  • Choi SW, Mak TSH, O'Reilly PF. (2020). Tutorial: a guide to performing polygenic risk score analyses. Nat Protoc. https://doi.org/10.1038/s41596-020-0353-1
  • Purcell SM, et al. (2009). Common polygenic variation contributes to risk of schizophrenia and bipolar disorder. Nature.
