Skip to content

Supervised Learning

Definition
AI-generated

Supervised learning trains models from input–output pairs by minimizing prediction error on labeled data; common objectives include cross-entropy for classification and squared or robust losses for regression, with generalization estimated by held-out or cross-validated performance.

Why it matters in GWAS

Case–control GWAS itself fits supervised models (often linear or mixed) at each variant; PRS training, phenotype imputation from EHR, and variant pathogenicity classifiers are also supervised. Label leakage (related samples, population stratification) can inflate accuracy unless study design and splits address genetics explicitly.

Example usage

"We treated three-year T2D incidence as the supervised learning target and used elastic net and gradient boosting baselines."

References

  • Hastie T, Tibshirani R, Friedman J. (2009). The Elements of Statistical Learning. Springer.

Last updated (UTC · Git history)