Machine Learning¶
Machine learning is a set of methods that fit predictive or generative models from data by optimizing an objective (loss or likelihood), rather than encoding all rules by hand; it spans supervised, unsupervised, self-supervised, and reinforcement learning, implemented with algorithms from linear models to deep neural networks.
Why it matters in GWAS¶
Machine learning is used for polygenic scores, phenotype imputation, variant and splice prediction, single-cell annotation, and mining literature or EHR text. Genetic studies require careful validation (cross-cohort, cross-ancestry, family-based checks) because flexible models can absorb population structure, relatedness, or batch effects that inflate apparent performance.
Example usage¶
"We framed PRS construction as a supervised machine-learning problem with L2-regularized linear models and nested cross-validation."
Related terms¶
References¶
- Jordan MI, Mitchell TM. (2015). Machine learning: Trends, perspectives, and prospects. Science.
Last updated (UTC · Git history)