Contamination

Definition
AI-generated

Contamination is the presence of DNA from more than one source in a sample, causing distorted genotype calls and abnormal allele balance or heterozygosity patterns.

Why it matters in GWAS

Contaminated samples can produce widespread genotype errors, spurious relatedness to other samples, and inflated missingness, so they are usually excluded during quality control (QC).
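One way such samples might be picked up during QC is a per-sample heterozygosity check. The following is a minimal sketch, not a reference implementation: it assumes genotypes are encoded as alternate-allele counts (0, 1, 2, with None for missing), and the function names and the 3-standard-deviation cutoff are illustrative choices that should be tuned to the dataset.

```python
from statistics import mean, stdev


def heterozygosity_rate(genotypes):
    """Return the fraction of non-missing calls that are heterozygous.

    Assumed encoding (hypothetical): 0/1/2 alternate-allele counts,
    None for a missing call. Returns None if nothing was called.
    """
    called = [g for g in genotypes if g is not None]
    if not called:
        return None
    return sum(1 for g in called if g == 1) / len(called)


def flag_high_heterozygosity(samples, n_sd=3.0):
    """Return sorted IDs of samples whose heterozygosity rate exceeds
    the cohort mean by more than n_sd standard deviations.

    samples: dict mapping sample ID -> list of genotype calls.
    n_sd: illustrative threshold, not a fixed standard.
    """
    rates = {}
    for sample_id, genotypes in samples.items():
        rate = heterozygosity_rate(genotypes)
        if rate is not None:  # skip samples with no called genotypes
            rates[sample_id] = rate

    if len(rates) < 2:
        return []  # a standard deviation needs at least two samples

    cutoff = mean(rates.values()) + n_sd * stdev(rates.values())
    return sorted(sid for sid, rate in rates.items() if rate > cutoff)
```

A cutoff based on the cohort mean and standard deviation only works when the cohort is reasonably large and mostly clean, since a single extreme sample inflates the standard deviation it is being compared against. Flagged samples are candidates for review rather than confirmed contamination.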

Example usage

"The sample showed excess heterozygosity consistent with contamination and was removed before imputation."

References

  • Anderson CA, et al. (2010). Data quality control in genetic case-control association studies. Nat Protoc.
