Skip to content

Duplicate read

Definition
AI-generated

Duplicate reads are extra copies of the same DNA fragment arising from PCR or optical clustering; pipelines mark duplicates so downstream callers do not overweight the same molecule.

Why it matters in GWAS

Failure to mark duplicates inflates apparent support for variants and distorts allele balance metrics used in QC.

Example usage

"Picard marked each duplicate read; optical duplicates were distinguished from PCR duplicates."

References

  • Picard toolkit: MarkDuplicates. Broad Institute.

Last updated (UTC · Git history)