Skip to content

Harness

Definition
AI-generated

A harness is the scaffolding around a language model or agent run—fixed prompts or tasks, datasets or fixtures, tool mocks or live APIs, checks, metrics, and logging—so behavior can be run, compared, and regression-tested in a repeatable way.

Synonyms
Plural
Harnesses

Why it matters in GWAS

Before relying on LLM outputs for curation or QC messaging, a harness can regression-test prompts on fixed variant examples and summary-statistics snippets so upgrades to models or tools do not silently break allele alignment or column naming.

Example usage

"The harness failed when we switched tokenizer settings—output length exceeded the JSON schema the pipeline parser expected."

References

  • Liang P, et al. (2023). Holistic evaluation of language models (HELM). Trans Mach Learn Res.

Last updated (UTC · Git history)