Skip to content

Token

Definition
AI-generated

A token is the atomic unit an LLM’s tokenizer produces from text—often a subword piece rather than a whole word.

Synonyms
Plural
Tokens

Why it matters in GWAS

Long methods sections, full GWAS appendices, or pasted summary-statistics tables can exhaust context or budget; knowing token boundaries helps decide what to RAG-retrieve versus paste, and explains truncation artifacts in variant lists or citations.

Example usage

"We chunked the supplementary table so each chunk stayed under the provider’s per-request token cap."

References

  • Sennrich R, Haddow B, Birch A. (2016). Neural machine translation of rare words with subword units. ACL.

Last updated (UTC · Git history)