Token
Definition
AI-generated
A token is the atomic unit an LLM’s tokenizer produces from text—often a subword piece rather than a whole word.
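To make the "subword piece" idea concrete, here is a minimal sketch that splits a word by greedy longest-match against a tiny vocabulary. The vocabulary and the `toy_tokenize` helper are invented for illustration. Real tokenizers learn their vocabularies from data (for example with byte-pair encoding) and will split the same text differently.

```python
# Toy vocabulary, invented for illustration only; real tokenizers learn theirs from data.
TOY_VOCAB = {"geno", "typ", "ing", "rs"}


def toy_tokenize(word, vocab=TOY_VOCAB):
    """Split one word into subword pieces by greedy longest-match."""
    pieces = []
    i = 0
    while i < len(word):
        # Try the longest candidate first, shrinking until something matches.
        for j in range(len(word), i, -1):
            if word[i:j] in vocab:
                pieces.append(word[i:j])
                i = j
                break
        else:
            # Nothing in the vocabulary matches here: fall back to one character.
            pieces.append(word[i])
            i += 1
    return pieces


print(toy_tokenize("genotyping"))  # a common word splits into a few pieces
print(toy_tokenize("rs429358"))    # an identifier falls back to many small pieces
```

In this toy setup the identifier costs more pieces than the ordinary word. That is one plausible reason long variant lists can use up a token budget faster than their character count suggests.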
Topics
Synonyms
Plural
Tokens
Why it matters in GWAS
Long methods sections, full GWAS appendices, and pasted summary-statistics tables can exhaust a model's context window or token budget. Knowing roughly how text splits into tokens helps you decide what to retrieve via RAG and what to paste directly, and it explains truncation artifacts such as variant lists or citations that are cut off partway.
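One way to keep a pasted table within budget is to group its rows into chunks that each stay under a token cap. The sketch below assumes a crude estimate of about four characters per token, which is a rule of thumb and not a provider guarantee. `estimate_tokens` and `chunk_rows` are hypothetical helpers. For real limits, pass in a counter backed by the provider's own tokenizer.

```python
import math


def estimate_tokens(text):
    """Crude stand-in: assume about 4 characters per token (an assumption)."""
    return math.ceil(len(text) / 4)


def chunk_rows(rows, max_tokens, count_tokens=estimate_tokens):
    """Group rows into chunks whose summed token estimate stays within max_tokens.

    Rows are counted individually; separators and surrounding prompt text are
    not counted, so leave headroom below the real cap.
    """
    chunks, current, used = [], [], 0
    for row in rows:
        cost = count_tokens(row)
        if cost > max_tokens:
            raise ValueError("a single row exceeds the token cap")
        # Start a new chunk when adding this row would exceed the cap.
        if current and used + cost > max_tokens:
            chunks.append(current)
            current, used = [], 0
        current.append(row)
        used += cost
    if current:
        chunks.append(current)
    return chunks


# Placeholder rows, not real summary statistics.
rows = ["variant_A\t0.01", "variant_B\t0.20", "variant_C\t0.05"]
for chunk in chunk_rows(rows, max_tokens=8):
    print(len(chunk), "rows")
```

Keeping whole rows together means a variant record is never split across requests, at the cost of chunks that may sit somewhat below the cap.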
Example usage
"We chunked the supplementary table so each chunk stayed under the provider’s per-request token cap."
Related terms
References
- Sennrich R, Haddow B, Birch A. (2016). Neural machine translation of rare words with subword units. ACL.