AI Glossary

What is a Token?

A token is the basic unit of text that AI language models process, roughly equivalent to three-quarters of a word or about four characters in English.
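The rule of thumb above can be turned into a quick estimator. The following is a minimal sketch, assuming the four-characters and three-quarters-of-a-word ratios hold for the text in question; the function names are illustrative, and a real tokenizer will give different counts.

```python
import math

# Rules of thumb for English text; real tokenizers will deviate from these.
CHARS_PER_TOKEN = 4
WORDS_PER_TOKEN = 0.75


def estimate_tokens_from_chars(text: str) -> int:
    """Estimate the token count as characters / 4, rounded up."""
    return math.ceil(len(text) / CHARS_PER_TOKEN)


def estimate_tokens_from_words(text: str) -> int:
    """Estimate the token count as words / 0.75, rounded up."""
    return math.ceil(len(text.split()) / WORDS_PER_TOKEN)


sentence = "Tokens are the basic unit of text that language models process."
print("estimate from characters:", estimate_tokens_from_chars(sentence))
print("estimate from words:", estimate_tokens_from_words(sentence))
```

The two estimates usually differ slightly, which is a reminder that both are approximations rather than exact counts.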

What is the core idea behind tokens in AI?

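Tokens are the atoms of language for AI: a model never reads raw text directly, only the sequence of tokens that its tokenizer produces from that text.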

How do tokens in AI differ from related concepts?

| Concept | Difference |
| --- | --- |
| Token vs Word | Words are linguistic units. Tokens are computational units. |
| Token vs Character | Characters are individual letters, digits, or symbols. Tokens are usually subword units made of several characters. |
| Token vs Embedding | Tokens are input units. Embeddings are their numerical representations. |
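To make these distinctions concrete, here is a toy sketch that views the same string as words, characters, tokens, and embeddings. The vocabulary, the hand-picked subword split, and the random embedding table are all hypothetical; a real model learns its vocabulary and embeddings during training.

```python
import random

# Hypothetical toy vocabulary; real tokenizers have tens of thousands of entries.
vocab = {"token": 0, "ization": 1, " is": 2, " fun": 3}

# Toy embedding table: one small vector per token ID.
# Values are random here; in a real model they are learned.
EMBED_DIM = 4
rng = random.Random(0)
embedding_table = [
    [rng.uniform(-1.0, 1.0) for _ in range(EMBED_DIM)] for _ in vocab
]

text = "tokenization is fun"

words = text.split(" ")                        # linguistic units
chars = list(text)                             # individual characters
tokens = ["token", "ization", " is", " fun"]   # hand-picked subword split
token_ids = [vocab[t] for t in tokens]         # integers the model consumes
embeddings = [embedding_table[i] for i in token_ids]  # numerical representations

print(len(words), "words,", len(chars), "characters,", len(tokens), "tokens")
print("token IDs:", token_ids)
print("embedding for first token:", embeddings[0])
```

Note that the tokens concatenate back to the original string, while each embedding is only a vector of numbers looked up by token ID.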

How do tokens in AI work?

What are the limitations of tokens in AI?

Why are tokens in AI important?

Tokens are the fundamental unit of measurement in the AI industry. API pricing, context windows, rate limits, and model limits such as maximum output length are all expressed in tokens.
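As an illustration of a limit measured in tokens, the sketch below checks whether a request fits in a context window. It assumes that the prompt and the reserved output share one window, which is common but not universal; the 8,000-token limit and the function name are hypothetical.

```python
def fits_in_context(
    prompt_tokens: int, max_output_tokens: int, context_window: int
) -> bool:
    """Return True if the prompt plus the reserved output budget fits."""
    return prompt_tokens + max_output_tokens <= context_window


CONTEXT_WINDOW = 8_000  # hypothetical limit, not any specific model's

print(fits_in_context(6_500, 1_000, CONTEXT_WINDOW))
print(fits_in_context(7_500, 1_000, CONTEXT_WINDOW))
```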

How are tokens in AI used in practice?

AI APIs from OpenAI, Anthropic, and Google charge per token. Most providers offer tokenizer tools to estimate costs before making API calls.
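A cost estimate is then simple arithmetic on token counts. The sketch below assumes that input and output tokens are priced separately per million tokens, as providers commonly do; the prices and names are made up for illustration and should be replaced with figures from the provider's current price list.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    """Per-million-token prices in USD (hypothetical values below)."""
    input_usd_per_million: float
    output_usd_per_million: float


def estimate_cost_usd(
    input_tokens: int, output_tokens: int, pricing: Pricing
) -> float:
    """Estimate the cost of one API call from its token counts."""
    total = (
        input_tokens * pricing.input_usd_per_million
        + output_tokens * pricing.output_usd_per_million
    )
    return total / 1_000_000


# Illustrative prices only; not any provider's actual rates.
EXAMPLE_PRICING = Pricing(input_usd_per_million=2.0, output_usd_per_million=8.0)

cost = estimate_cost_usd(
    input_tokens=1_500, output_tokens=500, pricing=EXAMPLE_PRICING
)
print(f"Estimated cost: ${cost:.4f}")
```

In practice, the input token count would come from the provider's tokenizer tool rather than a guess, and the output count is only known after the call completes.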

Frequently Asked Questions

How many tokens are in a typical sentence?
A typical 15-word English sentence is approximately 20 tokens, which follows from the three-quarters-of-a-word rule of thumb (15 / 0.75 = 20). The exact count depends on how long or uncommon the words are and on the specific tokenizer used.
Why do different models have different token counts for the same text?
Different models use different tokenization schemes, so the same text may be split into a different number of tokens depending on the model.
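The effect can be demonstrated with three toy tokenization schemes applied to the same text. None of these is a real model's tokenizer; they are simple stand-ins chosen only to show that the splitting rule determines the count.

```python
import re


def word_level_tokenize(text: str) -> list[str]:
    """Split into words and individual punctuation marks."""
    return re.findall(r"\w+|[^\w\s]", text)


def chunk_tokenize(text: str, size: int = 4) -> list[str]:
    """Split into fixed-size character chunks (a crude subword stand-in)."""
    return [text[i:i + size] for i in range(0, len(text), size)]


def char_level_tokenize(text: str) -> list[str]:
    """Treat every character as its own token."""
    return list(text)


text = "Tokenizers disagree."

for name, tokenize in [
    ("word-level", word_level_tokenize),
    ("4-char chunks", chunk_tokenize),
    ("character-level", char_level_tokenize),
]:
    pieces = tokenize(text)
    print(f"{name}: {len(pieces)} tokens -> {pieces}")
```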