Context window

The maximum number of tokens an LLM can process at once. The system prompt, conversation history, attached files, and your latest message all share this budget.

When the window fills up, something has to be left out. Usually the earliest content is dropped first, so the model appears to "forget" it. Tools such as agentic editors compress, summarize, or selectively retrieve files to keep the most relevant context inside the window.
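As a rough illustration of dropping the earliest content first, here is a minimal sketch. The names `count_tokens` and `fit_to_window` are hypothetical, and counting whitespace-separated words is only a crude stand-in for a real tokenizer, which would give different counts.

```python
def count_tokens(text: str) -> int:
    # Assumption: one token per whitespace-separated word.
    # Real tokenizers split text differently.
    return len(text.split())


def fit_to_window(
    system_prompt: str,
    history: list[str],
    latest: str,
    max_tokens: int,
) -> list[str]:
    # The system prompt and the latest message are always kept,
    # so their cost comes out of the budget first.
    budget = max_tokens - count_tokens(system_prompt) - count_tokens(latest)
    if budget < 0:
        raise ValueError("system prompt and latest message exceed the window")

    kept: list[str] = []
    # Walk the history from newest to oldest, so the earliest
    # messages are the first ones to be left out.
    for message in reversed(history):
        cost = count_tokens(message)
        if cost > budget:
            break
        kept.append(message)
        budget -= cost

    kept.reverse()  # restore chronological order
    return [system_prompt, *kept, latest]
```

Calling `fit_to_window` with a small `max_tokens` returns the system prompt, the most recent history that still fits, and the latest message. A real tool might summarize the dropped messages instead of discarding them.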
