Core Concepts

Completion

The text output generated by a language model in response to a prompt.

A completion is the model's response to your input prompt. In API terms, completions are output tokens — they are generated one token at a time and are typically priced 2–5× higher than input tokens because generation is more compute-intensive than prefill. The length of the completion directly drives your output token cost.

Related Terms