Core Concepts

Inference

The process of running a trained model to generate outputs from new inputs.

Inference is what happens every time you call an AI API: the model takes your prompt and produces a completion. Unlike training, inference does not update the model's weights, so the same model can serve any number of requests unchanged. For teams that consume a model through an API, it is also the main ongoing cost of a production AI system: you are billed per token processed at inference time, not for the model's training.
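To make the per-token billing concrete, here is a minimal sketch of the arithmetic. The two prices and the function name `inference_cost` are hypothetical placeholders chosen for illustration, not any provider's actual rates or API; it also assumes input and output tokens are priced separately, which is common but not universal.

```python
# Hypothetical prices in USD per million tokens (assumption for illustration;
# real rates vary by provider and model).
INPUT_PRICE_PER_MTOK = 3.00
OUTPUT_PRICE_PER_MTOK = 15.00


def inference_cost(input_tokens: int, output_tokens: int) -> float:
    """Estimate the cost in USD of one inference call from its token counts."""
    total = (input_tokens * INPUT_PRICE_PER_MTOK
             + output_tokens * OUTPUT_PRICE_PER_MTOK)
    return total / 1_000_000


# One request: 1,200 prompt tokens in, 300 completion tokens out.
per_call = inference_cost(1_200, 300)

# The same request served 100,000 times a day.
per_day = per_call * 100_000

print(f"per call: ${per_call:.4f}, per day: ${per_day:,.2f}")
```

The cost of a single call is tiny, but it scales linearly with traffic and with prompt and completion length, which is why inference rather than training tends to dominate the bill for an application built on an API.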

Related Terms