Arquitectura

Reasoning Model

A model variant that produces explicit step-by-step thinking before answering.

Reasoning models (o1, o3, Claude 3.7 with extended thinking, Gemini Thinking) use chain-of-thought internally during inference, often generating thousands of 'thinking tokens' before producing a final answer. This dramatically improves accuracy on math, science, and logic but increases latency and cost. Thinking tokens may be billed at a discount or separately from output tokens.

Términos Relacionados