Training

Pre-Training

The initial large-scale training phase where a model learns language from massive text corpora.

Pre-training is the most compute-intensive phase of model development. The model is trained to predict the next token across trillions of tokens of internet text, books, and code, which instills broad world knowledge and language capability. For frontier models, pre-training is estimated to cost tens to hundreds of millions of dollars, and it is typically performed once per model by the provider.
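To make the next-token objective concrete, here is a minimal sketch using a toy count-based bigram model in place of a neural network. It is an illustration under simplifying assumptions, not how frontier models are built: the corpus, the function names (`train_bigram`, `next_token_prob`, `average_loss`), and the add-alpha smoothing are all hypothetical choices made for this example. What it shares with real pre-training is the quantity being minimized: the average negative log-likelihood of each token given what came before it.

```python
import math
from collections import Counter, defaultdict


def train_bigram(tokens):
    """Count how often each token follows each preceding token."""
    counts = defaultdict(Counter)
    for prev, nxt in zip(tokens, tokens[1:]):
        counts[prev][nxt] += 1
    return counts


def next_token_prob(counts, prev, nxt, vocab_size, alpha=1.0):
    """Add-alpha smoothed estimate of P(nxt | prev)."""
    total = sum(counts[prev].values())
    return (counts[prev][nxt] + alpha) / (total + alpha * vocab_size)


def average_loss(counts, tokens, vocab_size):
    """Mean negative log-likelihood of each token given the previous one."""
    pairs = list(zip(tokens, tokens[1:]))
    nll = -sum(
        math.log(next_token_prob(counts, prev, nxt, vocab_size))
        for prev, nxt in pairs
    )
    return nll / len(pairs)


# Toy corpus (hypothetical); real pre-training uses trillions of tokens.
corpus = "the cat sat on the mat the cat ate".split()
vocab = sorted(set(corpus))

model = train_bigram(corpus)
loss = average_loss(model, corpus, len(vocab))

# Baseline: the loss of a model that guesses uniformly over the vocabulary.
uniform_loss = math.log(len(vocab))

print(f"bigram loss:  {loss:.3f}")
print(f"uniform loss: {uniform_loss:.3f}")
```

On this corpus the bigram loss comes out below the uniform baseline, which is the sense in which the model has "learned" something about the text. A neural language model optimizes the same kind of loss with gradient descent over a much longer context, at vastly larger scale.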

Related Terms