Training

Instruction Tuning

Fine-tuning a model on examples of instructions paired with ideal responses.

Instruction tuning, usually carried out as supervised fine-tuning (SFT), trains a pre-trained base model on instruction-response pairs, typically human-written. This turns the raw next-token predictor into an assistant that follows user instructions. Commercial chat models such as ChatGPT, Claude, and Gemini are instruction-tuned on top of their pre-trained base models.
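As a rough illustration of what training on an instruction-response pair can look like, the sketch below turns one pair into next-token prediction targets. Everything in it is an assumption made for illustration: the whitespace "tokenizer", the "### Instruction:" / "### Response:" template, the `<eos>` marker, and the choice to count loss only on response tokens (a common convention, though not the only one). Real pipelines use a trained subword tokenizer and a model-specific chat template.

```python
# Toy sketch: prepare one instruction-response pair for supervised fine-tuning.
# All names and the prompt template here are hypothetical, not a library API.

def toy_tokenize(text):
    """Stand-in tokenizer: one token per whitespace-separated word."""
    return text.split()

def build_sft_example(instruction, response, eos="<eos>"):
    """Return (tokens, loss_mask) for a single training example.

    loss_mask is 0 for prompt tokens and 1 for response tokens, so the
    model is only penalized for mispredicting the response.
    """
    prompt_tokens = toy_tokenize(f"### Instruction: {instruction} ### Response:")
    response_tokens = toy_tokenize(response) + [eos]
    tokens = prompt_tokens + response_tokens
    loss_mask = [0] * len(prompt_tokens) + [1] * len(response_tokens)
    return tokens, loss_mask

def next_token_targets(tokens, loss_mask):
    """Yield (context, target, counted) triples for next-token prediction.

    The model sees `context` and is asked to predict `target`; `counted`
    says whether that prediction contributes to the training loss.
    """
    for i in range(len(tokens) - 1):
        yield tokens[: i + 1], tokens[i + 1], bool(loss_mask[i + 1])

if __name__ == "__main__":
    tokens, mask = build_sft_example("Translate 'hello' to French.", "Bonjour")
    for context, target, counted in next_token_targets(tokens, mask):
        flag = "train" if counted else "skip "
        print(f"{flag} | predict {target!r} after {len(context)} tokens")
```

The training objective is the same next-token prediction used in pre-training; what changes is the data, and here also which positions count toward the loss.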

Related Terms