Concepts Fondamentaux

RAG (Retrieval-Augmented Generation)

Enhancing model responses by fetching relevant documents from an external knowledge base at query time.

RAG combines a retrieval system (typically a vector database) with a generative model. When a query arrives, relevant passages are fetched and injected into the prompt context before the model responds. This keeps responses factually grounded without the cost of fine-tuning, allows knowledge to be updated without retraining, and is the dominant pattern for production knowledge-base chatbots.

Termes Associés