Retrieval-Augmented Generation

A pattern in which an LLM is conditioned at inference time on documents fetched from an external corpus by a retriever, so that generation is grounded in up-to-date, citable knowledge without retraining. RAG is the default means by which LLM agents access long-term memory and shared knowledge bases.

In this vault

Backlinks