Skip to content

Retrieval-Augmented Generation

Definition

Retrieval-augmented generation, or RAG, is a method where an AI system retrieves relevant information before generating an answer or taking action.

RAG grounds AI outputs in a company's documents, knowledge bases, policies, product information, customer history, or other trusted sources. Instead of relying only on model memory, the system pulls current context into the task.

For agent systems, RAG is often the difference between useful and dangerous. An agent handling support, sales, onboarding, or internal questions needs access to the right information at the right moment. Without retrieval, it guesses. With retrieval, it can operate from company-specific truth.