Back to Glossary

Retrieval-Augmented Generation (RAG)

A technique that enhances AI model responses by retrieving relevant information from external data sources before generating an answer.

Retrieval-augmented generation (RAG) combines the generative capabilities of language models with information retrieval systems. Instead of relying solely on what a model learned during training, RAG first searches a knowledge base — such as company documents, databases, or the web — and feeds the retrieved information into the model as context.

This approach reduces hallucinations, keeps responses grounded in up-to-date facts, and allows models to work with proprietary data they were never trained on. RAG architectures typically use vector databases to store and retrieve document embeddings efficiently.

RAG has become a foundational pattern in enterprise AI. Engineers who can design and optimize RAG pipelines — from chunking strategies to retrieval ranking to prompt composition — are among the most sought-after in the AI job market.

Related AI Job Categories

    Retrieval-Augmented Generation (RAG) — AI Careers Glossary | We Love AI Jobs