Question 1

What is RAG (Retrieval-Augmented Generation)?

Accepted Answer

Retrieval-Augmented Generation (RAG) is an AI architecture that combines a language model with a retrieval system. Instead of relying solely on what the model learned during training, a RAG system retrieves relevant documents from an external knowledge base at query time and includes them in the prompt context. Because the answer is grounded in retrieved text, responses tend to be more accurate and up to date, can be checked against their sources, and are less prone to hallucination.

Question 2

How does RAG (Retrieval-Augmented Generation) work?

Accepted Answer

RAG works in two steps: retrieve, then generate. For example, a company chatbot using RAG would first search the company knowledge base for documents relevant to the user's question, then pass those documents to an LLM along with the question. The LLM generates an answer grounded in the actual company data.
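The retrieve-then-generate flow can be sketched in a few lines of Python. This is a minimal illustration, not a production design: the knowledge base contents, the function names `retrieve` and `build_prompt`, and the word-overlap scoring are all assumptions made for the example. A real system would typically use embedding-based search and would send the resulting prompt to an LLM, which is left out here.

```python
import re

# Hypothetical in-memory knowledge base; a real system would use a document store.
KNOWLEDGE_BASE = [
    "Refunds are processed within 14 days of receiving the returned item.",
    "Support is available Monday to Friday, 9am to 5pm.",
    "Shipping to Canada takes 5 to 7 business days.",
]


def tokenize(text):
    """Lowercase the text and return its set of alphanumeric words."""
    return set(re.findall(r"[a-z0-9]+", text.lower()))


def retrieve(question, documents, top_k=1):
    """Rank documents by word overlap with the question.

    Word overlap is a simple stand-in for the vector search a real RAG
    system would normally use.
    """
    q = tokenize(question)
    ranked = sorted(documents, key=lambda d: len(q & tokenize(d)), reverse=True)
    return ranked[:top_k]


def build_prompt(question, context_docs):
    """Combine retrieved documents and the user question into one prompt."""
    context = "\n".join(f"- {d}" for d in context_docs)
    return (
        "Answer using only the context below.\n"
        f"Context:\n{context}\n"
        f"Question: {question}\n"
    )


question = "How long do refunds take?"
docs = retrieve(question, KNOWLEDGE_BASE)
prompt = build_prompt(question, docs)
print(prompt)  # this prompt would then be sent to the LLM
```

The key point is that the model never needs to have seen the refund policy during training; it only has to read it from the prompt.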

Question 3

How does RAG compare to fine-tuning for adding custom knowledge?

Accepted Answer

RAG retrieves relevant information at query time from an external knowledge base, so updating what the system knows only means updating the knowledge base, with no model training required. Fine-tuning bakes knowledge into the model weights, so the model must be retrained when information changes. As a rule of thumb, RAG is the better fit for factual knowledge, while fine-tuning is the better fit for changing how the model behaves.

Question 4

What components are needed to build a RAG system?

Accepted Answer

A basic RAG system requires a document corpus, an embedding model to convert text into vectors, a vector database for storing and searching embeddings, and a language model to generate answers. Frameworks such as LangChain and LlamaIndex, as well as managed services from cloud providers, simplify RAG implementation.
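To show how these components fit together, here is a toy sketch using only the Python standard library. Everything in it is an assumption for illustration: `embed` is a word-count stand-in for a real embedding model, `InMemoryVectorStore` is a hypothetical class standing in for a vector database, and the language model step is omitted. It does not reflect the API of LangChain, LlamaIndex, or any other framework.

```python
import math
import re
from collections import Counter


def embed(text):
    """Toy embedding: a sparse word-count vector.

    A real system would call an embedding model here instead.
    """
    return Counter(re.findall(r"[a-z0-9]+", text.lower()))


def cosine(a, b):
    """Cosine similarity between two sparse word-count vectors."""
    dot = sum(a[t] * b[t] for t in a if t in b)
    norm_a = math.sqrt(sum(v * v for v in a.values()))
    norm_b = math.sqrt(sum(v * v for v in b.values()))
    return dot / (norm_a * norm_b) if norm_a and norm_b else 0.0


class InMemoryVectorStore:
    """Hypothetical stand-in for a vector database."""

    def __init__(self):
        self._items = []  # list of (vector, original text) pairs

    def add(self, text):
        self._items.append((embed(text), text))

    def search(self, query, top_k=2):
        """Return the top_k stored texts most similar to the query."""
        qv = embed(query)
        scored = sorted(self._items, key=lambda it: cosine(qv, it[0]), reverse=True)
        return [text for _, text in scored[:top_k]]


# Document corpus -> embeddings -> vector store.
store = InMemoryVectorStore()
for doc in [
    "The vector database stores embeddings for similarity search.",
    "The language model generates the final answer.",
    "The embedding model converts text into vectors.",
]:
    store.add(doc)

results = store.search("Which component stores embeddings?", top_k=1)
print(results)
```

Swapping the toy `embed` function for a real embedding model and the list-backed store for a real vector database changes the quality and scale of retrieval, but not the overall shape of the pipeline.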

Question 5

What are common problems with RAG implementations?

Accepted Answer

Common issues include retrieval of irrelevant documents caused by poor chunking or low embedding quality, the model ignoring retrieved context in favor of its training data, context window overflow when too many documents are retrieved, and the ongoing work of keeping the knowledge base in sync as source documents change.
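Two of these problems, chunking and context window limits, are often handled with overlapping chunks and a size budget on retrieved context. The sketch below is one possible approach under simplifying assumptions: the helper names `chunk_words` and `fit_to_budget` are hypothetical, and word counts are used as a rough proxy for model tokens, which a real system would count with the model's own tokenizer.

```python
def chunk_words(text, chunk_size=50, overlap=10):
    """Split text into chunks of up to chunk_size words.

    Consecutive chunks share `overlap` words, so a sentence that straddles
    a chunk boundary is less likely to be cut off from its context.
    """
    if overlap >= chunk_size:
        raise ValueError("overlap must be smaller than chunk_size")
    words = text.split()
    step = chunk_size - overlap
    chunks = []
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + chunk_size]))
        if start + chunk_size >= len(words):
            break  # the last chunk already reaches the end of the text
    return chunks


def fit_to_budget(ranked_chunks, max_words):
    """Keep the highest-ranked chunks until the word budget is used up.

    Assumes ranked_chunks is already sorted from most to least relevant.
    """
    selected, used = [], 0
    for chunk in ranked_chunks:
        n = len(chunk.split())
        if used + n > max_words:
            break
        selected.append(chunk)
        used += n
    return selected


text = " ".join(f"w{i}" for i in range(12))
chunks = chunk_words(text, chunk_size=5, overlap=2)
context = fit_to_budget(chunks, max_words=10)
print(chunks)
print(context)
```

Chunk size and overlap are tuning parameters: chunks that are too small lose context, while chunks that are too large dilute relevance and consume the context budget faster.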
