The 7 Categories of RAG Clarifying Questions

Mental model: your questions span seven categories (users, data, retrieval needs, generation needs, operations, compliance, and constraints). Ask them in roughly that order: start with users (most important, most often skipped) and end with constraints (budget and timeline are a practical reality, but they should not drive the architecture).
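
One way to keep the ordering honest is to encode it as a checklist. The sketch below is only illustrative: the names `QuestionCategory`, `ask_order`, and `skipped` are hypothetical, and the numeric values simply mirror the order listed above.

```python
from enum import IntEnum


class QuestionCategory(IntEnum):
    """The seven categories, numbered in the suggested asking order."""
    USERS = 1
    DATA = 2
    RETRIEVAL = 3
    GENERATION = 4
    OPERATIONS = 5
    COMPLIANCE = 6
    CONSTRAINTS = 7


def ask_order(categories):
    """Return the given categories deduplicated and sorted into asking order."""
    return sorted(set(categories))


def skipped(covered):
    """Return the categories that have not been asked about yet, in order."""
    covered_set = set(covered)
    return [c for c in QuestionCategory if c not in covered_set]


if __name__ == "__main__":
    # Example: someone jumped straight to constraints and data.
    asked = [QuestionCategory.CONSTRAINTS, QuestionCategory.DATA]
    print([c.name for c in ask_order(asked)])
    print([c.name for c in skipped(asked)])
```

Running `skipped` on whatever you have covered so far lists the gaps in order, so a missing USERS category surfaces first.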


1. Users & Use Case (the "why")

This is the single most important category. If you don't know who's using it and for what, every other decision is a guess.

The user-and-stakes questions are the ones interviewers grade hardest because they are the clearest signal of consulting judgment.


2. Data (the corpus)

This is where most RAG systems actually live or die. The hardest part of production RAG is rarely the LLM; it's the data.