Retrieval Strategies

The "which tool when" decision tree

This is the cheat sheet to memorize:

Symptom                                       Reach for
--------------------------------------------  ---------------------------------------
Exact-string / keyword misses                 Hybrid search (BM25 + vector)
Right answer in top-50 but not top-5          Reranking
Conversational, pronoun-heavy queries         Query rewriting
Specialized domain, query-doc vocabulary gap  HyDE
Ambiguous queries needing broader recall      Multi-query fan-out
Compound / chain-of-reasoning questions       Multi-hop iterative retrieval
Mixed semantic + structured query             Self-querying
Per-user / per-tenant data isolation          Metadata filters (always, in production)
Chunk-size tension (precision vs context)     Small-to-big retrieval
Relational questions                          Graph RAG
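
To make the hybrid search row concrete, here is a minimal sketch of merging a keyword ranking with a vector ranking. The cheat sheet does not say how the two result lists are combined; reciprocal rank fusion (RRF) is assumed here as one common choice. The document IDs and the two hit lists are hypothetical stand-ins for real retriever output.

```python
from collections import defaultdict


def reciprocal_rank_fusion(rankings, k=60):
    """Fuse several ranked lists of doc IDs into one ranking.

    Each document scores sum(1 / (k + rank)) over the lists it appears in,
    with rank starting at 1. k=60 is a commonly used default, not a tuned value.
    """
    scores = defaultdict(float)
    for ranking in rankings:
        for rank, doc_id in enumerate(ranking, start=1):
            scores[doc_id] += 1.0 / (k + rank)
    # Highest fused score first; ties broken by doc ID so output is deterministic.
    return sorted(scores, key=lambda d: (-scores[d], d))


# Hypothetical outputs of a keyword (BM25) retriever and a vector retriever.
bm25_hits = ["doc_sku_4471", "doc_returns", "doc_shipping"]
vector_hits = ["doc_returns", "doc_warranty", "doc_sku_4471"]

fused = reciprocal_rank_fusion([bm25_hits, vector_hits])
print(fused)
```

Documents that appear in both lists rise to the top, which is how hybrid search recovers exact-string matches that a vector-only retriever ranks low. RRF uses only ranks, so the BM25 and vector scores never need to be put on a common scale.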

The order to apply them in a real customer engagement

You don't bolt all 10 techniques on at once. The FDE move is to layer them in one at a time, in this order:

  1. Naive RAG + metadata filters: minimum viable security and isolation
  2. Add hybrid search: usually the biggest quality jump for the least effort
  3. Add reranking: usually the second-biggest quality jump
  4. Add query rewriting: fixes the conversational-UX gap
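
Step 1 above can be sketched as follows. This is an illustrative toy, not a production design: the `Chunk` type, the `tenant_id` field, and the word-overlap scorer (standing in for vector similarity) are all assumptions made for the example. The point it shows is that the metadata filter runs before any scoring.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Chunk:
    text: str
    tenant_id: str  # hypothetical metadata field used for isolation


def overlap_score(query: str, text: str) -> int:
    """Toy relevance score: number of shared lowercase words.

    Stands in for vector similarity in this sketch.
    """
    return len(set(query.lower().split()) & set(text.lower().split()))


def retrieve(query, chunks, tenant_id, top_k=3):
    # Filter first, so chunks from other tenants are never scored or returned.
    allowed = [c for c in chunks if c.tenant_id == tenant_id]
    ranked = sorted(allowed, key=lambda c: overlap_score(query, c.text), reverse=True)
    return ranked[:top_k]


chunks = [
    Chunk("refund policy for acme orders", "acme"),
    Chunk("refund policy for globex orders", "globex"),
    Chunk("acme shipping times", "acme"),
]

results = retrieve("refund policy", chunks, tenant_id="acme")
for chunk in results:
    print(chunk.tenant_id, "|", chunk.text)
```

Filtering before ranking, rather than after, means a relevance bug in a later stage cannot leak another tenant's data. Hybrid search, reranking, and query rewriting can then be added around `retrieve` without touching the isolation logic.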