chapter eleven
11 Correction, planning, and reasoning
This chapter covers
- Why retrieval quality alone is not sufficient for RAG systems
- How control, reasoning order, and validation determine real-world RAG performance
- How and why RAG architectures are shifting from single-pass to agentic
- When to use corrective control (CRAG) versus planning-first retrieval (LevelRAG)
- How retrieval failures map to architectural decisions
In chapter 10, we explored RAPTOR and optimizing the index by organizing information into a semantic tree to address the pervasive issues of Lost in the Middle and Incorrect Specificity (FP6). RAPTOR RAG.showed how more sophisticated indexing strategies can meaningfully improve RAG quality.
Even with better indexing, standard Retrieve-then-Generate pipelines share a structural weakness: they are single-pass. The system bets on a linear, optimistic chain (vector search finds the right document, the document contains the answer, the LLM extracts it correctly) and ships whatever comes out the other end.