Notes
Notes on retrieval, evals, observability, and the engineering that starts once the demo is the easy part.
A practical blueprint for legal QA, shaped in part by work around the Agentic RAG Legal Challenge: document identity, hybrid retrieval, structured answers, page-level grounding, telemetry, and evals.
HyDE, query rewrite, decomposition, step-back prompting, and fusion for RAG: which query transformation technique fixes which retrieval failure, and when the extra latency pays off.
How to reduce hallucinations in LLM systems with better retrieval, abstention, verification, evals, and guardrails.
Chunking, titles, metadata, parent-child structure, reranking, and corpus QA for RAG systems.