Notes
Notes on retrieval, evals, observability, and the engineering that starts once the demo is the easy part.
Query rewrite, decomposition, step-back prompting, HyDE, fusion, and when each one is worth the extra latency.
Chunking, titles, metadata, parent-child structure, reranking, and corpus QA for RAG systems.