Notes
Notes on retrieval, evals, observability, and the engineering that starts once the demo is the easy part.
HyDE, query rewriting, decomposition, step-back prompting, and fusion for RAG: which query-transformation technique fixes which retrieval failure, and when the extra latency pays off.
Prompt design now means response formats, examples, tools, and eval loops, not incantations.