Notes
Notes on retrieval, evals, observability, and the engineering that starts once the demo is the easy part.
Query rewrite, decomposition, step-back prompting, HyDE, fusion, and when each one is worth the extra latency.
Prompt design now means response formats, examples, tools, and eval loops, not incantations.