Notes
Notes on retrieval, evals, observability, and the engineering that starts once the demo is the easy part.
How to run LLM evals in production with gold sets, graders, trace checks, online signals, and release gates.