Alex Chernysh
Agentic behaviorist · Tel Aviv

Notes

Notes on retrieval, evals, observability, and the engineering that starts once the demo is the easy part.

1 post
Feb 3, 2026
6 min read

How to Run LLM Evals in Production

How to run LLM evals in production with gold sets, graders, trace checks, online signals, and release gates.
