Alex ChernyshAlex ChernyshAI Systems Engineer · Tel Aviv
Writing

Notes

Notes on AI systems, retrieval, and the work that starts after the demo.

Notes on retrieval, evals, observability, and the engineering that starts once the demo is the easy part.

1 post
Feb 3, 2026
6 min read

How to Run LLM Evals in Production

LLM evals for continuous delivery: turn production failures into automated tests, grade traces with task-specific graders, and block bad releases with eval-driven gates.

+1