side10:10–10:30

Always Be Committing: The Scalable LLM Eval Loop

// ABOUT THIS SESSION

Shipping LLM features with confidence isn't a one-time act — it's a continuous loop of offline experiments, online validation, and annotation-driven eval tuning that evolves alongside your prompts, tools, and product surfaces. The goal is a quality system that forces decisions: clear signal from dev to prod, so experiments conclude rather than linger, and every deployment is backed by evidence, not instinct.

// SPEAKER

Carl Brenssell

AI and ML Product, Quality and Trust · Slack

AI-powered workplace communication for teams

Workplace AI