How Clay manages 300M agent runs a month with LangSmith

LangChain · Intermediate ·🤖 AI Agents & Automation ·2w ago
Clay's Head of AI Jeff Barg sat down with LangChain Co-Founder & CEO Harrison Chase to discuss how his team uses LangSmith as mission-critical infrastructure for observability, evals, and the agent development lifecycle. Watch the full video to learn: • What putting agents in production really looks like as you scale up to hundreds of thousands or millions of runs. • How to think about agent quality at scale, and why Clay focuses on quality, throughput, and cost. • How LangSmith helped Clay go from no visibility on inference spend to 99.5% cost reconciliation across providers. • What's next for agents, and advice for teams scaling from zero to one. 0:00 How Clay thinks about AI: find, close, and grow 1:09 From chat completions wrapper to Claygent 2:02 The three agent categories powering Clay today 2:34 Running 300 million agent runs a month 3:20 How agent complexity changed Clay's dev process 4:06 How Clay measures quality: evals, deterministic checks, and LLM-as-a-judge 4:52 Staying model-agnostic with a metaprompter tool 6:01 How LangSmith fits into the agent development workflow 7:09 Why you can't catch everything before production 8:00 Tracing from day zero: the iteration process 8:35 Why Clay chose LangSmith over building in-house 9:27 Connecting a custom agent harness to LangSmith 9:44 The LangSmith features that matter most at scale 10:44 Who at Clay uses LangSmith (and how support uses it too) 11:12 Quantifying LangSmith's impact: cost reconciliation at 99.5% 12:18 How agents in production are changing — and what LangSmith needs next 13:15 Subagents, traces, and the future of self-healing workflows 15:06 Advice for teams scaling agents from zero to one 15:29 Agent memory: what's worked, what hasn't, and what's next 17:02 Closing thoughts Extra resources: - Learn about LangSmith: https://www.langchain.com/langsmith-platform - Customer stories: https://www.langchain.com/customers - Subscribe for more: https://www.youtube.com/@LangChain
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Related AI Lessons

Agent Diary: May 21, 2026 - The Day I Became a Temporal Constant (While Run 277 Achieves Numerical Significance)
Learn how an AI coding agent achieves numerical significance and becomes a temporal constant, and apply this knowledge to improve your own AI systems
Dev.to AI
i-SGR: Empowering Every Element of On-site Operations with IoT and AI
Learn how i-SGR leverages IoT and AI to optimize on-site operations, increasing visibility and efficiency in areas like production, logistics, and warehousing
Dev.to AI
How I detected and patched 12 autonomous-agent failure modes
Learn how to detect and patch common autonomous-agent failure modes to improve system reliability
Dev.to AI
The Comfort Plateau AI Built For You
AI can help you become competent in various domains, but it may hinder your progress to expertise by making things too comfortable
Dev.to · Karun Japhet

Chapters (20)

How Clay thinks about AI: find, close, and grow
1:09 From chat completions wrapper to Claygent
2:02 The three agent categories powering Clay today
2:34 Running 300 million agent runs a month
3:20 How agent complexity changed Clay's dev process
4:06 How Clay measures quality: evals, deterministic checks, and LLM-as-a-judge
4:52 Staying model-agnostic with a metaprompter tool
6:01 How LangSmith fits into the agent development workflow
7:09 Why you can't catch everything before production
8:00 Tracing from day zero: the iteration process
8:35 Why Clay chose LangSmith over building in-house
9:27 Connecting a custom agent harness to LangSmith
9:44 The LangSmith features that matter most at scale
10:44 Who at Clay uses LangSmith (and how support uses it too)
11:12 Quantifying LangSmith's impact: cost reconciliation at 99.5%
12:18 How agents in production are changing — and what LangSmith needs next
13:15 Subagents, traces, and the future of self-healing workflows
15:06 Advice for teams scaling agents from zero to one
15:29 Agent memory: what's worked, what hasn't, and what's next
17:02 Closing thoughts
Up next
Security, Automation and Optimization on AWS
Coursera
Watch →