Building and Evaluating Data Agents

External: Coursera Courses ↗ · Coursera

Open Course on External: Coursera

Free to audit · Opens on External: Coursera

Building and Evaluating Data Agents

Coursera · Intermediate ·🤖 AI Agents & Automation ·2mo ago
Learn how to build and evaluate a data agent in “Building and Evaluating Data Agents,” a course created in collaboration with Snowflake, and taught by Anupam Datta, AI Research Lead, and Josha Reini, Developer Advocate at Snowflake. You’ll design a data agent that connects to data sources (databases, files) and performs web searches to respond to users’ queries. The agent will consist of sub-agents, each specialized in connecting to a particular data source, and other sub-agents that summarize or visualize the results. To answer a particular query, the agent will use a planner that identifies which sub-agents to call and in what order. You’ll add observability to the agent’s workflow and evaluate the quality of its output. Using an LLM-as-a-judge approach, you’ll assess whether the final answer is relevant to the user’s query and grounded in the collected data. You’ll also evaluate the process by determining whether the agent’s goal, plan, and actions (GPA) are all aligned. Finally, you’ll apply inline evaluations to evaluate the agent’s performance during runtime. At every retrieval step, you’ll evaluate if the collected data is relevant to the user’s query. The agent will use this evaluation score to decide if it needs to adjust its plan. What you’ll do, in detail: Understand what data agents are and how they can be trustworthy when their goal, plan, and actions are properly aligned. Build a data agent that plans, performs web searches ,and visualizes or summarizes the results, using a multi-agent workflow implemented in LangGraph. Expand the agent’s capabilities by adding a Cortex sub-agent that retrieves information from structured and unstructured data stored in Snowflake. Add tracing to the agent’s workflow to log the steps it takes to answer a query. Evaluate the context relevance of the retrieved results, the groundedness of the final answer, and its relevance to the user’s query. Measure the alignment of the agent’s goal, plan, and actions (GPA) b
Watch on External: Coursera ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Related AI Lessons

RecursiveMAS Playground: Browser-Native Implementation of Recursive Multi-Agent Systems
Learn about RecursiveMAS Playground, a browser-native implementation of recursive multi-agent systems, and how to apply it in practice
Dev.to · vishalmysore
You Don't Own Your AI Agent. And Even If You Did, Would You Trust It?
Learn why owning an AI agent doesn't necessarily mean you can trust it, and what this means for AI development and deployment
Dev.to · Agent-Risk
If AI Replaces Juniors, Who Replaces Seniors? The Hidden Cost of Misusing AI in Tech
Misusing AI in tech can lead to hidden costs, including the loss of senior talent, and it's crucial to consider the long-term consequences of relying on AI for junior tasks
Dev.to · Nino Ross Rodriguez
Teaching AI is the fastest way to sharpen your own thinking
Teaching AI helps sharpen your own thinking by forcing you to crystallize complex concepts
Dev.to · Alex Tong
Up next
Codex Can Now "Copy" Your Tasks
Matt Wolfe
Watch →