Large-scale agentic quant research with Weights & Biases

Weights & Biases · Beginner ·🤖 AI Agents & Automation ·3w ago

Skills: Tool Use & Function Calling90%Agent Foundations80%

In this video, we walk through how to use Weights & Biases to improve reliability, reproducibility, and explainability in large-scale, agent-driven quantitative research. *We cover two key use cases:* *1. Alpha research with backtesting* Learn how to build, debug, and iterate on a multi-agent system for market analysis. A market event is routed through a classifier to specialized agents (macro, historical, sentiment, and quant), each making tool calls to gather live data. A synthesis agent then generates a probabilistic forecast. You’ll learn how to use W&B Weave to inspect agent traces, including tool calls, latency, cost, and outputs and drill into low-confidence predictions to identify root causes and improve agent behavior. *2. Strategy testing* See how to move beyond manual tuning with an automated optimization loop. You’ll see how to run multiple trials with different agent weight configurations and use a meta-optimizer to iteratively minimize metrics like Brier score. You’ll also learn how to use track convergence and performance using W&B Models. With the visualization tool in W&B Models, you can easily explore how weight configurations impact outcomes using parallel coordinate plots. W&B Weave and Models are fully integrated, giving you a complete audit trail for research, validation, and compliance. Weights & Biases enables end-to-end visibility across your quant research workflows, allowing you to move from hypothesis to back-testing and compliance-grade reproducibility while optimizing strategies at scale. *Chapters:* 00:00 Introduction 00:35 Use cases overview 00:50 Event-driven pipeline architecture 01:38 Weights & biases UI demo 02:18 Debugging model output 03:44 Fixing model & improved forecast 04:26 Evaluation at scale 05:07 Comparing model performance 06:23 Automated optimization loop 08:00 Insights from optimization

Watch on YouTube ↗ (saves to browser)

Sign in to unlock AI tutor explanation · ⚡30

More on: Tool Use & Function Calling

View skill →

Adding a Phone Gateway to a Virtual Agent

Administering an AlloyDB Database

Cloud Storage: Qwik Start - CLI/SDK

Cloud Composer: Copying BigQuery Tables Across Different Locations

Getting started with Firebase Cloud Firestore

Getting Started with Liquid to Customize the Looker User Experience

Related AI Lessons

Why Companies Will Stop Asking “Do You Know AI?” and Start Asking This Instead

Companies will shift from asking about AI knowledge to inquiring about ability to architect integrated systems using MCP, RAG, and Agents, highlighting the need for professionals to understand these technologies.

MCP, A2A, AND THE AGENT INTERNET

Learn about the Model Context Protocol (MCP) and A2A protocol and their impact on the agent internet and enterprise AI

Agentic AI Startup Landscape (2026): Where the Real Opportunities Are

Discover the opportunities in the agentic AI startup landscape as it shifts to an enterprise platform

Medium · Startup

I Built an AI System That Converts BRDs into Jira Tickets, Here’s Why

Learn how to automate converting Business Requirement Documents into Jira tickets using AI, streamlining project management and reducing manual effort

Chapters (10)

Introduction

0:35 Use cases overview

0:50 Event-driven pipeline architecture

1:38 Weights & biases UI demo

2:18 Debugging model output

3:44 Fixing model & improved forecast

4:26 Evaluation at scale

5:07 Comparing model performance

6:23 Automated optimization loop

8:00 Insights from optimization

Why Anthropic, Meta, and Tesla All Chose the Same Database | Aaron Katz, ClickHouse

Weights & Biases