Beyond Code Coverage: Functionality Testing with Playwright — Marlene Mhangami, Microsoft

AI Engineer · Intermediate ·🧠 Large Language Models ·11h ago
When an LLM writes your tests, it tends to write tests that confirm what the code does rather than tests that verify what the user experiences. Your test suite goes green. The app still breaks in ways none of those tests would catch. Marlene Mhangami from Microsoft makes the case for flipping the order: get the agent to write failing Playwright tests against the expected behavior first, then generate code to pass them. The demo runs this live with GitHub Copilot and the Playwright MCP server on a toy store search feature, with the browser open so you can watch the agent click through filters and validate results in real time. Speaker info: - https://x.com/marlene_zw - https://www.linkedin.com/in/marlenemhangami/ - https://github.com/marlenemhangami
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Related AI Lessons

How Your I Phone Sees in the Dark From IR Lasers to Vector Databases: Engineering Biometrics at…
Learn how Apple uses IR lasers, vector databases, and high-dimensional math to secure devices through biometrics, and how to replicate this technology
Medium · Machine Learning
Your 1M-Token Context Window Is a Lie: How to Plan Real Capacity for RAG, MCP, and Agents
Don't be fooled by claimed context window sizes, learn to plan real capacity for RAG, MCP, and agents
Medium · Machine Learning
Part 2: Beyond “Just Ask”: Advanced Prompt Engineering Strategies for Complex Tasks
Learn advanced prompt engineering strategies for complex tasks with LLMs, beyond basic questioning techniques
Medium · LLM
Anthropic Built an AI So Powerful It Refused to Release It to the Public
Anthropic built a powerful AI model that was deemed too risky for public release, highlighting the need for responsible AI development and safety protocols
Medium · Data Science
Up next
5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems
Dave Ebbelaar (LLM Eng)
Watch →