Chip design from the bottom up – Reiner Pope

Dwarkesh Patel · Advanced ·📐 ML Fundamentals ·2h ago
New blackboard lecture with Reiner Pope: how do chips actually work? It starts with basic logic gates and works up to why GPUs, TPUs, FPGAs, and the human brain each look the way they do.

Reiner is CEO of MatX, a new chip startup (full disclosure: I’m an angel investor). He was previously at Google, where he worked on software efficiency, compilers, and TPU architecture.

EPISODE LINKS
* Transcript: https://www.dwarkesh.com/p/reiner-pope-2

SPONSORS
* Crusoe was one of only five GPU clouds that made the gold tier in SemiAnalysis' most recent ClusterMAX report. Gold-tier providers like Crusoe delivered 5-15% lower TCO than silver-tier clouds, even with identical GPU pricing. This is because optimizations like early fault detection and rapid node replacement don't necessarily show up in the sticker price, but still matter a ton in the real world. Learn more at https://crusoe.ai/dwarkesh
* Cursor is where I do most of my work, from reading research papers to visualizing technical concepts to coding up internal tools for the podcast. Most recently, I used it to build two different review interfaces for my essay contest, one that anonymizes submissions for scoring and another that lets me see applicants' essays next to their resumes and websites. Whatever you're working on, you should try doing it in Cursor. Get started at https://cursor.com/dwarkesh
* Jane Street let me ask Ron Minsky and Dan Pontecorvo, two senior Jane Streeters, a bunch of questions about how they use AI. We discussed everything from the types of models they're training to how they think about the future of trading to why they're more bullish than ever on hiring technical talent. You can watch the full conversation and learn more about their open positions at https://janestreet.com/dwarkesh

TIMESTAMPS
00:00:00 – Building a multiply-accumulate from logic gates
00:16:20 – Muxes and the cost of data movement
00:25:59 – How systolic arrays work
00:39:00 – Clock cycles
