Stanford CS153 Frontier Systems | Amit Jain from Luma AI on Unified Intelligence Systems

Stanford Online · Advanced · 🤖 AI Agents & Automation · 2w ago
For more information about Stanford's online Artificial Intelligence programs, visit: https://stanford.io/ai

To follow along with the course schedule and syllabus, visit: https://cs153.stanford.edu/

In week three of CS153, the instructor hosts Amit Jain from Luma to discuss “Unified Intelligence Systems” as a follow-up to a prior lecture on visual intelligence. Jain recounts his work at Apple on LiDAR for projects including Titan and Vision Pro, and how early exploration of generative models and differentiable 3D led him to found Luma with an initial focus on large-scale 3D capture. Luma then shifted to generative video in 2023 to leverage the scale of internet video data, releasing the Dream Machine model in March 2024 and rapidly reaching millions of users, while building preference-based feedback loops and human annotation pipelines. Jain explains Luma’s multimodal AI factory (pretraining, post-training, deployment, and reinforcement learning), its security constraints for studio clients, and a move toward unified transformer architectures that jointly reason across text, images, video, and audio to enable end-to-end creative and professional workflows.

Guest speaker: Amit Jain is the CEO and co-founder of Luma AI, a research lab developing multimodal foundation models aimed at "unified intelligence." Under his leadership, Luma has scaled from a 3D-capture pioneer into a leader in generative video, raising a $900M Series C following the success of its Dream Machine and Ray video-reasoning models. As of 2026, he has steered the company into large-scale infrastructure projects, including Project Halo (a 2-gigawatt AI supercluster), to build the next generation of "world models" capable of simulating physical reality. He founded Luma in 2022 after working at Apple, where he was a Systems and Machine Learning Engineer. At Apple, he led development of the Passthrough feature for Apple Vision Pro and was instrumental in integrating the first LiDAR sensors into the iPhone, foundational wor
