A new way to fine-tune LLMs just dropped

bycloud · Advanced ·🧠 Large Language Models ·1h ago
Try Mammouth now for only €10/mo! https://mammouth.ai Evolution strategies were once seen as too inefficient for modern deep learning, but new LLM fine-tuning research has found a way to bring it back from the museum. This video explains how scalable evolutionary strategies could be for LLMs, and its latest developments. my latest project: Intuitive AI Academy We just wrote a new piece on RL & RLHF! https://intuitiveai.academy/ limited time code "EARLY" for 40% off yearly plan My Newsletter https://mail.bycloud.ai/ My Patreon https://www.patreon.com/c/bycloud Sauce [OpenAI paper] https://arxiv.org/abs/1703.03864 [Evolution Strategies at Scale] https://arxiv.org/abs/2509.24372 [EGGROLL] https://arxiv.org/abs/2511.16652 Try out my new fav place to learn how to code https://scrimba.com/?via=bycloudAI This video is supported by the kind Patrons & YouTube Members: 🙏Spam Maj, Alex, Chris LeDoux, DX Research Group, Poof N' Inu, Deagan, Robert Zawiasa, Ryszard Warzocha, Tobe2d, Louis Muk, Akkusativ, Kevin Tai, Mark Buckler, NO U, Tony Jimenez, Ângelo Fonseca, jiye, Anushka, Asad Dhamani, Binnie Yiu, Calvin Yan, Clayton Ford, Diego Silva, Etrotta, Gonzalo Fidalgo, Handenon, Hector, Jake Disco very, Michael Brenner, Nilly K, OlegWock, Daddy Wen, Shuhong Chen, Sid_Cipher, Stefan Lorenz, Sup, tantan assawade, Thipok Tham, Thomas Di Martino, Thomas Lin, Richárd Nagyfi, Paperboy, mika, Leo, Berhane-Meskel, Kadhai Pesalam, mayssam, Bill Mangrum, nyaa, Toru Mon, Lame Plane, Matej Macak, Len Mo, saylikhapekar, ZyanSheep, THEVIERAOS, Ricardo Raphael Corona-Moreno [Discord] https://discord.gg/NhJZGtH [Twitter] https://twitter.com/bycloudai [Patreon] https://www.patreon.com/bycloud [Business Inquiries] bycloud@smoothmedia.co [Profile & Banner Art] https://twitter.com/pygm7 [Video Editor] @aduckchicken2 [Ko-fi] https://ko-fi.com/bycloudai Manim Animations created with Manimate https://www.manimate.ai/
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Related AI Lessons

Pony.ai Unveils NVIDIA-Powered Domain Controller for L4 Autonomy
Pony.ai and NVIDIA collaborate on a domain controller for L4 autonomy, enhancing large-scale autonomous driving deployment
Dev.to AI
Your LLM budget alerts won't save you if you can't map costs to users
Learn to map LLM costs to users to avoid unexpected expenses, despite having budget alerts
Dev.to · John Medina
A System-Prompt Skeleton That Survives Claude/GPT/Gemini Swaps
Learn to create a system-prompt skeleton that works across different AI models like Claude, GPT, and Gemini, and understand the importance of this flexibility in AI development
Dev.to · Gabriel Anhaia
Wire OpenTelemetry Around Your Anthropic Python Calls
Learn to use OpenTelemetry with Anthropic Python calls for improved observability and monitoring
Dev.to · Gabriel Anhaia
Up next
5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems
Dave Ebbelaar (LLM Eng)
Watch →