How do LLM Inference Optimizations Work? NVIDIA Coffee Chat

Brev · Advanced · 🧠 Large Language Models · 1y ago
How do LLM inference optimizations work? Nader (Director of Dev Tech at NVIDIA) had this question, so he sat down with the legend, Kyle Kranen, to learn more about them. NVIDIA is full of experts on anything AI-related. Is there something you want to learn about?