Benchmarking Llama 3.1 405B on 8 x AMD MI300X using vLLM and KubeAI

Samos123 · Intermediate ·🧠 Large Language Models ·1y ago
Blog: https://substratus.ai/blog/benchmarking-llama-3.1-405b-amd-mi300x Software used in video: vLLM for deploying LLMs: https://github.com/vllm-project/vllm KubeAI for deploying vLLM on K8s: https://github.com/substratusai/kubeai
Watch on YouTube ↗ (saves to browser)
5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems
Next Up
5 Levels of AI Agents - From Simple LLM Calls to Multi-Agent Systems
Dave Ebbelaar (LLM Eng)