NEW Grok 4.3 TESTED: Needs Multiple Iterations

Discover AI · Advanced ·📰 AI News & Updates ·2w ago
In this video I perform my causal reasoning test (an elevator test) on the newly released Grok 4.3 to evaluate its reasoning capabilities for unpublished complex reasoning and scientific tasks. Complete YouTube playlist of my test available here https://www.youtube.com/playlist?list=PLgy71-0-2-F0Rla8lu5ZldpYQUfXM_5bT 00:00 New Grok 4.3 01:33 Live test (arena.ai) 05:46 Grok 4.3 FAILS 07:47 2nd run Grok 4.3 12:44 First result by Grok 4.3 14:00 Validation run 15:43 Optimization run Grok 4.3 #grok #grokai #nextgenai #aitesting
Watch on YouTube ↗ (saves to browser)
Sign in to unlock AI tutor explanation · ⚡30

Related AI Lessons

Maybe AI Was Meant for Ordinary People After All
Discover how AI can be accessible to ordinary people, not just experts, and why it matters for democratizing technology
Medium · AI
Anthropic says it’s about to have its first profitable quarter
Anthropic expects to more than double revenue to $10.9 billion in Q2, marking its first profitable quarter, a significant milestone for the AI company
TechCrunch AI
AMD says its $4K Ryzen AI Halo workstation practically pays for itself
AMD claims its $4K Ryzen AI Halo workstation can pay for itself due to increased productivity and efficiency, learn how to evaluate ROI for AI workstations
The Register
SpaceX Is Spending $2.8 Billion to Buy Gas Turbines for Its AI Data Centers
SpaceX is investing $2.8 billion in gas turbines for its AI data centers, aiming to become a major cloud computing player while addressing carbon emission concerns
Wired AI

Chapters (7)

New Grok 4.3
1:33 Live test (arena.ai)
5:46 Grok 4.3 FAILS
7:47 2nd run Grok 4.3
12:44 First result by Grok 4.3
14:00 Validation run
15:43 Optimization run Grok 4.3
Up next
ChethanAIChronicles Hits 1K Subscribers 🎉 #shorts #ai
ChethanAIChronicles
Watch →