
Elon Musk's xAI launches Grok 4, claiming top spot among ... - Neowin
- by Neowin
- Jul 10, 2025
- 0 Comments
- 0 Likes Flag 0 Of 5

comments
Elon Musk's xAI today announced Grok 4, its latest flagship multimodal AI model. xAI claims that Grok 4 is a top-tier AI model with state-of-the-art performance in academic, mathematical, and reasoning benchmarks. The Grok 4 Heavy version, with multi-agent tools, delivers even more impressive gains in popular AI benchmarks.
Academic and Reasoning Benchmarks:
Humanity’s Last Exam (HLE): Grok 4 (no tools) achieved 25.4%, outperforming Google’s Gemini 2.5 Pro (21.6%) and OpenAI’s o3-high (21%). Grok 4 Heavy (multi-agent + tools) reached 44.4%, compared to Gemini 2.5 Pro with tools at 26.9%.
ARC-AGI-2: Grok 4 scored 16.2%, nearly double the next-highest model (Claude Opus 4).
MMLU-style evaluations: Achieved a 0.866 score (86.6%) on MMLU with an overall Intelligence Index of 73, leading the industry.
STEM & Coding Benchmarks:
Please first to comment
Related Post
Stay Connected
Tweets by elonmuskTo get the latest tweets please make sure you are logged in on X on this browser.
Sponsored
Popular Post
Sam Altman's OpenAI Takes On Elon Musk's Grok in AI Chess Tournament Final - Who Won?
28 ViewsAug 09 ,2025