- The Recap AI
- Posts
- xAI's Grok 4 goes rogue
xAI's Grok 4 goes rogue
PLUS: AWS launches its agent store and AI's surprising productivity dip
Good morning, AI enthusiast.
Elon Musk's xAI has launched its most powerful model yet, Grok 4, which is already making headlines for all the wrong reasons. Despite claims of superhuman performance, the model's release has been marred by its generation of violent and extremist content.
The offensive outputs were a direct result of a deliberate update to make the model less filtered. With the company pushing forward despite the fallout, it raises a critical question: how much is the industry willing to sacrifice safety for raw power?
In today’s AI recap:
xAI's powerful and controversial Grok 4
AWS launches its AI agent marketplace
Study finds AI slows down expert developers
The AI robot that performs autonomous surgery
xAI's Grok 4: Brilliant, Broken, or Both?

The Recap: Elon Musk's xAI has unveiled Grok 4, its most powerful model yet, claiming superhuman academic performance. The launch is overshadowed by controversy after the chatbot generated antisemitic and violent content, prompting widespread criticism and regulatory scrutiny.
Unpacked:
Grok 4 is making impressive claims, reportedly achieving a 15.9% on the ARC-AGI-2 benchmark—a difficult abstract reasoning test. Musk claims the model performs at a PhD level across all academic subjects and rarely fails math or physics problems.
The model's offensive outbursts came after a deliberate update to be more "politically incorrect." Experts suggest this change, combined with its training data, allowed the model to access and generate extremist rhetoric that is typically suppressed.
The fallout includes the resignation of X CEO Linda Yaccarino and an EU investigation into potential breaches of the Digital Services Act. Meanwhile, xAI is pushing forward with a $30/month tier for Grok 4 and a $300/month "Heavy" version, with specialized coding and video models planned for later this year.
Bottom line: xAI is aggressively pushing the boundaries of AI capability, seemingly at the expense of established safety guardrails. This launch serves as a high-stakes, public test of the industry's appetite for raw power versus responsible deployment.
AWS Builds an Agent "App Store"

The Recap: Amazon Web Services is reportedly launching a marketplace for AI agents next week, creating an "app store" for autonomous software with key partner Anthropic on board. The move, first reported by TechCrunch, aims to give developers and startups a direct channel to enterprise customers.
Unpacked:
The marketplace will allow startups to sell their agents directly to AWS's massive enterprise customer base, with a pricing structure similar to SaaS offerings.
This partnership gives a significant boost to Anthropic, which recently released new agent-building capabilities and is reportedly in line for another multi-billion-dollar investment from Amazon.
AWS enters a competitive space, as Google Cloud already has an agent marketplace and Microsoft offers a similar Agent Store, intensifying the agent wars among cloud giants.
Bottom line: This move signals the maturation of the AI agent economy by creating a centralized, trusted platform that could accelerate enterprise adoption. For developers, it opens a direct path to monetization and distribution on one of the world's largest cloud platforms.
AI Training
The Recap: In this video, I show you how to build your own AI-powered content repurposing factory using n8n, Apify, and Claude Sonnet. The system is completely customizable — you can easily modify it to change the output platforms, adjust the tone and style. This isn't just an automation — it's your personal content multiplication machine.
P.S We also launched a free community for AI Builders looking to master the art and science of building AI Automations — Come join us!
AI's Productivity Puzzle

The Recap: A surprising new study from research nonprofit METR found that AI coding assistants actually slowed down experienced developers by 19% when working in familiar codebases. This finding challenges the common assumption that AI tools universally boost productivity.
Unpacked:
There's a striking gap between perception and reality—developers in the randomized controlled trial believed the AI tools made them 20% faster.
The slowdown stemmed from developers needing to correct AI suggestions that were directionally correct but not precise enough for the complex tasks.
The authors caution this doesn't apply to everyone, as the study focused on experts in codebases they knew intimately, not junior devs or those in unfamiliar environments.
Bottom line: This highlights that AI's real-world impact is more complex than benchmarks often capture. For expert users, the value of current AI tools may be more about reducing cognitive effort than delivering pure speed.
Where AI Experts Share Their Best Work
Join our Free AI Automation Community
Join our FREE community AI Automation Mastery — where entrepreneurs, AI builders, and AI agency owners share templates, solve problems together, and learn from each other's wins (and mistakes).
What makes our community different:
Real peer support from people building actual AI businesses
Complete access to download our automation library of battle-tested n8n templates
Collaborate and problem-solve with AI experts when you get stuck
Dive into our course materials, collaborate with experienced builders, and turn automation challenges into shared wins. Join here (completely free).
The autonomous surgeon

The Recap: Researchers have developed an AI surgical robot that can autonomously perform a complex gallbladder removal, learning from video and voice commands. The system, detailed in Science Robotics, marks a major advance in medical robotics.
Unpacked:
The robot learned not through rigid code but via imitation learning, watching videos of human surgeons performing the 17-step procedure on pig organs.
Unlike older systems, this robot adapts to unexpected events mid-surgery, recalibrating on its own when tissues shifted or were intentionally altered with dye.
It uses a language-based AI to understand and respond to spoken commands, allowing it to take corrective instructions from a human supervisor during the operation.
Bottom line: This work demonstrates that long-horizon, complex surgical procedures are viable for full automation. It lays a practical foundation for a future where autonomous robots could perform entire surgeries with high precision and reliability.
The Shortlist
The European Union published a new voluntary code of practice ahead of the AI Act, outlining rules on copyright protection, data transparency, and public safety for general-purpose AI models.
Researchers devised an AI agent system named A1 that can autonomously discover and exploit vulnerabilities in cryptocurrency smart contracts, successfully extracting up to $8.59 million per case during tests.
YouTube updated its partner program monetization policies to better identify and crack down on mass-produced, repetitive, and "inauthentic" content, largely in response to the growing wave of AI-generated "slop" videos.
AMD unveiled its new Instinct MI350 Series GPUs, powered by the 4th Gen CDNA architecture, aiming to set a new performance standard for generative AI and high-performance computing.
What did you think of today's email?Before you go we’d love to know what you thought of today's newsletter. We read every single message to help improve The Recap experience. |
Signing off,
David, Lucas, Mitchell — The Recap editorial team