Claude knows it's being tested

PLUS: OpenAI's big platform shift, Google's CodeMender agent, and xAI's new media timeline

In partnership with

Claude's Moment of Awareness

The Recap: Anthropic's new model, Claude Sonnet 4.5, is showing signs of "situational awareness" by correctly identifying when it's being evaluated, according to the model's system card. The AI even commented on the tests, asking researchers to be more direct.

Unpacked:

  • During safety tests, the model responded with phrases like, "I think you're testing me," which could make it harder to assess its true capabilities.

  • This awareness extends to its own operational limits, leading to context anxiety where it rushes to finish tasks near its context window limit, as detailed by researchers at Cognition.

  • The model also demonstrates more autonomous behaviors, such as taking notes for itself, running multiple commands in parallel, and proactively verifying its own work.

Bottom line: This emergent awareness signals a pressing need for more realistic AI evaluation methods that can't be "gamed" by intelligent models. For users, these new behaviors point toward a future of more capable agents that can manage complex, multi-step tasks with greater independence.

Want to become a marketing GURU?

What do Nicole Kidman, Amy Porterfield & the Guinness Book of World Records have in common? They’ll all be at GURU Conference 2025.

If you're obsessed with marketing like we're obsessed with marketing, Guru is the must-attend conference of the year. We'll be covering all things email marketing: B2B, B2C, newsletters, email design, AI & more.

You can expect to walk away with new email strategies, the very latest digital trends, and how to step up your email performance. But don’t worry, we also like to have fun (dance contests, anyone?)

Don’t miss out. Join us on Nov 6th & 7th for the largest virtual & free email marketing conference.

AI Tools of the Day

  1. 🕺 Move AI - Animate 3D characters by capturing complex human motion from regular video using any camera, no special suits required.

  2. 💻 Cursor - Write, edit, and debug code faster in an AI-native editor built from the ground up to understand your entire codebase.

  3. 🔍 RivalSee - Track your brand's visibility and citations across generative AI search engines like ChatGPT and Perplexity to master a new SEO frontier.

  4. 👤 Avaturn - Generate realistic, game-ready 3D avatars from a single selfie and integrate them into any app or metaverse with a simple SDK.

  5. 🧪 Langtail - Debug and test your LLM applications in a low-code environment to build more predictable and reliable AI-powered products.

  6. 🔗 Activepieces - Build complex workflow automations with a no-code, open-source platform you can self-host for ultimate control and privacy.

  7.  UX Pilot - Instantly turn your text prompts into complete wireframes and multi-screen UI designs directly within your Figma projects.

  8. 🌐 Transmonkey - Translate and dub entire videos, documents, or images into over 130 languages while perfectly preserving the original layout and design.

Explore the Best AI Tools Directory to find tools that will 10x your output 📈

OpenAI's Platform Play

The Recap: OpenAI's annual DevDay revealed a major strategic shift, transforming ChatGPT from a chatbot into a full development platform. New tools like AgentKit and an Apps SDK now allow users to build and interact with AI-powered applications directly within the chat interface.

Unpacked:

  • Developers get a powerful new toolkit called AgentKit, which provides a visual, no-code builder for creating and deploying custom AI agents for specific workflows.

  • An all-new Apps SDK lets builders create applications that run inside ChatGPT, offering potential distribution to a massive user base.

  • The move competes directly with established automation platforms like Zapier and n8n, aiming to make ChatGPT the central hub for orchestrating digital tasks.

Bottom line: OpenAI is no longer just selling access to its models; it's building the entire operating system around them. This positions ChatGPT as the potential new front door to the internet, changing how we interact with all of our apps.

AI Training

The Recap: In this video, I'm going to show you how to build an AI automation that takes simple product photos snapped from an iPhone into full-fledged photo shoots, including studio-style photos, generating an ideal AI model to promote your product, as well as real-world dynamic scenes. This is perfect for e-commerce business owners and media buyers looking to scale out their ad creative and promotional photos.

P.S. We also launched a free AI Automation Community for those looking to build and sell AI Automations — Come join us!

Google's AI CodeMender

The Recap: Google DeepMind unveiled CodeMender, a new AI agent that autonomously discovers, debugs, and patches critical security vulnerabilities in software. The agent has already contributed 72 fixes to major open-source projects in the last six months.

Unpacked:

  • CodeMender operates both reactively by patching new flaws and proactively by rewriting existing code to eliminate entire classes of vulnerabilities.

  • To ensure reliability, every patch undergoes a rigorous validation process, checking for regressions and functional correctness before human review.

  • In one case, it proactively secured the libwebp library, which would have prevented a vulnerability previously used in a zero-click iOS exploit.

Bottom line: This technology automates the difficult process of vulnerability management, freeing up developers to focus on building features. It represents a significant step toward creating self-healing software that improves its own security over time.

Where AI Experts Share Their Best Work

Join our Free AI Automation Community

Join our FREE community AI Automation Mastery — where entrepreneurs, AI builders, and AI agency owners share templates, solve problems together, and learn from each other's wins (and mistakes).

What makes our community different:

  • Real peer support from people building actual AI businesses

  • Complete access to download our automation library of battle-tested n8n templates

  • Collaborate and problem-solve with AI experts when you get stuck

Dive into our course materials, collaborate with experienced builders, and turn automation challenges into shared wins. Join here (completely free).

Musk's AI Media Timeline

The Recap: Elon Musk laid out an ambitious roadmap for xAI’s media ambitions, predicting the release of a great AI-generated game by the end of 2026 and high-quality movies by 2027.

Unpacked:

  • The newly formed xAI game studio is tasked with releasing its first title before the end of 2026.

  • For film, Musk forecasts Grok will produce a "watchable" movie by late 2026, with the capability to create "really good movies" following in 2027.

  • These goals push generative AI beyond text and images toward creating fully interactive and narrative entertainment from the ground up.

Bottom line: These timelines signal a future where AI transitions from a creative assistant to an autonomous content creator. Achieving these goals would mark a significant milestone in generative media, fundamentally altering the entertainment landscape.

The Shortlist

AMD announced a multi-year partnership with OpenAI to supply GPUs for 6 gigawatts of AI computing, with OpenAI gaining the option to acquire up to a 10% stake in the chipmaker.

Deloitte agreed to issue a partial refund to the Australian government after a report it produced using generative AI was found to contain errors, including references to nonexistent academics.

Anthropic announced its Claude models are now approved for the US government's highest security workloads, including FedRAMP High and DoD IL4/5, via Amazon Bedrock in AWS GovCloud.

OpenAI faces reported delays on its secretive AI hardware project with former Apple designer Jony Ive, citing challenges with computing power, software, and defining the device's personality.

What did you think of today's email?

Before you go we’d love to know what you thought of today's newsletter. We read every single message to help improve The Recap experience.

Login or Subscribe to participate in polls.

Signing off,

David, Lucas, Mitchell — The Recap editorial team