article

AI Dev Essentials #33: Gemini 3 Pro, Antigravity IDE, & GPT-5.1 Codex Max

Gemini 3 Pro & Antigravity IDE launch, OpenAI ships GPT-5.1-Codex-Max, and Cloudflare buys Replicate. Get the full breakdown in AI Dev Essentials #33.

Hey Everyone 👋,

John Lindquist here with the 33rd issue of AI Dev Essentials!

Google launched Gemini 3 Pro alongside their new Antigravity IDE, both representing significant advances in multimodal understanding and agentic development. OpenAI released GPT-5.1-Codex-Max with 24-hour autonomous task capabilities and ChatGPT for Teachers, while making group chats available in pilot regions. Claude models became available in Microsoft Foundry with structured outputs now in public beta. Cloudflare acquired Replicate to strengthen their AI infrastructure.

I've been spending the week diving deep into Anthropic's engineering guide on Code Execution with MCP and exploring their frontend design plugin to prepare for the upcoming workshop. I've also been putting Gemini 3 through its paces to determine if it earns a spot in my daily workflow.

🚨 Happening TOMORROW! 🚨 Last chance to sign up for the Claude Code Power User Workshop - European Timezone Editon 🌍: https://egghead.io/workshop/claude-code


🚀 Major Announcements

Google Launches Gemini 3 Pro with Record Benchmark Scores [Breaking]

Google DeepMind announced Gemini 3 Pro on November 18, 2025, achieving the highest benchmark scores for a publicly available model with 1501 on LMArena and introducing new capabilities in multimodal understanding and generative UI.

Model capabilities:

Gemini 3 Deep Think variant:

(Google Official Blog, Google Developers Blog, TechCrunch, CNBC)

This is really big. Gemini 3 has been absolutely crushing it designs and one-shotting incredible demos. I've had it build a 3d playable piano, recreate Wordle and other games, and create almost perfect designs of Slack and other UI-heavy apps. I've had much worse luck in longer programming tasks and I've heard evidence that it requires a lot of "hand holding" and specific instuctions, but maybe that's because it's still considered a preview? Either way, it's definitely a tool I will be using a lot for prototyping designs. Google is quickly integrating Gemini 3 into every product, even the main Google "AI Mode" which I highly recommend checking out. Things are really heating up.

Google Unveils Antigravity: Agent-First IDE Powered by Gemini 3 [New IDE]

Google launched Antigravity on November 18, 2025, a new agentic development platform featuring agent-first architecture with direct access to editor, terminal, and browser for autonomous execution.

Platform features:

Strategic positioning:

(TechCrunch, VentureBeat, The New Stack)

Apparently they forked Windsurf (they forgot to remove some references to "Cascade") and are attempting a Cursor clone of sorts. I've tried it a handful of times, but keep running into model limits and errors, so I don't really have a read on it. I've had much better luck using Gemini 3 in the Gemini CLI and Cursor. I'll check back in on AntiGravity once they sort out the launch hiccups.

OpenAI Releases GPT-5.1-Codex-Max with 24-Hour Task Capabilities [Model Update]

OpenAI announced GPT-5.1-Codex-Max on November 19, 2025, introducing the first model natively trained to operate across multiple context windows using compaction technology for autonomous work spanning 24+ hours.

Technical capabilities:

(OpenAI Blog, VentureBeat, The Decoder)

This has been out less than 24 hours, but it's apparently the best coding model available. I haven't had enough time to properly evaluate it, but I've heard nothing but good things.

GPT-5.1 Pro Spotted in the Wild [Model Update]

Reports are flooding in that OpenAI has begun rolling out GPT-5.1 Pro to Pro and Team users, with early access users calling it "easily the most capable and impressive model" to date.

Rollout details:

(TestingCatalog News, Matt Shumer Review)

GPT-5.1 Pro showed up in my account a few minutes ago. I packed up a bunch of files from my Script Kit project and dumped them into the model and came away pretty impressed. I imagine I'll be using this a lot for planning. I'm curious if/when it will hit APIs so we could script our own workflows.

Claude Models Available in Microsoft Foundry with $30B Azure Commitment [Model Update]

Anthropic announced on November 18, 2025, that Claude Sonnet 4.5, Haiku 4.5, and Opus 4.1 are now available in public preview in Microsoft Foundry, alongside a $30 billion Azure cloud capacity commitment and $15 billion combined investment from NVIDIA and Microsoft.

Partnership details:

(Anthropic Newsroom, Microsoft Azure Blog, NVIDIA Blog)

It's good to see Anthropic making moves. Claude Code and Sonnet 4.5 are still my go-to models for writing code and just kicking off random tasks without having to give it a ton of direction. I'm really hoping to see an Opus 4.5 release soon with Google and OpenAI making big moves.

OpenAI Launches ChatGPT for Teachers Free Through June 2027 [Education]

OpenAI announced ChatGPT for Teachers on November 19, 2025, providing free access to GPT-5.1 Auto with unlimited messages for verified U.S. K-12 educators through June 2027.

Program details:

(OpenAI Blog, CNBC, Axios, Engadget)

Smart play by OpenAI. Education is such an excellent use case for AI, especially when it comes to customizing lessons for specific classrooms and students. I just hope the teachers realize they still need to review the output for accuracy (especially anything Math related). I'm curious if they'll have some sort of teacher onboarding experience to help them get started.


🛠️ Developer Tooling Updates

Cloudflare Acquires Replicate to Strengthen AI Infrastructure [Acquisition]

Cloudflare announced on November 17, 2025, an agreement to acquire Replicate, an AI model deployment platform hosting 50,000+ production-ready models, with completion expected in two months.

Acquisition details:

(Cloudflare Press Release, Business Wire, Replicate Blog)

Seems like a nice pairing. Although I think Cloudflare lost a lot of the good will from this acquisition thanks to the major outage this week...

Gemini CLI 0.16.0 Ships Major UI Overhaul [CLI]

Google released Gemini CLI v0.16.0 on November 15-18, 2025, featuring a complete UI rendering overhaul eliminating visual noise and adding mouse support.

UI improvements:

(GitHub Releases, Google Developers Blog, Google Developers Blog: Gemini 3 in CLI)

I've heard rumors that the Gemini CLI is making big moves. I'm really excited to see what they're up to. I just feel like all the other CLI tools are so far behind Claude Code when it comes to customization and scriptability.

Claude Structured Outputs Launch in Public Beta [Beta]

Anthropic released Structured Outputs in public beta on November 14, 2025, for Claude Sonnet 4.5 and Opus 4.1, guaranteeing JSON schema compliance and eliminating parsing errors.

Technical features:

(Claude Developer Platform Blog, Anthropic Release Notes)

We've all had a lot of hacks and workarounds in place do to the lack of structured outputs that we're finally going to be able to retire. Woo!

NotebookLM Adds Image Support for Research Sources

Google announced on November 13, 2025, that NotebookLM now supports images as source material, including photos, screenshots, infographics, handwritten notes, and whiteboards.

New capabilities:

(Google Blog, TechCrunch, Digital Trends)

My biggest use case is screenshots from my phone. Such an awesome experinece to be able to capture anything and drop it into a long term memory bank.


🤖 AI Ecosystem Updates

OpenAI Pilots Group Chats in ChatGPT Across Four Regions

OpenAI launched a pilot for group chats in ChatGPT on November 13, 2025, enabling up to 20 participants to collaborate with ChatGPT in shared conversations.

Pilot details:

(OpenAI Blog, TechCrunch, Engadget, Axios)

Anthropic Partners with Rwanda and ALX for Chidi AI Learning Companion

Anthropic announced on November 18, 2025, a partnership with the Government of Rwanda and ALX Africa to deploy Chidi, a Socratic learning companion built on Claude, reaching 200,000+ students across Africa.

Partnership details:

(Anthropic Newsroom, APO Group, EdTech Innovation Hub)

Google Releases WeatherNext 2 with 8x Faster Forecasting

Google DeepMind introduced WeatherNext 2 on November 17, 2025, as the most advanced and efficient weather forecasting model using a new Functional Generative Network architecture.

Model capabilities:

(Google Blog, 9to5Google, Android Authority, TechRadar)


Sponsored by Membrane

Why coding agents struggle with integrations — and how Membrane fixes it

AI coding agents are getting frighteningly good—but there’s still one place they consistently fall down: building real product integrations. Not toy examples. Not one-off scripts. Actual customer-facing integrations that need correct OAuth flows, rate limits, pagination, webhooks, retries, schema drift handling, tests, observability—the whole messy universe of APIs across thousands of SaaS apps.

That’s where Membrane comes in. It’s the only AI built specifically for product integrations, with a purpose-made engine and a deep knowledge system about how APIs behave in the real world. Describe the integration you need, and Membrane generates a production-ready package—complete with correct API calls, field mappings, data structures, tests, and a runtime built for reliability. Teams that used to spend 5–6 weeks building an integration now ship in 5 minutes.

The best part: Membrane connects directly into your AI coding workflow. Whether you’re using Claude Code, Cursor, or any MCP-compatible agent, Membrane gives them the missing integration-infrastructure superpowers. Your agent can generate, adjust, test, and deploy integrations without leaving your IDE. You stay in your repo, your CI, your environment—just with far more leverage.

If you’ve ever asked a coding agent to “build an integration with X,” and watched it hallucinate endpoints or ignore pagination rules, this is the fix. Membrane gives agents the context they’ve always lacked: how APIs actually work, how integrations should run, and how to keep them healthy over time.

In short: coding agents write code. Membrane ships integrations.

Check it out here: getmembrane.com/agent/claude-code


⚡ Quick Updates

GPT-5.1 Models Available in Cursor

(Cursor)

Claude Code Frontend Design Plugin Released

(Claude Blog, GitHub)

Google Releases 54-Page AI Agent Production Guide

(Google Cloud)


✨ Workshop Spotlight

Claude Code Power User Workshop

European Timezone Editon 🌍 (🚨 TOMORROW! 🚨)

Date: Tomorrow, Friday November 21, 2025 Time: 2:00 PM - 7:00 PM (CET) Platform: Zoom

Pricing:

What You'll Learn:

Learn the essential skills to ship reliable AI-generated code with confidence. This hands-on workshop covers everything from foundational prompting to advanced automation using the Claude Code SDK and custom integrations.

Core Skills:

👉 Register: https://egghead.io/workshop/claude-code


Read this far? Share "AI Dev Essentials" with a friend! - https://egghead.io/newsletters/ai-dev-essentials

https://egghead.io