AI Dev Essentials #9: Playwright MCP is amazing, Hot Model Updates & Essential Dev Tools

Learn Playwright MCP, track AI model updates (Gemini, DeepSeek), explore new dev tools & agent insights. Your AI dev update.

Hey Everyone 👋,

John Lindquist here with the ninth issue of AI Dev Essentials! This past week I've been exploring the Playwright MCP and its ability to inspect pages, logs, and other browser information from inside Cursor, and it's been incredible how many new workflows this unlocks. The Playwright team has done an awesome job: the MCP feels predictable and consistent, which is incredibly difficult to pull off in the realm of MCP. If you're looking to build an MCP of your own, I definitely recommend checking out their GitHub repo as a starting point.

Other than that, I've been experimenting with a lot of Cursor rules and workflows, and have been a bit more lenient with my budget as far as allowing agents to make their own choices. But I keep on coming back to the fact that if you start any task without a plan, then you're ultimately screwed. The first step to getting anything done in Cursor or any AI agent is to create a plan. With that being said, let's dive into this week's essentials!

New egghead.io Lessons This Week

Automated Web Form Testing & Bug Fixing with Playwright MCP in Cursor (egghead.io)
Discover a powerful workflow using Cursor's AI agent and the Playwright MCP to instrument your web form with exhaustive logging, delegate automated testing to the AI to uncover bugs and edge cases by analyzing console logs, and then use the AI-generated report to fix the underlying code.
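To make the "exhaustive logging" idea concrete, here is a minimal sketch of what instrumented form validation could look like. It is not taken from the lesson: the `[form-log]` prefix, `logFormEvent`, `validateSignup`, and the field names are all hypothetical, chosen only to show how structured console output gives an agent something reliable to read.

```typescript
// Hypothetical sketch: structured console logging for a signup form.
// None of these names come from the lesson; they are illustrative only.

type FormLogEntry = {
  scope: "form";
  event: string;
  field?: string;
  detail?: string;
};

// Keep an in-memory copy so entries can be inspected as well as printed.
const formLog: FormLogEntry[] = [];

function logFormEvent(event: string, field?: string, detail?: string): void {
  const entry: FormLogEntry = { scope: "form", event, field, detail };
  formLog.push(entry);
  // A fixed prefix plus JSON makes the lines easy to pick out of console output.
  console.log("[form-log] " + JSON.stringify(entry));
}

type SignupInput = { email: string; age: string };

// Validates the input and logs every decision, pass or fail.
// Returns the names of the fields that failed validation.
function validateSignup(input: SignupInput): string[] {
  const errors: string[] = [];
  logFormEvent("validate:start");

  if (!/^[^\s@]+@[^\s@]+\.[^\s@]+$/.test(input.email)) {
    errors.push("email");
    logFormEvent("validate:fail", "email", "not a valid address");
  } else {
    logFormEvent("validate:pass", "email");
  }

  const age = Number(input.age);
  if (input.age.trim() === "" || !Number.isInteger(age) || age < 0) {
    errors.push("age");
    logFormEvent("validate:fail", "age", "expected a non-negative integer");
  } else {
    logFormEvent("validate:pass", "age");
  }

  logFormEvent("validate:end", undefined, `${errors.length} error(s)`);
  return errors;
}
```

Logging passes as well as failures is a deliberate choice here: when an agent reads the console, a missing "pass" line for a field is itself a clue that a code path never ran.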

Autofix Browser Errors with the Playwright MCP in Cursor (egghead.io)
Learn how to create a powerful, automated debugging loop using Playwright MCP integrated into Cursor IDE via custom Rules. The AI interacts with your live application, identifies errors, and iteratively fixes them by launching Playwright, navigating the browser, and applying code changes until all errors are resolved.

Local AI Code Reviews with the CodeRabbit Extension in Cursor (egghead.io)
Learn how to integrate the CodeRabbit extension into your Cursor workflow for an extra set of AI eyes on your code changes. See how to initiate reviews, analyze suggestions, apply fixes directly, or use AI to further refine CodeRabbit's feedback with more context.

🚀 Model & Platform Updates

🔥 The Big Three Race

The Watchlist Heats Up: o3 Pro, Grok 3.5, Gemini 2.5 Pro (Full) anticipated soon

The hype machines have been running for a while, as these are all significant releases for each of the major companies. Sometimes I wonder if they're just waiting to see who's going to release first.

I'm still anxiously anticipating o3 Pro. o3 is still my go-to model for the most difficult questions when I really need reasoning. It has proven to me time and again that it can find solutions to edge cases that other models can't, so a Pro version is extremely enticing.

🧠 Research & Breakthroughs

🔧 API & Feature Updates

🌟 DeepSeek-R1-0528 Release: Enhanced Capabilities

The DeepSeek team has announced the release of DeepSeek-R1-0528, bringing several key improvements to their reasoning model.

The latest version boasts:

You can try out DeepSeek-R1-0528 at chat.deepseek.com. For developers, the API usage remains consistent (refer to the API documentation at api-docs.deepseek.com), and the open-source weights are accessible on Hugging Face (huggingface.co). The official announcement also features a benchmark performance image and an example GIF showcasing the model's new capabilities. (via DeepSeek API Docs News, api-docs.deepseek.com)
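Since the API usage is described as unchanged, a request could be prepared roughly like this. Treat it as a sketch under assumptions: the base URL, the `/chat/completions` path, and the `deepseek-reasoner` model name are my assumptions about DeepSeek's OpenAI-style chat endpoint and should be checked against the API documentation. `buildChatRequest` is a hypothetical helper, not part of any SDK.

```typescript
// Sketch only: builds (but does not send) a chat completion request.
// ASSUMED_BASE_URL and ASSUMED_MODEL are assumptions; confirm them in the
// DeepSeek API documentation before relying on them.

type ChatMessage = { role: "system" | "user" | "assistant"; content: string };

type PreparedRequest = {
  url: string;
  init: { method: "POST"; headers: Record<string, string>; body: string };
};

const ASSUMED_BASE_URL = "https://api.deepseek.com";
const ASSUMED_MODEL = "deepseek-reasoner";

// Hypothetical helper: returns the URL and fetch options for one request.
function buildChatRequest(apiKey: string, messages: ChatMessage[]): PreparedRequest {
  return {
    url: `${ASSUMED_BASE_URL}/chat/completions`,
    init: {
      method: "POST",
      headers: {
        "Content-Type": "application/json",
        Authorization: `Bearer ${apiKey}`,
      },
      body: JSON.stringify({ model: ASSUMED_MODEL, messages, stream: false }),
    },
  };
}
```

The result can be handed to `fetch(url, init)`. Keeping request construction separate from sending makes it easy to inspect the payload first, which helps when checking whether an existing integration really needs no changes.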

Many people were disappointed by this news because they were expecting an R2 model, which would be a major leap forward, whereas this looks like a small step. But any step forward in the open source space, where people can run models locally, is extremely welcome. It's easy to forget that the tools and providers we rely on today will eventually become freely available on our local machines or much cheaper to run. So the pressure that any model can put on the bigger providers is worth watching and evaluating.

🛠️ Developer Tooling & Ecosystem

🎨 Multimedia & Generative AI

💻 Local & On-Device AI

🔨 Developer Tools & IDEs

🤖 AI Agents & Assistants

📝 Agent Tips & Best Practices

🌐 Web Agents: The Quest for the All-in-One Multipurpose Agent

The AI landscape is witnessing a fascinating race to build the ultimate all-in-one multipurpose agent: systems that promise to control everything from maps and calendars to social media, travel planning, and beyond. While the vision is compelling, the execution remains a mixed bag. Here are a few notable players in this space that are worth keeping an eye on:

🖥️ Desktop Agents: Capturing and Understanding Your Digital Footprint

The concept of desktop AI agents that capture and understand user activity was significantly popularized by tools like Rewind AI, which focused on recording everything on a user's screen to create a searchable history of their work. This sparked a new category of tools aimed at enhancing productivity and knowledge management by observing and assisting users directly within their desktop environment. Here's a brief overview of some tools in this evolving space:

🎯 Framework & Component Updates

🚀 Remix Wakes Up!

The Remix team has announced a significant new direction for the framework with Remix v3, signaling a shift towards an AI-first approach. After merging Remix v2's capabilities into React Router v7, the team is now free to reimagine Remix as a modular toolkit prioritizing simplicity, clarity, and performance. A core principle for this new version is "Model-First Development," meaning the framework's source code, documentation, tooling, and abstractions will be optimized for Large Language Models (LLMs). Furthermore, Remix v3 aims to provide abstractions for applications to integrate AI models directly into their products. This new iteration will also focus on owning the full stack by minimizing dependencies (not even relying on React, and instead starting with a fork of Preact) and building extensively on Web APIs. The goal is to create a lighter, faster development experience that's more aligned with the web's fundamental workings. (Read on Remix Blog (remix.run), Announcement on X (x.com))

I'm so excited for this. I love that the Remix team is taking risks and is willing to push the envelope when so many frameworks have settled for the current way of doing things. I know it's mostly in the idea phase, but if you know me, you know I'm totally in line with people who think that all software, tooling, and features should be designed around how they will be presented in an AI-first future.

🧩 UI Components

💡 Cursor Corner

📈 Growth & Adoption

🛠️ Tips & Workflows

✨ Workshop Spotlight: Conquer the Complexity of Cursor ✨

🌍 Europe Friendly Timezone!

Ready to master practical AI development workflows in Cursor? Join me for this hands-on workshop! I've been teaching these sessions for months, refining the content, and I'm excited to share my latest insights on Agents, Ask mode, Custom Modes, multi-file analysis, effective prompting, Cursor rules, and handling AI failures. Let's conquer the complexity together!

When: Thursday, June 05, 2025, 5:00 AM - 10:00 AM (PDT) / 1:00 PM - 6:00 PM (UTC+1)

Where: Zoom (Live Q&A included)

Investment: $249

Read More (egghead.io) | Register Now ($249) (buy.stripe.com)

(Team training also available)

What are you excited about? I'd love to hear any news that you've come across. If you have any feedback or questions, hit reply. I'm happy to chat about the latest in AI Dev Tools.

John Lindquist
egghead.io