Meet the new king of AI coding: Google’s Gemini 2.5 Pro I/O Edition dethrones Claude 3.7 Sonnet

by | May 6, 2025 | Technology

Join our daily and weekly newsletters for the latest updates and exclusive content on industry-leading AI coverage. Learn More

There’s a new king on the throne of AI coding models: Today, Google’s DeepMind AI research unit unveiled Gemini 2.5 Pro “I/O” edition, a new version of its hit Gemini 2.5 Pro multimodal large language model (LLM) released back in March that DeepMind CEO Demis Hassabis said on X is “the best coding model we’ve ever built!”

Indeed, the initial benchmarks released by the company indicate Google has taken the lead — for the first time since the generative AI race began in earnest with the late 2022 launch of ChatGPT — above all other models on at least one important coding benchmark.

The new version, labeled “gemini-2.5-pro-preview-05-06,” replaces the previous 03-25 release and is now available for indie developers in Google AI Studio and for enterprises in the Vertex AI cloud platform, as well as to individual users in the Gemini app. Google’s blog post said it also powers the Gemini mobile app’s Canvas and other features.

The new version powers feature development in apps like Gemini 95, where the model helps match visual styles across components automatically. It also enables workflows like converting YouTube videos into full-featured learning applications and crafting highly styled components—such as responsive video players or animated dictation UIs—with little to no manual CSS editing.

It’s a proprietary model, meaning enterprises will have to pay Google to use it and access it only through Google’s web services. However, it doesn’t alter pricing or rate limits; current users of Gemini 2.5 Pro will be automatically routed to the updated model which costs $1.25/$10 per million tokens in/out (for context lengths of 200,000 tokens) compared to Claude 3.7 Sonnet’s $3/$15.

The company frames this move — ahead of Google’s annual I/O (input/output) developer conference later this month in Mountain View and online, May 20-21 — as a response to strong community feedback around Gemini’s practical utility in real-world code generation and interface design.

Logan Kilpatrick, Senior Product Manager for Gemini API and Google AI Studio, confirmed in a developer blog post that the update also addresses key developer feedback around function calling, with improvements in error reduction and trigger reliability.

Top scores from human raters at generating web apps

On WebDev Arena Leaderboard, a third-party metric that ranks models by human preference based on their ability to generate visually appealing and functional web apps, Gemini 2.5 …

Article Attribution | Read More at Article Source