

Claude Opus 4 and Claude Sonnet 4 are capable of performing long-running tasks and can work continuously for several hours. Claude Opus 4 excels at coding and complex problem-solving, while Claude Sonnet 4 improves on Sonnet 3.7 and balances performance and efficiency.
In addition to releasing these new models, Anthropic also announced a beta for extended thinking with tool use, the ability to use tools in parallel, and general availability of Claude Code.
The Anthropic API also added four new capabilities: the code execution tool, MCP connector, Files API, and the ability to cache prompts for up to one hour.
OpenAI adds new tools and features to the Responses API
New additions include remote MCP server support, support for the latest image generation model, the ability to use the Code Interpreter tool, and the ability to use the file search tool in OpenAI’s reasoning models.
The company has also added background mode, which allows the model to execute complex reasoning tasks asynchronously; reasoning summaries; and the ability to reuse reasoning items across different API requests.
Mistral launches LLM for coding agents
Devstral is a lightweight open source model designed specifically for agentic coding tasks. According to the SWE-Bench Verified benchmark, Devstral outperforms GPT-4.1-mini and Claude 3.5 Haiku. Its small size allows it to run on a single RTX 4090 or a Mac with 32GB RAM, making it suitable for local, on-device use.
“While typical LLMs are excellent at atomic coding tasks such as writing standalone functions or code completion, they currently struggle to solve real-world software engineering problems. Real-world development requires contextualising code within a large codebase, identifying relationships between disparate components, and identifying subtle bugs in intricate functions. Devstral is designed to tackle this problem. Devstral is trained to solve real GitHub issues,” Mistral wrote in its announcement.
AI updates from Google I/O
Google I/O was full of updates on AI, including new models such as the new text model Gemini Diffusion and Gemma 3n, a multimodal model designed to run on phones, laptops, and tablets and capable of handling audio, text, image, and video.
Google also revealed two new Gemma model variants: MedGemma for health applications and SignGemma for translating sign language into spoken language text.
Gemini Code Assist for individuals and Gemini Code Assist for GitHub are both now generally available as well, and are powered by Gemini 2.5. The tool was first released as a preview back in February, and today’s GA release includes several new updates, including chat history and threads, the ability to specify rules to apply to every AI generation in the chat, custom commands, and the ability to review and accept code suggestions in parts, across files, or all together.
The company also announced a reimagined version of Colab; Stitch, a new tool that generates UI components from wireframes or text prompts; and new features in Firebase Studio, such as the ability to translate Figma designs into applications.
AI updates from Microsoft Build
A new coding agent has been added to GitHub Copilot that gets activated when a developer assigns it a GitHub issue or calls it via a prompt in VS Code. It can assist with a number of tasks, including adding features, fixing bugs, extending tests, refactoring code, and improving documentation. All of the agent’s pull requests require human approval before they run, GitHub confirmed.
Microsoft also announced Windows AI Foundry, a platform that supports the AI developer life cycle across training and inference. Developers will be able to manage and run open-source LLMs through Foundry Local or bring proprietary models and convert, fine-tune, and deploy them across clients and cloud.
Support for the Model Context Protocol (MCP) was also added across Microsoft’s platforms and services, including GitHub, Copilot Studio, Dynamics 365, Azure AI Foundry, Semantic Kernel, and Windows 11.
Microsoft also announced a new open source project called NLWeb to help developers create conversational AI interfaces for their websites using any model or data source they’d like. NLWeb endpoints also act as MCP servers, so developers will be able to easily make their content discoverable to AI agents if they’d like.
Shopify releases new developer tools
Shopify is launching a new unified developer platform that integrates the Dev Dashboard and CLI and offers AI-powered code generation. Developers can also now create “dev stores” where they can preview apps in test environments, a feature that was previously available only on Plus plans and is now available to all developers.
Other new features announced today include declarative custom data definitions, a unified Polaris UI toolkit, and Storefront MCP, which allows developers to build AI agents that can act as shopping assistants for stores.
HeyMarvin launches AI Moderated Interviewer
The AI Moderated Interviewer conducts moderated user interviews with potentially thousands of participants without a human facilitator. It can also analyze the interview responses to surface insights and trends.
“What makes it so powerful is that it enables free-flowing, qualitative, engaging conversations, but on demand and at scale,” said Prayag Narula, CEO and co-founder of HeyMarvin. “We’re talking hundreds, even thousands of people, something that was previously only seen at large scale using a small army of volunteers in moments like presidential elections. Now, even a small team can have that same in-depth conversation with their customers. It’s not just a better survey, and it’s not replacing traditional user interviews. It’s a whole new way of doing research that simply didn’t exist a few months ago.”
Zencoder announces Autonomous Zen Agents for CI/CD
These agents run directly in CI/CD pipelines and can be triggered by webhooks from issue trackers or code events. They can resolve issues, implement fixes, improve code quality, generate and run tests, and create documentation.
“The next evolution in AI-powered development isn’t just about coding faster – it’s about accelerating the entire software development lifecycle, where coding is just one step,” said Andrew Filev, CEO and founder of Zencoder. “By bringing autonomous agents into CI/CD pipelines, we’re enabling teams to eliminate routine work and accelerate hand-offs, maintaining momentum 24/7, while keeping humans in control of what ultimately ships.”
Read last week’s AI updates here: OpenAI Codex, AWS Transform for .NET, and more – May 16, 2025

