AI advance: Anthropic says Claude Sonnet 4.5 ‘best coding model’ in world

AI advance: Anthropic says Claude Sonnet 4.5 'best coding model' in world
UPI

Sept. 29 (UPI) — Anthropic on Monday unveiled its latest artificial intelligence model, called Claude Sonnet 4.5, which the tech company called “the best coding model in the world.”

That AI boast is based on industry benchmarks, including software engineer bench tests that measure an AI system’s software coding abilities, the company said in a news release. That includes following instructions more reliably, the company said.

It can operate 30 hours by itself.

“People are just noticing with this model, because it’s just smarter and more of a colleague, that it’s kind of fun to work with it when encountering problems and fixing them,” Jared Kaplan, Anthropic’s co-founder and chief science officer, told CNBC in an interview.

Anthropic’s coding competitors are OpenAI’s GitHub Copilot and Google’s Gemini.

“It’s the strongest model for building complex agents,” the company said. “It’s the best model at using computers. And it shows substantial gains in reasoning and math.

“Code is everywhere. It runs every application, spreadsheet, and software tool you use. Being able to use those tools and reason through hard problems is how modern work gets done.”

Claude Sonnet 4.5, which is available to all users, is better than other companies’ AI products on coding with computers and meeting practical business needs, including cybersecurity, finance and research, the company said.

OSWorld, a benchmark that tests AI models on real-world computer tasks, showed Sonnet 4.5 leads at 61.4%. Four months ago, Sonnet 4 held the lead at 42.2%.

“Experts in finance, law, medicine, and STEM found Sonnet 4.5 shows dramatically better domain-specific knowledge and reasoning compared to older models, including Opus 4.1,” the company said.

In August, Anthropic launched Claude Opus 4.1 and Claude Sonnet 4.

Anthropic was founded in 2021 by OpenAI researchers in 2021.

OpenAI began the generative artificial intelligence boom when it released it ChatGPT chatbot in 2022.

“It’s been actually really helpful to have a continuous running prompt that I use,” Dianne Penn, a head of product management at Anthropic, told The Verge. “That’s been really, really helpful. And I’ve seen the Sonnet 4.5 just do even better than in the past, on the quality and the depth of the searches and actually generating a spreadsheet with LinkedIn profiles so then I can email them.”

Mike Krieger, Anthropic’s chief product officer, said Claude Sonnet 4.5 will be the default for users.

Claude Sonnet 4.5 is smaller than Claude Opus 4.1 but smarter than it in “almost every single way,” Krieger said.

“We have found it, and our customers are finding it, very useful for real, actual work.”

COMMENTS

Please let us know if you're having issues with commenting.