Anthropic's Claude 3.7 Sonnet is here and results are insane

“Claude Code was my ‘Feel the AGI moment.’ I’ve thrown bugs at this thing that no other models could fix, but Claude Code blasted through them," one user wrote in a Reddit thread. Additionally, Claude 3.7 Sonnet appears to excel in most categories, with its “extended thinking” mode boosting accuracy on tasks like math and science. Anthropic has started rolling out Claude 3.7 Sonnet, the company's most advanced model and the first hybrid reasoning model it has shipped. Early tests show that Claude 3.7 Sonnet is outperforming rivals, including OpenAI's ChatGPT models and China's DeepSeek. In a blog post, Anthropic noted that its newest model combines fast, straightforward answers with the ability to “think” step-by-step for complex tasks. Another user added: “3.7 just slapped out an entire project I had been working on for months—5000 lines of code, front-end, debugging example, all from scratch. These results show that Claude 3.7 Sonnet is significantly ahead of its competitors in terms of coding. For example, in a thread, Reddit users noted that the model delivered outstanding results when they used it to create apps or even games. "Software engineering (SWE-bench verified)" is a benchmarking standard to see how well an AI model does when asked to code a program.

This Cyber News was published on www.bleepingcomputer.com. Publication date: Tue, 25 Feb 2025 13:10:17 +0000

Anthropic's Claude 3.7 Sonnet is here and results are insane

Cyber News related to Anthropic's Claude 3.7 Sonnet is here and results are insane