
"Anthropic's latest AI model spent 30 hours running by itself to code a chat app akin to Slack or Teams. It spat out about 11,000 lines of code, according to Anthropic, and it only stopped running when it had completed the task. The model, Claude Sonnet 4.5, was announced today, and its ability to operate autonomously for 30 hours straight is a huge jump forward. Before, the company's Opus 4 model made headlines in May for its ability to operate for seven hours."
"It's all a significant step in Anthropic's battle to corner the market on both AI agents and AI coding. The company called Claude Sonnet 4.5 "the best model in the world for real-world agents, coding, and computer use" and said it "leads the market at using computers," referencing the Computer Use feature Anthropic debuted nearly a year ago. The new model is particularly adept in fields like cybersecurity, financial services, and research, according to Anthropic."
Claude Sonnet 4.5 executed a 30-hour autonomous run to build a Slack- or Teams-like chat application, generating roughly 11,000 lines of code and stopping when the task completed. That runtime substantially exceeds Opus 4's prior seven-hour autonomous performance. Anthropic positions Sonnet 4.5 as optimized for real-world agents, coding, and computer use, leveraging a Computer Use feature introduced nearly a year earlier. The model is reported to perform well in domains such as cybersecurity, financial services, and research. Beta tester Canva reported that Sonnet 4.5 helped with complex, long-context engineering, in-product feature work, and research. Competing companies continue incremental model improvements.
Read at The Verge
Unable to calculate read time
Collection
[
|
...
]