Anthropic just dropped Opus 4.6

artesia · 5 February 2026 23:35

Anthropic’s new Claude Opus 4.6 model offers major improvements over Opus 4.5, including better long-term autonomous task handling, a 1 million token context window, and enhanced code and enterprise capabilities. Key features like agent teams, adaptive reasoning, and improved safety make it a leading AI model for complex, large-scale applications.

artesia · 5 February 2026 23:56

Anthropic has released Claude Opus 4.6, marking a significant advancement over its predecessor, Opus 4.5. The new model demonstrates improved planning, sustains agentic tasks for longer periods, operates more reliably in large codebases, and offers enhanced code review and debugging capabilities. A key highlight is its ability to maintain agentic autonomy for extended durations, aligning with the broader industry trend toward more autonomous, long-running AI agents. This progress is evident in benchmarks showing Opus 4.6 outperforming previous models and competitors like GPT-5.2 and Gemini 3 Pro in tasks requiring sustained, autonomous operation.

One of the most notable features of Opus 4.6 is its 1 million token context window, currently in beta. This is a major leap from the previous standard of around 200,000 tokens and is matched only by Google’s Gemini. The expanded context window is particularly beneficial for handling large codebases and complex document analysis. However, the model’s true strength lies in its ability to maintain high-quality performance across this vast context, minimizing issues like context rot, where models struggle to extract relevant information from large inputs.

Opus 4.6 also introduces significant improvements in enterprise applications, such as financial analysis, research, and document creation. Benchmarks from Box, an early access partner, show substantial gains in complex reasoning tasks across industries like public sector, financial services, healthcare, and legal. For example, report drafting accuracy and due diligence performance saw notable increases, with some scores nearly doubling compared to Opus 4.5. These improvements suggest that Opus 4.6 can unlock real value from large sets of documents and complex workflows.

A major new feature is “agent teams,” which allows multiple Claude Code instances to work together in parallel, coordinated by a lead agent. Unlike traditional sub-agents, agent teams can communicate directly with each other and operate independently, making them ideal for tasks that benefit from parallel exploration, such as research, debugging, and cross-layer coordination. This approach, while more resource-intensive, offers greater flexibility and efficiency for complex, multi-faceted tasks.

Additional enhancements include adaptive thinking, which lets the model dynamically adjust its reasoning depth, and new effort controls for balancing intelligence, speed, and cost. Opus 4.6 is priced the same as 4.5, despite its expanded capabilities. The model also shows improved alignment and safety, with a slight reduction in misaligned behavior. Overall, Opus 4.6 represents a major step forward in AI autonomy, context handling, and enterprise utility, positioning it as one of the most advanced models currently available.