Claude Opus 4.7 is Here: Better at Coding, Vision, and Not Making Stuff Up

Claude Opus 4.7 is Here: Better at Coding, Vision, and Not Making Stuff Up

7 0 0

Anthropic just dropped Claude Opus 4.7 into general availability, and honestly, the early feedback makes it sound like a meaningful upgrade rather than another incremental bump.

Opus 4.7 improves on Opus 4.6 across the board—coding, vision, multi-step tasks, and consistency. The headline claim is that developers can now hand off their hardest coding work to it without hovering over its shoulder. That’s a big deal if you’ve ever watched an AI model go confidently off the rails halfway through a complex refactor.

What’s Actually New

Coding is the star here. Early testers report a 13% lift in resolution on a 93-task coding benchmark, with four tasks that neither Opus 4.6 nor Sonnet 4.6 could solve. That’s not just incremental—those are concrete wins on problems that were previously out of reach.

Vision also got a real upgrade. The model can see images at higher resolution now, which matters for things like reading chemical structures, interpreting technical diagrams, or just building better UIs. One tester from Solve Intelligence mentioned it’s helping them build tools for life sciences patent workflows—drafting, prosecution, infringement detection. That kind of detail work needs good vision.

There’s also a behavioral shift that I find interesting: multiple testers noted Opus 4.7 is more willing to push back when data is missing or instructions are ambiguous. Instead of fabricating a plausible-sounding answer, it’ll flag the gap. That’s the kind of honesty I wish more models had.

The Mythos Shadow

Anthropic’s been transparent that Opus 4.7 sits below Claude Mythos Preview in raw capability. Mythos is their most powerful model, but they’re keeping it on a short leash due to cybersecurity risks—specifically around Project Glasswing, which they announced last week.

Opus 4.7 is the first model where they’ve deliberately reduced certain cyber capabilities during training. They’ve also added safeguards that automatically detect and block prohibited or high-risk cybersecurity requests. If you’re a legitimate security professional—vulnerability research, pen testing, red-teaming—they’ve set up a Cyber Verification Program to get approved access.

This is a pragmatic approach. Rather than holding back an entire class of models, they’re testing safeguards on a capable-but-not-frontier model first. If it works, we’ll likely see Mythos get a wider release down the line.

Pricing and Availability

Pricing stays the same as Opus 4.6: $5 per million input tokens, $25 per million output tokens. That’s not cheap, but it’s competitive for this tier of capability. You can access it through all Claude products, the API, Amazon Bedrock, Google Cloud’s Vertex AI, and Microsoft Foundry. Developers use the model name claude-opus-4-7.

What Early Testers Are Saying

The testimonials in the announcement are worth reading. A few stood out to me:

  • Hex called it “the strongest model” they’ve evaluated, noting it correctly reports missing data instead of inventing plausible fallbacks. They specifically mentioned it resists “dissonant-data traps” that even Opus 4.6 falls for.
  • Replit said it was “an easy upgrade decision”—always a good sign when a platform that runs a lot of AI-assisted coding says something just works better.
  • Devin reported it “takes long-horizon autonomy to a new level,” working coherently for hours and pushing through hard problems rather than giving up.
  • Cognition highlighted improvements in multimodal understanding, from chemical structures to complex technical diagrams.

The Bottom Line

Opus 4.7 isn’t the frontier model Anthropic is holding back, but it’s a solid step forward. If you’re doing serious coding work—especially multi-step, long-running tasks—this is likely a meaningful upgrade over Opus 4.6. The vision improvements are welcome, and the behavioral changes around honesty and instruction-following are the kind of quality-of-life improvements that actually matter day-to-day.

I’m curious to see how the cybersecurity safeguards hold up in practice, and whether the Cyber Verification Program proves workable for legitimate researchers. If it does, we might see a faster path to broader Mythos access than I’d expected.

Comments (0)

Be the first to comment!