Claude Opus 4.7

https://www.anthropic.com/claude/opus

182•AlphaWeaver•2h ago

Comments

AlphaWeaver•1h ago

Might be better to update the URL to this, actually: https://www.anthropic.com/news/claude-opus-4-7

constantius•1h ago

Not related to this release, but is anyone aware of what's happening with DeepSeek? The usual cascade of synced releases has been missing this frontier-lab whale for a while now.

rvz•1h ago

> Not related to this release, but is anyone aware of what's happening with DeepSeek?

Given that no one is talking about DeepSeek, I assume it is coming this month.

They are still releasing research papers, and that is what really matters, not the .1 incremental releases of AI models that massage benchmarks or create hype.

cmrdporcupine•1h ago

There's been months of "DeepSeek v4 next week!" rumours and none have panned out.

They're either stuck/dead or they're sitting on something really fantastic that they only want to release once they've perfected it.

My realistic side thinks the former; my optimistic side hopes for the latter.

In the meantime, GLM 5.1 is actually really good.

bsaul•1h ago

I tried to find API pricing for GLM 5.1 but couldn't find any on the homepage. How are you using it?

cmrdporcupine•1h ago

Per token via DeepInfra, which hosts it as one of its models.

https://deepinfra.com/zai-org/GLM-5.1

hansmayer•1h ago

Ah, here we go again.

ChrisArchitect•1h ago

Some more discussion on the announcement post: https://www.anthropic.com/news/claude-opus-4-7 (https://news.ycombinator.com/item?id=47793411)

tomhow•1h ago

Comments moved thither. Thanks!

grandinquistor•1h ago

Quite a big improvement in coding benchmarks; it doesn’t seem like progress is plateauing as some people predicted.

jameson•1h ago

How should one compare benchmark results?

For example, SWE-bench Pro improved ~11% compared with Opus 4.6. Should one interpret that as 4.7 being able to solve more difficult problems, or as 11% fewer hallucinations?

vomayank•1h ago

Curious how people are evaluating real-world gains with this version.

Are you seeing meaningful improvements in reasoning reliability, or mostly incremental quality changes compared to previous releases?
