Elevated error rates on Opus 4.7

https://status.claude.com/incidents/8z7l5zcy0v3b
48•rob•1h ago

Comments

sinak•1h ago
Sonnet is giving an overloaded message as well.
kristianc•1h ago
They're having quite the day for devrel..
gopalv•1h ago
Sonnet is also throwing overloaded error.

My systems are hitting exponential delay retries, so this might not get better because retries overload things again.

> {'type': 'error', 'error': {'details': None, 'type': 'overloaded_error', 'message': 'Overloaded'}, 'request_id': 'req_ ...

I can see a weird spike in my cache hit-rate a few minutes before, so this might actually be some extra caching they have thrown in.
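
For the retry concern above (synchronized retries piling back onto an overloaded service), a minimal Python sketch of exponential backoff with full jitter, which spreads retries out randomly instead of letting clients retry in lockstep. The names `OverloadedError`, `backoff_delay`, and `call_with_retries` are hypothetical and not part of any provider's SDK; the default delays are assumptions to tune.

```python
import random
import time


class OverloadedError(Exception):
    """Hypothetical stand-in for a provider's 'overloaded_error' response."""


def backoff_delay(attempt, base=1.0, cap=60.0, rng=random.random):
    """Full-jitter delay: a random fraction of min(cap, base * 2**attempt)."""
    return rng() * min(cap, base * (2 ** attempt))


def call_with_retries(call, max_attempts=5, base=1.0, cap=60.0,
                      sleep=time.sleep, rng=random.random):
    """Run call(), retrying on OverloadedError with jittered backoff.

    Re-raises the last OverloadedError once max_attempts tries are used up.
    """
    for attempt in range(max_attempts):
        try:
            return call()
        except OverloadedError:
            if attempt == max_attempts - 1:
                raise
            sleep(backoff_delay(attempt, base, cap, rng))
```

The `sleep` and `rng` parameters are injectable only so the behaviour can be checked without real waiting; in normal use the defaults apply. Capping the delay and the attempt count keeps a long outage from turning into unbounded retry traffic.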

keithnz•1h ago
https://status.claude.com/
cyanydeez•1h ago
so, all those CEOs moving all those remaining engineers to be dependent on a cloud service to the extent that there's no local development capability are gonna apologize, right?
claaams•32m ago
in a year or two when AI tool costs go from 5M per year to 15M per year...even then, maybe not.
9cb14c1ec0•1h ago
Do they need a waiting list, or what?
lepuski•1h ago
I can't see why anyone still chooses Claude. Codex outperforms it in most respects, and its quotas are about ten times larger. A $100 Codex plan gets me through the whole week with 6–12 hours of coding per day.
elahieh•1h ago
One reason might be that Claude Opus 4.7 thinking benchmarks better on Arena Coding at https://arena.ai/leaderboard/text/coding ... hopefully that effectively assesses correctness. It doesn't account for reliability though.
nothinkjustai•1h ago
Because of marketing and vibes mostly.

Heck I prefer DeepSeek to both of those.

zmmmmm•58m ago
interestingly I had the same experience, and weirdly it's in part because it is clearly less intelligent. It's more of a mechanistic tool just doing what I ask (but still very smart and very competent about it) and less trying to win a nobel prize with each answer. Turns out I actually like that.
josephg•53m ago
Wow, I'm really surprised. I tried deepseek (their best model, through the official API). It's extremely cheap, but it's clearly not as good at programming as Opus 4.7. It seems nowhere near as good at making high level design choices. Deepseek also seems to get stuck in whack-a-mole fixing loops much more than opus. I stopped it at one point, and asked opus to solve the problem it was trying to solve and it saw the solution immediately.

I was running deepseek through claude's code agent harness. Maybe it works better through a different tool?

esafak•46m ago
You tried v4?
josephg•34m ago
Yeah, v4.

I would have been much more impressed with v4 about 6 months ago. But I've been spoiled by opus 4.7. Deepseek isn't at the same level.

codybontecou•17m ago
I tried to like it, but it eventually got stuck in a near-infinite loop trying to debug an extra curly bracket in an iOS app.

That and the lack of image-read support surprised me. I'm a big fan of feeding screenshots into my llm and that killed it for me.

zmmmmm•37m ago
I've given V4 Pro some curly things and I was impressed at how it figured them out. I agree high level design is not its forte. But it sat in a loop and dogmatically debugged a crazy dependency issue to come to the right answer over the course of 15 minutes which impressed me.
Thaxll•1h ago
I think it's impossible to say that codex x.y.z is better than Sonnet x.y.z, I used many "high" end models and they're just all good.
echelon•1h ago
Claude is significantly better at Rust in my experience, and Rust is my favorite language to emit from LLMs.

Opus 4.7 + Rust is a killer combo.

squirrellous•48m ago
Corporate reasons. AWS hasn't opened codex models to everyone yet.
SatvikBeri•43m ago
I've never actually run into the issues that people talk about online, like Claude suddenly getting dumb or running out of usage. So there's just not a lot of incentive for me to shop around. I've used Amp a bit, and it's quite nice, but a bit more expensive without the subsidized subscription.
dboreham•39m ago
Same here. Works every time. Never ran into usage limits either.
gardnr•22m ago
Are you using Opus? Sonnet remains as useful as it was, while Opus efficacy and token burn rate have soured over the last 4 months.
fny•13m ago
I'm using Opus on xhigh 10+ hours a day, and I've only reached 80% of weekly limits when doing massive ports or refactors. I haven't once hit hourly limits, and I've used Claude very, very aggressively. I guess it's a pain point for power users.
mbreese•11m ago
When do you use it the most? I’ve noticed that it most often starts to degrade during 10-5 US East coast time. Late at night, I have the least amount of issues, but without fail, if I’m trying to do anything complex during the day, Claude gets loopy.
raincole•2m ago
It has always been like this. We actually know that the model performance has been mostly steady[0], but you cannot beat the notion of "evil companies secretly serving us worse models." The meme value is too strong.

[0]: https://marginlab.ai/trackers/claude-code/

taspeotis•40m ago
Claude Max 20x gives me unlimited (for my level of usage) Opus 4.7 - how much money do I have to pay OpenAI for that?
arcanemachiner•10m ago
Based on the experience of people using the $20 Claude Pro subscription and exhausting their quotas in a matter of minutes, the answer to your question is probably "less". (I would guess that the $100 plan would do the trick.)
SeanAnderson•35m ago
You get a discount for paying for a full year on Teams and Enterprise can involve contractual obligations. It's a lot of effort to get buy-in to change providers and to shift an entire organization. The winds change frequently in this space and the pain needs to get to a certain level before it's worth rolling the dice.
yieldcrv•32m ago
because my shard isn’t erroring

I use Codex when Claude Code is down, and I only began using Claude when ChatGPT was down

yes codex is very fast, I go back to Claude for now

kylemaxwell•31m ago
Corporate policies and agreements. In large corporations, using external non-approved models with proprietary source code is a good way to have significant career issues.
hansvm•30m ago
Claude is the only AI coding tool I've found worth a damn. Without it I'd just do everything by hand save for a few bash scripts or whatever.
arcanemachiner•13m ago
Have you tried other harnesses, such as OpenCode?
hansvm•6m ago
Yeah, harness quality matters too, but the underlying model capabilities are night and day.
jjice•28m ago
I found GPT 5.5 is pretty solid, but I keep getting impressed by opus. It's tracked down some insane stuff while I look away during a meeting. 5.5 is way closer than previous OpenAI models to Anthropic IMO.

These things are so tricky because everyone has a seemingly conflicting experience. Part of the fun I guess!

CompoundEyes•11m ago
In my org the teams doing agent engineering at scale are all on Codex using gpt-5.5. By scale I mean fully agent authored code workflows with long running / multi hour plans.
etchalon•2m ago
I'd rather not give money to Sam Altman.
FergusArgyll•1h ago
I thought the deal with xai was supposed to solve this? Is this basically the adding lanes paradox?
josephg•50m ago
You're assuming the elevated error rates are due to the system being overloaded. We have no evidence this is actually the case. It's much more likely due to a simple misconfiguration or failing router or something.
imperio59•15m ago
I love Claude but I hate waiting a minute or two for any inference to start. I hope they can get their xAI capacity online ASAP and that it helps!
