frontpage.

AI Darwin Awards Launch

https://www.theregister.com/2025/09/09/ai_darwin_awards/
1•jjgreen•1m ago•0 comments

Ask HN: E-ink devices with real AI/LLM integration?

1•arbayi•1m ago•0 comments

Treat Your Career Like a Product

https://www.leadinginproduct.com/p/treat-your-career-like-a-product
1•benkan•2m ago•0 comments

I'm going to buy bybit you add me to the account verified

1•Fahadidris•3m ago•0 comments

QuantumScape and PowerCo Debut Solid-State Batteries in Ducati Motorcycle

https://www.quantumscape.com/quantumscape-and-powerco-debut-solid-state-batteries-in-ducati-motor...
1•taubek•3m ago•0 comments

Show HN: I built a tool to create ad variations to fight ad fatigue

https://vibecarousels.com/
1•Lindadao•4m ago•0 comments

Why monsoon rains have been so deadly in India this year

https://www.bbc.com/news/articles/c9wdr08wq2zo
1•Brajeshwar•6m ago•0 comments

Aurora season and the Russell-McPherron effect

https://earthsky.org/sun/aurora-season-auroras-equinox-connection/
1•NKosmatos•8m ago•1 comments

Politics, Payrolls, and Policy: Markets Brace for a Volatile September

https://nxtribes.com/blog/politics-payrolls-and-policy-markets-brace-for-a-volatile-september
1•douglas5•9m ago•1 comments

US tech companies enabled the surveillance and detention in China

https://apnews.com/article/chinese-surveillance-silicon-valley-uyghurs-tech-xinjiang-8e000601dadb...
1•c420•11m ago•1 comments

Wikimedia will sunset separate mobile domains

https://www.mediawiki.org/wiki/Requests_for_comment/Mobile_domain_sunsetting/2025_Announcement
1•Recursing•12m ago•0 comments

Show HN: Smile – an open source language for structuring prompts

https://github.com/DrThomasAger/smile
1•DrThomasAger•13m ago•0 comments

Anthropic Judge Rejects $1.5B AI Copyright Settlement

https://news.bloomberglaw.com/ip-law/anthropic-judge-blasts-copyright-pact-as-nowhere-close-to-done
2•nobody9999•13m ago•1 comments

Built an AI news agent that stops information overload

https://reckoning.dev/posts/news-agent-reactive-intelligence
2•sadanand4singh•14m ago•1 comments

Indexing Jsonb in PostgreSQL

https://www.crunchydata.com/blog/indexing-jsonb-in-postgres
1•fanf2•18m ago•0 comments

Seedream 4.0

https://seed.bytedance.com/en/seedream4_0
2•BoorishBears•20m ago•0 comments

Dev3000 – The browser for AI-based development by Vercel

https://d3k.vercel.sh/
1•mustaphah•21m ago•0 comments

RSA Signhash exception since last W11 update

https://github.com/microsoft/SymCrypt/issues/52
1•osivertsson•21m ago•0 comments

Disaggregation: A New Architecture for Cloud Databases

http://muratbuffalo.blogspot.com/2025/09/disaggregation-new-architecture-for.html
1•furkansahin•22m ago•0 comments

Show HN: CuckooTimer – Cuckoo Clock Productivity Timer

https://cuckootimer.com/
2•bribri•26m ago•0 comments

One mother for two species via obligate cross-species cloning in ants

https://www.nature.com/articles/s41586-025-09425-w
2•mighty_plant•26m ago•0 comments

Recreating the Apollo AI adoption rate chart with GPT-5, Python and Pyodide

https://simonwillison.net/2025/Sep/9/apollo-ai-adoption/
2•simonw•27m ago•0 comments

Chinese robotics firm Unitree eyeing $7B IPO valuation

https://www.reuters.com/business/autos-transportation/chinese-robotics-firm-unitree-eyeing-7-bill...
3•defrost•29m ago•0 comments

Caterham Seven 160 achieves 57.6mpg (24.5kml)

https://www.greencarguide.co.uk/2014/01/caterham-seven-160-57-mpg/
3•palmfacehn•29m ago•3 comments

Outcome-Based Exploration for LLM Reasoning

https://arxiv.org/abs/2509.06941
1•badmonster•30m ago•0 comments

Cyborgtest – the chill way to evolve your QA game from manual to automated

https://github.com/CyborgTests/playwright-manual-step-automation
10•epaminond•31m ago•1 comments

TailGuard: A way to connect your home WireGuard router into Tailscale via Docker

https://github.com/juhovh/tailguard
5•daduke•35m ago•2 comments

A Novel Technique for SQL Injection in PDO's Prepared Statements

https://slcyber.io/assetnote-security-research-center/a-novel-technique-for-sql-injection-in-pdos...
1•Bogdanp•41m ago•0 comments

iPhone Isn't Listening to You. But the Truth Is Worse

https://www.cnet.com/tech/services-and-software/features/no-your-iphone-isnt-listening-to-you-her...
2•koolhead17•45m ago•0 comments

Show HN: Quadrant – OKRs with Focus for small teams

https://mygoals.io/
1•davkh•45m ago•0 comments

Incident Report for Anthropic

https://status.anthropic.com/incidents/72f99lh1cj2c
66•bashtoni•7h ago

Comments

paradite•4h ago
Announcement on X:

https://x.com/claudeai/status/1965208247302029728

viraptor•3h ago
https://xcancel.com/claudeai/status/1965208247302029728
jondwillis•3h ago
It’s like a thread of rabid animals replying. So much unbridled entitlement and frustration without any hope of recourse.

I’d almost say it’s hard to understand how people don’t realize that Grok has all of the same power and incentive structures behind it as Anthropic’s cloud models.

watwut•3h ago
Grok has Musk behind it, and that has ... much worse implications than the backgrounds of the other companies. Not that those would be saints, but they are not openly like Musk.
metadat•3h ago
Do they credit your account if you were impacted? Or is it just "sorry 'bout 'dat month of trash"?

Unfortunate timing, as I am rooting for Anthropic as the underdog, but I feel compelled to use whatever works best. Since mid-August I've demoted Claude to only putting out fires on UIs and am getting amazing results with GPT-5 for everything else. Given the nonstop capacity warnings on the Codex CLI, I might not be the only one.

behnamoh•3h ago
> Unfortunate timing, as I am rooting for Anthropic as the underdog...

Give me a break... Anthropic has never been the underdog. Their CEO is one of the most hypocritical people in the field. In the name of "safety" and "ethics", they have gotten away with not releasing even a single open-weight (or open-source) model, while calling out OpenAI as the "bad guys" and constantly trying to sabotage pro-competition and pro-consumer AI laws in the US.

watwut•3h ago
Well OpenAI and Sam Altman are "bad guys". At least that part is true. It is just that Anthropic is not better.
behnamoh•3h ago
> Well OpenAI and Sam Altman are "bad guys".

Define "bad". Sama is a businessman and at least doesn't pretend to be a saint like Amodei does.

testfrequency•2h ago
If you know sama you’d know damn well he’s not nice, but believe whatever you want of course
pdksam•2h ago
You don't need to be nice to be a good businessman
manojlds•48m ago
Oh, as in being upfront about being a for-profit organization from the get-go?
cma•3h ago
They were also the first to work with the NSA, years before their policy change to support military uses, according to Dean Ball, former White House AI policy advisor, in an interview with Nathan Labenz.
nextworddev•2h ago
TIL
rfoo•1h ago
You are absolutely right! But China bad Dario good Anthropic the only firm caring about AI safety /s
paulddraper•3h ago
Rooting for the underdog is a moving target.
andy_ppp•3h ago
So they aren’t saying what the bug was that caused this issue? I would love a more detailed explanation; what could possibly cause the model degradation, apart from potentially routing queries to the wrong model?
qsort•3h ago
If I had to guess, something related to floating point operations. FP addition and multiplication are commutative but not associative, so changing the order of operations can change the results.
allisdust•3h ago
Opus has been utter garbage for the last month or so.
Aeolun•2h ago
I’ve definitely been more annoyed with it recently. I never used to have to curse at it for taking the lazy way out.

"Oh, let me just fix that!" (comments out the test)

speedgoose•2h ago
When it happens, I stop it and tell it that we aren’t working for one of the IT consulting companies I hate, and a "you are absolutely right" later we are back on track.
CuriouslyC•2h ago
OH MY GOD YES! I actually had it inject synthetic data into my experiments! I had to go back through all my work and re-validate so much to make sure I found all instances of it (it happened in a few different projects).

I now have a system of automated tripwires on all experimental scripts that notifies me and terminates the experiment when any sort of statistical irregularity is detected.

ares623•3h ago
One man’s bug is another man’s load balancing experiment.
slacktivism123•3h ago
>Importantly, we never intentionally degrade model quality as a result of demand or other factors, and the issues mentioned above stem from unrelated bugs.

Sure. I give it a few hours until the prolific promoters start to parrot this apologia.

Don't forget: the black-box nature of these hosted services means there's no way to audit for changes to quantization or model re-routing, nor any way to tell what you're actually getting during these "demand" periods.

mccoyb•3h ago
Here’s a report: Claude Code (the software) is getting worse by the day.

Removing the displayed token consumption rates (which made it possible to tell when tokens were actually being sent/received!) … sometimes hiding the compaction percentage … the incredible lag on ESC interruption in long-running sessions, the now-broken clearing of context window content on Task tool usage …

Who the fuck is working on this software and do they actually use it themselves?

Maybe the quality of Claude Code on any given day is indicative of whether their models are degraded …

yurifury•3h ago
Use /config and enable verbose output to see the token consumption/usage per message.
CuriouslyC•2h ago
Claude Code is indeed legit bad. You'd never know this was a billion-dollar company from the mess of JavaScript they hacked together. You have to periodically close and re-open the client, because otherwise it starts to lag the system from constantly scanning and saving a big JSON file; they didn't think to shard their storage or use a database. I have 128GB of RAM on my workstation, and running 8 Claude Code instances at once sometimes causes heavy thrashing and desktop responsiveness issues... That's just insane.

Needless to say, I built my own agent (just needs a good web UI, last step!). The only thing keeping me with Anthropic right now is the economics of the plan; my inference bill would be a second mortgage without it.

pton_xd•1h ago
Lately I've noticed even the Claude web interface for chat is laggy on my 16 core / 32 GB RAM laptop. How is that possible?! It's just text!
behnamoh•3h ago
Anthropic only has _one_ product that people want: Claude Code. Everything else about their offerings sucks compared to the competition:

- shitty voice to text (why not just use Whisper at this point?)

- clunky website

- no image/video generation models

- DeepResearch sucks big time

- "Extended Thinking" doesn't seem to do much "thinking". I get the same results without it.

- API too expensive for what it is.

- No open-weight model to boost their reputation. Literally every other player has released an open model at this point.

viraptor•3h ago
That's a weird summary, given how many people around me use Claude with Cursor and still prefer it over GPT-5. I don't think you can claim a complete view of what their customers want.
visarga•3h ago
This is why it is hard to take out a subscription or a dependency on them if they degrade the services willy-nilly. A bait-and-switch tactic.

In Cursor I am seeing varying degrees of delays after exhausting my points for on-demand usage. Some days it works well; other days it just inserts a 30s wait on each message. What am I paying for? You never know when you buy.

behnamoh•3h ago
You should never buy annual AI subs. This field moves fast, and companies often change their ToS. Poe.com did the same and I was out (one day they decided to change the monthly quotas for the SOTA models and turned off the good old GPT-4, replacing it with GPT-4-Turbo, which was quantized and bad).
andrewinardeer•2h ago
Could you not ask for, and be entitled to, a refund for your remaining time on an annual subscription if the ToS change n months into it?
bakugo•59m ago
Could you ask them? Sure, but good luck getting it. In theory, forcefully changing the terms of a service after payment without offering a refund should clearly not be allowed, but in practice, it very much is unless you're willing to waste disproportionate amounts of money taking them to court.
troupo•3h ago
One of the many reasons why any advice du jour on "just use this methodology to make agentic coding produce amazing results" is utter crap.
naiv•3h ago
I think this is directly related to https://x.com/sama/status/1965110064215458055

And I think it was 100% on purpose that they degraded the model performance as Claude Code got so popular and they either ran out of capacity or were losing money too fast.

But now that people are fleeing to Codex, which improved so much in the meantime, they had to act.

deepdarkforest•1h ago
They will probably also release Sonnet 4.2 or something soon, to make people jump back in to try it and, hopefully, stick around again.
BhavdeepSethi•3h ago
You're absolutely right! The degraded model quality finally pushed me to stop paying for the max plan. Still on the Pro for now.
irthomasthomas•2h ago

  "we often make changes intended to improve the efficiency and throughput of our models.." 
https://status.anthropic.com/incidents/h26lykctfnsz

I thought Anthropic said they never mess with their models like this? Now they do it often?

mccoyb•2h ago
I read this as changes to quantization and batching techniques. The latter shouldn’t affect logits; the former definitely will.
jjani•1h ago
They already have a track record of messing with internal system prompts (including those that affect the API), which obviously directly changes outputs given the same prompts. So in effect, they've already been messing with the models for a long time. It's well known among founders who run services based on their products that this happened; everyone who does long output saw the same. It happened around November last year. If you had a set of evals running that expected an output of e.g. 6k tokens in length on 3.5 Sonnet, overnight it suddenly started cutting off at <2k, ending the message with something like "(Would you like me to continue?)". This is on raw API calls.

Never seen or heard of (from people running services at scale, not just rumours) this kind of API behaviour change for the same model from OpenAI or Google. Gemini 2.5 Pro did materially change at the time of prod release, despite them claiming they had simply "promoted the final preview endpoint to GA", but in that case you can give them the benefit of it being technically a new endpoint. Still lying, but less severe.

simonw•35m ago
Can you expand on "messing with internal system prompts" - this is the first I have heard of that.
simonw•36m ago
Anthropic have frequently claimed that they do not change the model weights without bumping the version number.

I think that is compatible with making "changes intended to improve the efficiency and throughput of our models" - i.e. optimizing their inference stack, but only if they do so in a way that doesn't affect model output quality.

Clearly they've not managed to do that recently, but they are at least treating these problems as bugs and rolling out fixes for them.

simianwords•2h ago
There are loads of people who just tried Claude, left unimpressed, and moved on to something else. They would never know about this regression.

And this bad memory might stick for a while.

avishai2112•2h ago
What kind of incident report is this? "It's a bug, we fixed it!" - Anthropic
naiv•2h ago
The model providers should analyse the tone of the instructions.

Before I finally gave up on Claude Code, I noticed that I got more aggressive towards it the more stupid it got, as I could not believe how dumb it had become.

And I am sure I was not the only one.

stpedgwdgfhgdd•2h ago
This RCA is too vague: ‘a bug’.

I want to know how I could have been impacted.

fxtentacle•2h ago
My guess would be that they tried to save money with speculative decoding and had thresholds in the verification stage that were too loose.

As someone who has implemented this myself, I know that it’s pretty easy to make innocent mistakes there. And the only visible result is a tiny distortion of the output distribution, which only really becomes visible after analysing thousands of tokens. And I would assume that all providers are using speculative decoding by now, because it’s the only way to get good inference speed at scale.

As a quick recap: you train a small model to quickly predict the easy tokens, like filler words, so that you can jump over them in the recurrent decoding loop. That way, a serial model can emit multiple tokens per invocation, easily doubling throughput.

And the fact that they need lots of user tokens to verify that it works correctly would nicely explain why it took them a while to find and fix the issue.

metadat•1h ago
Speculative Decoding, for the uninitiated (like me..): https://research.google/blog/looking-back-at-speculative-dec...
buildbot•1h ago
Standard speculative decoding without relaxed acceptance has no accuracy impact, as far as I understand things. If you always run the verification, you always have the true target model output.