How do we keep apps maintained on Flathub? (or building a more respectful App S

https://tim.siosm.fr/blog/2025/11/24/building-better-app-store-flathub/
1•todsacerdoti•1m ago•0 comments

Gramma, a Galápagos Tortoise at the San Diego Zoo, Dies at About 141

https://www.nytimes.com/2025/11/24/science/gramma-galapagos-tortoise-san-diego-zoo-dies.html
1•quapster•1m ago•0 comments

Old-school rotary phone dials into online meetings hangs up when slam it down

https://www.theregister.com/2025/11/24/rotary_phone_online_meetings/
1•Bender•1m ago•0 comments

A.I. is a printed birthday card train to Paris

https://filiph.net/text/ai-is-a-printed-birthday-card-train-to-paris.html
1•markdog12•2m ago•0 comments

Claude Advanced Tool Use

https://www.anthropic.com/engineering/advanced-tool-use
1•lebovic•4m ago•0 comments

Riding Uphill: The Tariff Squeeze on America's Bike Scene

https://micromobility.io/news/riding-uphill-the-tariff-squeeze-on-americas-bike-scene
1•prabinjoel•4m ago•0 comments

Zapier Security Incident Packages and Zapier Developers

https://status.zapier.com/incidents/01KAV9DDHMYT7R6MFHSB8C09E3
2•Kerrick•5m ago•0 comments

Signal Secure Backups now available on iOS

https://twitter.com/signalapp/status/1993018186690871409
1•raybb•5m ago•0 comments

Bringing organ-scale cryopreservation into existence (Hunter Davis, #6) [video]

https://www.owlposting.com/p/bringing-organ-scale-cryopreservation
1•crescit_eundo•6m ago•0 comments

North Carolina Town First to Deploy Defibrillator Drones During 911 Emergencies

https://gizmodo.com/north-carolina-town-first-in-u-s-to-deploy-defibrillator-drones-during-actual...
1•speckx•7m ago•0 comments

AlmaLinux 10.1 Released – Complete with Btrfs Support

https://www.phoronix.com/news/AlmaLinux-10.1-Released
2•Bender•8m ago•0 comments

Mind-altering 'brain weapons' no longer only science fiction, say researchers

https://www.theguardian.com/world/2025/nov/22/mind-altering-brain-weapons-no-longer-only-science-...
3•thinkingemote•9m ago•0 comments

Synthetic boundaries for enhanced performance in densified thick electrodes

https://www.nature.com/articles/s41467-025-65257-2
1•PaulHoule•10m ago•0 comments

Scientists issue ominous warning over mind-altering 'brain weapons'

https://www.dailymail.co.uk/sciencetech/article-15320293/Scientists-warning-mind-altering-weapons...
2•Bender•10m ago•0 comments

System Card: Claude Opus 4.5 [pdf]

https://assets.anthropic.com/m/64823ba7485345a7/Claude-Opus-4-5-System-Card.pdf
2•alvis•10m ago•0 comments

Association of Screen Time with ADHD

https://www.nature.com/articles/s41398-025-03672-1
1•wjb3•10m ago•0 comments

WinBoat v0.9.0 Released

https://github.com/TibixDev/winboat/releases/tag/v0.9.0
1•PhilippGille•11m ago•0 comments

Rebble in Your Own World

https://rebble.io/2025/11/24/rebble-in-your-own-world.html
1•osculum•11m ago•0 comments

Streamline Structured and Unstructured Data Flows from Postgres with AI

https://cocoindex.io/blogs/postgres-source
1•badmonster•11m ago•0 comments

Olfactory White

https://en.wikipedia.org/wiki/Olfactory_white
2•sans_souse•11m ago•0 comments

Yes, the Universe Can Expand Faster Than Light

https://www.universetoday.com/articles/yes-the-universe-can-expand-faster-than-light
1•amichail•12m ago•0 comments

Former soldier pleads guilty to attempting to share military secrets with China

https://www.justice.gov/opa/pr/former-jblm-soldier-pleads-guilty-attempting-share-military-secret...
1•737min•14m ago•0 comments

Show HN: Open-source medical data(bloodwork, genetics, etc.) chat agent

https://github.com/ianrowan/OpenMed
1•imr_med•14m ago•0 comments

The Experience Machine

https://unpublishablepapers.substack.com/p/the-experience-machine
1•benrostike•15m ago•0 comments

Debugging When Everything Is on Fire (Using AI)

https://theahura.substack.com/p/debugging-when-everything-is-on-fire
1•theahura•16m ago•0 comments

A New Bridge Links the Math of Infinity to Computer Science

https://www.quantamagazine.org/a-new-bridge-links-the-strange-math-of-infinity-to-computer-scienc...
1•layer8•19m ago•0 comments

Building an XDP (EXpress Data Path) Based BGP Peering Router

https://toonk.io/index.html
1•ofrzeta•22m ago•0 comments

Electricity is about to become the new base currency and China figured it out

https://electrek.co/2025/11/21/electricity-is-about-to-become-the-new-base-currency-and-china-fig...
3•thelastgallon•22m ago•0 comments

Efficient boundary value problems solving in SciML [video]

https://www.youtube.com/watch?v=x_LyFbwHncA
1•adgjlsfhk1•23m ago•0 comments

Blue Owl Is the Sum of All Investor Fears

https://www.bloomberg.com/opinion/articles/2025-11-24/ai-private-credit-blue-owl-is-the-sum-of-al...
1•zerosizedweasle•24m ago•1 comments

Claude Opus 4.5

https://www.anthropic.com/news/claude-opus-4-5
137•adocomplete•32m ago

Comments

jumploops•27m ago
> Pricing is now $5/$25 per million tokens

So it’s 1/3 the price of Opus 4.1…

> [..] matches Sonnet 4.5’s best score on SWE-bench Verified, but uses 76% fewer output tokens

…and potentially uses a lot less tokens?

Excited to stress test this in Claude Code, looks like a great model on paper!

jmkni•17m ago
> Pricing is now $5/$25 per million tokens

For anyone else confused, it's input/output tokens

$5 for 1 million tokens in, $25 for 1 million tokens out
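
To make the input/output split concrete, here is a minimal sketch of the arithmetic. The $5/$25 prices are the ones quoted above and the 15/75 figure is the older Opus price mentioned elsewhere in this thread; the workload size (200k tokens in, 20k tokens out) and the helper name `request_cost` are purely hypothetical.

```python
# Prices in USD per million tokens.
OPUS_4_5 = {"input": 5.00, "output": 25.00}        # quoted in the announcement
OPUS_PREVIOUS = {"input": 15.00, "output": 75.00}  # the "15/75" figure cited in the thread


def request_cost(prices, input_tokens, output_tokens):
    """Return the USD cost of one request at the given per-million-token prices."""
    total = input_tokens * prices["input"] + output_tokens * prices["output"]
    return total / 1_000_000


# Hypothetical workload (an assumption for illustration): 200k tokens in, 20k tokens out.
new = request_cost(OPUS_4_5, 200_000, 20_000)
old = request_cost(OPUS_PREVIOUS, 200_000, 20_000)
print(f"At 5/25: ${new:.2f}, at 15/75: ${old:.2f}, ratio: {old / new:.1f}x")
```

Because both the input and output prices dropped by the same factor, the ratio stays at 3x regardless of the input/output mix.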

alach11•12m ago
This is the biggest news of the announcement. Prior Opus models were strong, but the cost was a big limiter of usage. This price point still makes it a "premium" option, but isn't prohibitive.
elvin_d•25m ago
Great seeing the price reduction. Opus historically was priced at 15/75; this one delivers at 5/25, which is close to Gemini 3 Pro. I hope Anthropic can afford increasing limits for the new Opus.
rishabhaiover•24m ago
Is this available on claude-code?
greenavocado•22m ago
What are you thinking of trying to use it for? It is generally a huge waste of money to unleash Opus on high content tasks ime
rishabhaiover•21m ago
I use claude-code extensively to plan and study for my college using the socrates learning mode. It's a great way to learn for me. I wanted to test the new model's capabilities on that front.
flutas•19m ago
My workflow has always been opus for planning, sonnet for actual work.
elvin_d•22m ago
Yes, the first run was nice - feels faster than 4.1 and did what Sonnet 4.5 struggled to execute properly.
rishabhaiover•18m ago
damn, I need a MAX sub for this.
stavros•11m ago
You don't, you can add $5 or whatever to your Claude wallet with the Pro subscription and use those for Opus.
bnchrch•22m ago
Seeing these benchmarks makes me so happy.

Not because I love Anthropic (I do like them) but because it's staving off me having to change my Coding Agent.

This world is changing fast, and both keeping up with State of the Art and/or the feeling of FOMO is exhausting.

I've been holding onto Claude Code for the last little while since I've built up a robust set of habits, slash commands, and sub agents that help me squeeze as much out of the platform as possible.

But with the last few releases of Gemini and Codex I've been getting closer and closer to throwing it all out to start fresh in a new ecosystem.

Thankfully Anthropic has come out swinging today and my own SOPs can remain intact a little while longer.

stavros•19m ago
Did anyone else notice Sonnet 4.5 being much dumber recently? I tried it today and it was really struggling with some very simple CSS on a 100-line self-contained HTML page. This never used to happen before, and now I'm wondering if this release has something to do with it.

On-topic, I love the fact that Opus is now three times cheaper. I hope it's available in Claude Code with the Pro subscription.

EDIT: Apparently it's not available in Claude Code with the Pro subscription, but you can add funds to your Claude wallet and use Opus with pay-as-you-go. This is going to be really nice to use Opus for planning and Sonnet for implementation with the Pro subscription.

However, I noticed that the previously-there option of "use Opus for planning and Sonnet for implementation" isn't there in Claude Code with this setup any more. Hopefully they'll implement it soon, as that would be the best of both worlds.

kjgkjhfkjf•13m ago
My guess is that Claude's "bad days" are due to the service becoming overloaded and failing over to use cheaper models.
bryanlarsen•12m ago
On Friday my Claude was particularly stupid. It's sometimes stupid, but I've never seen it be that consistently stupid. I just assumed it was a fluke, but maybe something was changing.
827a•18m ago
I've played around with Gemini 3 Pro in Cursor, and honestly: I find it to be significantly worse than Sonnet 4.5. I've also had some problems that only Claude Code has been able to really solve; Sonnet 4.5 in there consistently performs better than Sonnet 4.5 anywhere else.

I think Anthropic is making the right decisions with their models. Given that software engineering is probably one of the very few domains of AI usage that is driving real, serious revenue: I have far better feelings about Anthropic going into 2026 than any other foundation model. Excited to put Opus 4.5 through its paces.

visioninmyblood•16m ago
The model is great; it is able to code up some interesting visual tasks (I guess they have pretty strong tool calling capabilities), like orchestrating prompt -> image generation -> segmentation -> 3D reconstruction. Check out the results here: https://chat.vlm.run/c/3fcd6b33-266f-4796-9d10-cfc152e945b7. Note the model was only used to orchestrate the pipeline; the tasks are done by other models in an agentic framework. They must have improved the tool calling framework with all the MCP usage. Gemini 3 was able to orchestrate the same, but Claude 4.5 is much faster.
Squarex•15m ago
I have heard that Gemini 3 is not that great in Cursor, but excellent in Antigravity. I don't have the time to personally verify all that, though.
incoming1211•14m ago
I think Gemini 3 is hot garbage in everything. It's great on a greenfield project when trying to one-shot something; if you're working on a long-term project it just sucks.
koakuma-chan•6m ago
Nothing is great in Cursor.
rishabhaiover•14m ago
I suspect Cursor is not the right platform to write code on. IMO, humans are lazy and would never code on Cursor. They default to code generation via prompt which is sub-optimal.
viraptor•10m ago
> They default to writing code via prompt generation which is sub-optimal.

What do you mean?

rishabhaiover•5m ago
If you're given a finite context window, what's the most efficient token to present for a programming task: sloppy prompts or actual code (using it with autocomplete)?
behnamoh•11m ago
I’ve tried Gemini in Google AI Studio as well and was very disappointed by the superficial responses it provided. It seems to be at the level of GPT-5-low or even lower.

On the other hand, it’s a truly multimodal model, whereas Claude remains specifically targeted at coding tasks, and therefore is only a text model.

poszlem•9m ago
I’ve trashed Gemini non-stop (seriously, check my history on this site), but 3 Pro is the one that finally made me switch from OpenAI. It’s still hot garbage at coding next to Claude, but for general stuff, it’s legit fantastic.
enraged_camel•6m ago
My testing of Gemini 3 Pro in Cursor yielded mixed results. Sometimes it's phenomenal. At other times I either get the "provider overloaded" message (after like 5 mins or whatever the timeout is), or the model's internal monologue starts spilling out to the chat window, which becomes really messy and unreadable. It'll do things like:

>> I'll execute.

>> I'll execute.

>> Wait, what if...?

>> I'll execute.

Suffice it to say I've switched back to Sonnet as my daily driver. Excited to give Opus a try.

vunderba•5m ago
My workflow was usually to use Gemini 2.5 Pro (now 3.0) for high-level architecture and design. Then I would take the finished "spec" and have Sonnet 4.5 perform the actual implementation.
GodelNumbering•17m ago
The fact that the post singled out SWE-bench at the top gives the opposite impression from what they probably intended.
grantpitt•15m ago
do say more
GodelNumbering•6m ago
Makes it sound like a one trick pony
alvis•15m ago
What surprises me is that Opus 4.5 lost all reasoning scores to Gemini and GPT. I thought that was the area where the model would shine the most.
viraptor•12m ago
Has there been any announcement of a new programming benchmark? SWE looks like it's close to saturation already. At this point for SWE it may be more interesting to start looking at which types of issues consistently fail/work between model families.
llamasushi•8m ago
The burying of the lede here is insane. $5/$25 per MTok is a 3x price drop from the previous $15/$75 Opus pricing. At that price point, Opus stops being "the model you use for important things" and becomes actually viable for production workloads.

Also notable: they're claiming SOTA prompt injection resistance. The industry has largely given up on solving this problem through training alone, so if the numbers in the system card hold up under adversarial testing, that's legitimately significant for anyone deploying agents with tool access.

The "most aligned model" framing is doing a lot of heavy lifting though. Would love to see third-party red team results.

keeeba•8m ago
Oh boy, if the benchmarks are this good and Opus feels like it usually does then this is insane.

I’ve always found Opus significantly better than the benchmarks suggested.

LFG

aliljet•8m ago
The real question I have after seeing the usage rug being pulled is what this costs and how usable this ACTUALLY is with a Claude Max 20x subscription. In practice, Opus is basically unusable by anyone paying enterprise-prices. And the modification of "usage" quotas has made the platform fundamentally unstable, and honestly, it left me personally feeling like I was cheated by Anthropic...