GPT-5.1 for Developers

https://openai.com/index/gpt-5-1-for-developers/
112•tedsanders•2mo ago

Comments

felixbraun•2mo ago
Already live in Cursor btw
kevinkatzke•2mo ago
This got only a single comment and 34 points in 3 hours. Crazy how the dynamics have changed around model releases in just a single year.
throwup238•2mo ago
There was already an announcement post for 5.1 yesterday: https://news.ycombinator.com/item?id=45904551
dang•2mo ago
Thanks! Macroexpanded:

GPT-5.1: A smarter, more conversational ChatGPT - https://news.ycombinator.com/item?id=45904551 - Nov 2025 (672 comments)

amelius•2mo ago
More of the same, I suppose.

You have to be called Apple to get raving reviews for that.

observationist•2mo ago
This is the first low-key, silent feature rollout, treated like "just another software update", with no hype or buzz beforehand. Prior to this point, every other feature release was pumped for weeks or even months with "leaks" from insiders and deliberately getting people amped. I don't know if OpenAI changed marketing tactics, or if they're in a new chapter in some book, but this is a radical shift from what they were doing before.
voc•2mo ago
I feel like the rollout was a bit rushed. Benchmarks for 5.1 came out a day after the launch. New models weren't immediately available through the API. And then there's 5-Codex-Mini which was deprecated only six days later by 5.1-Codex-Mini. Wondering if Gemini 3 forced their hand here?
anuramat•2mo ago
sounds like this is just a new snapshot, so I don't think anything changed (upd: anything about their marketing I mean)
__jl__•2mo ago
The prompt caching change is awesome for any agent. Claude is far behind with increased costs for caching and manual caching checkpoints. Certainly depends on your application but prompt caching is also ignored in a lot of cost comparisons.
pants2•2mo ago
Though to be fair, thinking tokens are also ignored in a lot of cost comparisons and in my experience Claude generally uses fewer thinking tokens for the same intelligence
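To make the two comments above concrete, here is a rough sketch of a per-request cost estimate that counts both cached input and thinking tokens. Every price in it is a made-up placeholder, not any vendor's real rate, and billing thinking tokens at the output rate is an assumption rather than something stated in the thread.

```python
from dataclasses import dataclass


@dataclass
class Prices:
    """Hypothetical prices in dollars per million tokens (placeholders only)."""
    input_per_mtok: float
    cached_input_per_mtok: float
    output_per_mtok: float


def request_cost(prices: Prices, input_tokens: int, cached_tokens: int,
                 output_tokens: int, thinking_tokens: int = 0) -> float:
    """Estimate the cost of one request in dollars.

    Assumption: thinking tokens are billed at the output rate.
    """
    if not 0 <= cached_tokens <= input_tokens:
        raise ValueError("cached_tokens must be between 0 and input_tokens")
    uncached = input_tokens - cached_tokens
    billed_output = output_tokens + thinking_tokens
    total = (uncached * prices.input_per_mtok
             + cached_tokens * prices.cached_input_per_mtok
             + billed_output * prices.output_per_mtok)
    return total / 1_000_000


# Placeholder numbers, chosen only to show the shape of the comparison.
example = Prices(input_per_mtok=1.0, cached_input_per_mtok=0.1, output_per_mtok=10.0)
no_cache = request_cost(example, 100_000, 0, 1_000, thinking_tokens=2_000)
with_cache = request_cost(example, 100_000, 90_000, 1_000, thinking_tokens=2_000)
print(f"no cache: ${no_cache:.4f}  with cache: ${with_cache:.4f}")
```

For an agent that resends a long prefix on every turn, the cached share dominates the input bill, while a model that thinks longer moves the output term instead. A comparison that leaves out either term can flip the ranking.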
miohtama•2mo ago
> On coding, we’ve worked closely with startups like Cursor, Cognition, Augment Code, Factory, and Warp to improve GPT‑5.1’s coding personality, steerability, and code quality.

Why no GitHub?

conception•2mo ago
Microsoft isn’t a startup and I suspect OpenAI is working closely with Microsoft already.
mmusc•2mo ago
Model is available on copilot.
dweekly•2mo ago
A few hours of playing around and I'm suitably impressed.

Claude 4.5 Sonnet definitely struggles with Swift 6.2 Concurrency semantics and has several times gotten itself stuck rather badly. Additionally Claude Code has developed a number of bugs, including rapidly re-scrolling the terminal buffer, pegging local CPU to 100%, and consuming vast amounts of RAM. Codex CLI was woefully behind a few months ago and, despite overly conservative out-of-the-box sandbox settings, has quite caught up to Claude Code. (Gemini CLI is an altogether embarrassing experience, but Google did just put a solid PM behind it and 3.0 Pro should be out this month if we're lucky.)

Codex with 5.1 high managed to thoughtfully paw through the documentation and source code and - with a little help pulling down parts of the Swift Book - managed to correctly resolve the issue.

I remember getting the thread manager right being one of the harder parts of my operating systems course doing an undergrad in computer science; testing threaded programs has always been a challenge. It's a strange circle-of-life moment to realize that what was hard for undergrads also serves as a benchmark for coding agents!

CharlesW•2mo ago
> Claude 4.5 Sonnet definitely struggles with Swift 6.2 Concurrency semantics and has several times gotten itself stuck rather badly.

What solved that for me was to leverage the for-LLM docs Apple ships with Xcode, and then build a swift6-concurrency skill. Here's an example script to copy the Xcode docs into your repo: https://gist.github.com/CharlesWiltgen/75583f53114d1f2f5bae3...

dweekly•2mo ago
Lovely find!

/Applications/Xcode.app/Contents/PlugIns/IDEIntelligenceChat.framework/Versions/A/Resources/AdditionalDocumentation/Swift-Concurrency-Updates.md

is exactly the primer to give an agent.
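For anyone who wants that file inside a repo where an agent can read it, a minimal sketch follows. This is not the script from the gist linked above; the destination folder `docs/agent` is a hypothetical name, and the source path is simply the one quoted in the comment, which only exists on a Mac with a recent Xcode installed.

```python
import shutil
from pathlib import Path

# Path quoted in the comment above; present only where a recent Xcode is installed.
XCODE_DOC = Path(
    "/Applications/Xcode.app/Contents/PlugIns/IDEIntelligenceChat.framework"
    "/Versions/A/Resources/AdditionalDocumentation/Swift-Concurrency-Updates.md"
)


def copy_doc(src: Path, dest_dir: Path) -> Path:
    """Copy one documentation file into dest_dir and return the new path."""
    if not src.is_file():
        raise FileNotFoundError(f"documentation file not found: {src}")
    dest_dir.mkdir(parents=True, exist_ok=True)
    return Path(shutil.copy2(src, dest_dir / src.name))


if __name__ == "__main__":
    # "docs/agent" is a hypothetical destination, not a convention from the thread.
    try:
        print(copy_doc(XCODE_DOC, Path("docs/agent")))
    except FileNotFoundError as err:
        print(err)
```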

WhyOhWhyQ•2mo ago
"including rapidly re-scrolling the terminal buffer" Yes this bug is brutal.

"consuming vast amounts of RAM" Also this. Claude will leave hanging instances all the time. If you check your task manager after a few days of using it without doing a full reset, you'll see a number of hanging Claude processes using up 400 MB of RAM.

Claude actually has a huge number of very painful bugs. I'm aware of at least a dozen.

gigatree•2mo ago
The iOS app has also gotten pretty buggy. Not a great sign for the future of software, in terms of stability.
htrp•2mo ago
>but Google did just put a solid PM behind it

Citation?

gedy•2mo ago
The "apply_patch" addition is nice, as I have been struggling to get any AI API to correctly return diffs
anuramat•2mo ago
what's the point of apply_patch and shell tools though? can't you just define your custom tools with exactly the same behaviour, since you're implementing the actual execution on your side anyway? sounds like vendor lock in for the sake of vendor lock in
gedy•2mo ago
In my case, I don't want to do diff tool on my side as the diff is much smaller to send. Versus the LLM sending the whole file (slowly), just to send it back.
anuramat•2mo ago
I thought you still need to implement the patching on your side? judging by <https://platform.openai.com/docs/guides/tools-apply-patch>
gedy•2mo ago
You do, but the issue this helps with is it's difficult to get LLMs to return accurate unified diffs, which are valuable if you are editing some larger text via their APIs. The alternative of letting it send back the entire edited text is pretty slow. So them sending an accurate server-side diff (likely from some actual diff tool and not just LLM generated thing that sort of looks like a diff) is really helpful.
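To put a rough number on the size argument, here is a small sketch that uses Python's standard `difflib` to compare a unified diff with the full edited text. It only illustrates why a diff is cheaper to send back; it says nothing about how apply_patch produces its diffs, and the 500-line file is made-up sample data.

```python
import difflib


def unified_diff_text(old: str, new: str, path: str = "file.txt") -> str:
    """Return a unified diff between two versions of a text as one string."""
    old_lines = old.splitlines(keepends=True)
    new_lines = new.splitlines(keepends=True)
    return "".join(
        difflib.unified_diff(
            old_lines, new_lines, fromfile=f"a/{path}", tofile=f"b/{path}"
        )
    )


# Made-up sample: a 500-line file with a single edited line.
original = "".join(f"line {i}\n" for i in range(1, 501))
edited = original.replace("line 250\n", "line 250 (edited)\n")
patch = unified_diff_text(original, edited)

print(f"full file: {len(edited)} chars, diff: {len(patch)} chars")
print(patch)
```

With one changed line and the default three lines of context, the diff is a handful of lines, while the full-file alternative grows with the size of the file.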
sunaookami•2mo ago
Man these names are so confusing and now reasoning_effort "minimal" was renamed to "none"? And the error message says only "medium" is supported?? Also the docs make no mention if gpt-5.1-chat-latest is included in the "free" offer (when having prompt sharing turned on). The popup says gpt-5.1 is included but not gpt-5.1-chat even though gpt-5-chat-latest is included. Why is it even called "chat" when its official name is "Instant"? And what even IS the difference between gpt-5.1 and gpt-5.1-chat if both support reasoning_effort??
selbyk•2mo ago
It's all vibe coded
tedsanders•2mo ago
- reasoning_effort "minimal" was not renamed to "none"; "none" is a new, faster level supported by GPT-5.1 but not GPT-5

- there's no good reason it's called "chat" instead of "Instant"

- gpt-5.1 and gpt-5.1-chat are different models, even though they both reason now. gpt-5.1 is more factual and can think for much longer. most people want gpt-5.1, unless the use case is ChatGPT-like or they prefer its personality.
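One way to keep the first point straight in client code is a small lookup of the lowest reasoning_effort level per model. The sketch below encodes only what this thread says ("minimal" on GPT-5, "none" new in GPT-5.1); the table and the helper name are hypothetical, not part of any official SDK, and other models are deliberately left out because the thread does not cover them.

```python
# Lowest reasoning_effort level per model, as described in this thread only.
LOWEST_EFFORT = {
    "gpt-5": "minimal",  # per the thread, "none" is not supported on GPT-5
    "gpt-5.1": "none",   # per the thread, "none" is a new level in GPT-5.1
}


def lowest_effort(model: str) -> str:
    """Return the lowest reasoning_effort value known for a model name."""
    try:
        return LOWEST_EFFORT[model]
    except KeyError:
        raise ValueError(f"no reasoning_effort info for model: {model}") from None


for name in ("gpt-5", "gpt-5.1"):
    print(name, "->", lowest_effort(name))
```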

jtrn•2mo ago
So is this better than, different from, or replacing the current Codex?
Tankenstein•2mo ago
This is the first time since GPT 4.1 that I think I can upgrade our main agent model. Any noticeable amount of reasoning has been too slow for us, since the model is having a real-time conversation with the user. "minimal" reasoning GPT-5 performs terribly, it's significantly dumber than GPT 4.1 in a long, multi-turn conversation with tools.

This time, I just dropped it in and at first glance it seems to work well. I'll probably upgrade over the weekend if I see a boost in performance somewhere after tuning the prompts.