frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

We estimate that Claude Opus 4.6 has a 50%-time-horizon of around 14.5 hours

https://twitter.com/METR_Evals/status/2024923422867030027
4•doener•1h ago

Comments

ben_w•1h ago
Also asked on Telegram, but Hacker News may have additional input:

I've just begun, since this morning, to wonder what I realise is a basic question I've never seen: what's the longest/largest task a human can do with n% accuracy?

For big tasks, we break them down, so we often *don't* do one huge single task. No one person actually makes an entire biro, or even an entire pencil; a human can write something like DOOM, but not usually by themselves, especially bug-free as even Carmak got help testing from the rest of id.

Is it perhaps possible to work this out from the same data used in the METR model itself? Were there tasks which several humans attempted, but only half of those humans succeeded at?

David Beazley: An End to Week-Long Immersion Courses

https://buttondown.com/dabeaz/archive/an-end-to-week-long-immersion-courses/
1•kurinikku•1m ago•0 comments

Hemmi/Post 1460 Versalog (Sliderule emulator)

https://thingsabove.github.io/Sliderule-Simulator-with-Solver/react/hemmi_versalog.html
1•todsacerdoti•2m ago•0 comments

Pentagon and Energy Department airlifts nuclear reactor from California to Utah

https://www.pbs.org/newshour/nation/u-s-military-airlifts-small-reactor-for-the-first-time-as-tru...
1•ck2•2m ago•0 comments

NASA astronauts' moon mission delayed due to rocket issue

https://www.bbc.com/news/articles/c626v265zqlo
2•tartoran•4m ago•0 comments

The Problem with AI Agents Isn't Identity, It's Authorization

https://fusionauth.io/blog/ai-authorization
1•mooreds•5m ago•0 comments

Amazon blames human employees for an AI coding agent's mistake

https://www.theverge.com/ai-artificial-intelligence/882005/amazon-blames-human-employees-for-an-a...
2•mooreds•6m ago•0 comments

The 4th Factor

https://lifanzeng.com/the-4th-factor
1•LeafyLi•6m ago•0 comments

Forget Greenland: This Arctic NATO Island Has a Russian Presence

https://www.wsj.com/world/forget-greenland-this-arctic-nato-island-already-has-a-russian-presence...
1•malshe•7m ago•1 comments

OpenAI is Suddenly in Trouble? [video]

https://www.youtube.com/watch?v=-q2n5DkDoMQ
2•CHB0403085482•9m ago•0 comments

Applejak: Interpreter for a subset of K programming language for Super CHIP-8

https://internet-janitor.itch.io/applejak
1•tosh•12m ago•0 comments

Ask HN: What Matters Most in Tech? Awards, Media Praise, or Peer Respect

2•SoundsDebatable•14m ago•3 comments

Show HN: Dq – pipe-based CLI for querying CSV, JSON, Avro, and Parquet files

https://github.com/razeghi71/dq
1•razeghi71•14m ago•0 comments

ChatGPT's hidden bias about your state or city

https://www.washingtonpost.com/technology/interactive/2026/see-chatgpts-hidden-bias-about-your-st...
1•Sherl•14m ago•1 comments

Show HN: GitHub Tray for GNOME gets a big update: notifications, Actions, issues

1•debba•18m ago•0 comments

Metabolism (Architecture)

https://en.wikipedia.org/wiki/Metabolism_(architecture)
1•azhenley•19m ago•0 comments

Ask HN: What's one thing that interested you this week?

1•subdomain•24m ago•0 comments

BreakPoint: Local-first CI gate for LLM output changes (cost, PII, drift)

https://github.com/cholmess/breakpoint-ai
1•cholmess21•27m ago•1 comments

Amazon dethrones Walmart as the world's biggest company by sales

https://www.npr.org/2026/02/19/nx-s1-5719173/amazon-walmart-biggest-company-by-sales
1•geox•28m ago•0 comments

Tube passengers targeted in 'smishing' scam, court told

https://www.bbc.co.uk/news/articles/cg4gkzw971go
1•edward•32m ago•0 comments

Your Android phone has a desktop mode you're probably not using

https://www.makeuseof.com/android-phone-has-desktop-mode-youre-probably-not-using/
3•teleforce•38m ago•0 comments

Democratizing cryptographic silicon verification with Infra-Red imaging (2024)

https://www.bunniestudios.com/blog/2024/iris-infra-red-in-situ-project-updates/
2•transpute•38m ago•0 comments

Symplex Protocol – semantic intent vectors for AI agent communication (Go, v0.1)

https://github.com/olserra/symplex
1•olserra•39m ago•1 comments

The Rise of Invisible Unemployment in Tech: 2026 Will Be the Year It Changes

https://www.saastr.com/the-rise-of-invisible-unemployment-in-tech-2026-will-be-the-year-when-ever...
1•bentobean•41m ago•0 comments

Ask HN: How do you monitor and retry failed webhooks in production?

2•GoatPerfect•41m ago•2 comments

Write Perfect Emails in Seconds

4•vinayofc•42m ago•0 comments

The New Mexico cave expanding our search for alien life

https://www.bbc.com/future/article/20260130-how-deep-caves-are-transforming-our-search-for-extrat...
2•marc__1•42m ago•0 comments

Functionalized Coatings as Biohybrid UV-Sensors

https://advanced.onlinelibrary.wiley.com/doi/10.1002/admi.202500125
1•PaulHoule•42m ago•0 comments

Show HN: PrivateOS: An AI agent that runs on your phone

https://private-os.vercel.app
3•pruthvi77•43m ago•1 comments

CipherDrop

https://www.cipherdrop.app/
1•shablulman•44m ago•0 comments

Announcing Oracle Solaris 11.4 SRU90

https://blogs.oracle.com/solaris/announcing-oracle-solaris-11-4-sru90
3•pjmlp•45m ago•0 comments