frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

AI Fails at 96% of (General Work) Jobs (New Study)

https://www.youtube.com/watch?v=z3kaLM8Oj4o
3•swolpers•1h ago

Comments

adyashakti•1h ago
translation: "96% of people trying to replace workers with AI don't know how to prompt it effectively or supervise its output."
devnonymous•1h ago
So what you're saying is the interface fails the common case?
BoredPositron•31m ago
The 4% is using it to write posts about ai on linkedin.
ben_w•28m ago
Actual paper: https://www.remotelabor.ai/paper.pdf

Sounds about right.

With those test parameters for how long it would take a human to complete the same work, it fits a similar pattern to METR; i.e. at "humans would take 11.5 hours" (Figure 4, median) you're pushing your luck for any success with all but the most recent models*, and METR is testing software where AI has the possibility of fully automating a lot of its own tests.

Even more recent models than they tested, like Opus 4.5, are only 50% successful for tasks that take humans 5h20m: https://metr.org/time-horizons/

Assuming the bubble doesn't pop/WW3 doesn't start first (IDK, 25% and 5% respectively?), and if trends continue (???), I expect a similar paper this time next year to show something like 50% success at automation of similar tasks.

* which they didn't test, I don't blame them for that because this field moves too fast

Vitalik Buterin says prediction markets need to stop catering to 'dumb opinions'

https://www.theblock.co/post/389984/polymarket-investor-vitalik-buterin-says-prediction-markets-n...
1•thm•27s ago•0 comments

Show HN: I've build a self hosted convex/Firebase/Supabase alternative

https://linkedrecords.com/
1•WolfOliver•2m ago•0 comments

The seven programming ur-languages

https://madhadron.com/programming/seven_ur_languages.html
1•tosh•2m ago•0 comments

Show HN: VPS-Harden, an Idempotent Bash Script to Harden Ubuntu VPS – OpenClaw

https://github.com/ranjith-src/vps-harden
1•ranbo•5m ago•1 comments

Anthropic tries to hide Claude's AI actions. Devs hate it

https://www.theregister.com/2026/02/16/anthropic_claude_ai_edits/
2•beardyw•8m ago•0 comments

Memoh - Multi-Member, Structured Long-Memory, Containerized AI Agent System

https://github.com/memohai/Memoh
2•acbox_liu•8m ago•0 comments

Show HN: Rivestack – Managed PostgreSQL with pgvector, $29/mo

https://www.rivestack.io/
1•stranger90•10m ago•0 comments

India plans AI 'data city' on staggering scale

https://techxplore.com/news/2026-02-india-ai-city-staggering-scale.html
1•geox•11m ago•0 comments

Lawsuit: AI used women's faces without consent for sexual content

https://www.12news.com/article/news/local/arizona/arizona-women-sue-men-sexually-explicit-ai-gene...
3•nizbit•11m ago•0 comments

Show HN: Interactive SQL Tutorial with a visual query builder

1•mootoday•11m ago•0 comments

Show HN: ClawSouls – Open registry of shareable personas for AI agents

https://clawsouls.ai/en
1•tomleelive•14m ago•1 comments

macOS Accessibility Inspector

https://developer.apple.com/documentation/accessibility/accessibility-inspector
1•keepamovin•14m ago•1 comments

EU Software Patents v3.0 via the Unified Patent Court [pdf]

https://fosdem.org/2026/events/attachments/G3ZWYU-lightning_lightning_talks_2/slides/267684/eu_so...
1•zoobab•19m ago•0 comments

Show HN: cc-hdrm v1.3 – macOS menu bar app that tracks your Claude subscription

2•rajish•19m ago•0 comments

Show HN: Chisel for Claude. Vibe code 2X faster using your voice

https://jorgtron.github.io/chisel-for-claude/
1•jorgtron•21m ago•0 comments

Qwen 3.5

https://huggingface.co/collections/Qwen/qwen35
5•xnhbx•26m ago•0 comments

The current state of LLM-based multi-agent systems for software engineering

https://chuniversiteit.nl/papers/llm-based-multi-agent-systems-for-software-engineering
1•ibobev•26m ago•0 comments

Sharing in Dada

https://smallcultfollowing.com/babysteps/blog/2026/02/14/sharing-in-dada/
1•ibobev•27m ago•0 comments

Sprite Graphics Traditions

https://bumbershootsoft.wordpress.com/2026/02/14/sprite-graphics-traditions/
1•ibobev•27m ago•0 comments

TaskForge – OpenClaw in contained permission based platform

https://github.com/romanklis/openclaw-contained
1•roman_klis•30m ago•1 comments

Lloyds to investigate its use of staff banking data in pay talks

https://www.thetimes.com/business/companies-markets/article/lloyds-investigates-using-staff-bank-...
1•petethomas•31m ago•0 comments

EU Parliament backs digital euro, aligns with Council

https://www.reuters.com/business/finance/eu-parliament-backs-digital-euro-aligns-with-council-onl...
1•janandonly•32m ago•1 comments

Show HN: Upvotics – Track Reddit conversations where people need your product

https://upvotics.com/
1•Yaramsa-Gautham•32m ago•0 comments

Sam Altman: Codex weekly users tripled since the beginning of the year

https://twitter.com/sama/status/2023233085509410833
1•alecco•34m ago•1 comments

GPU Virtualization Architecture for Multi-Desktop Containers

https://blog.helix.ml/p/gpu-virtualization-architecture-for
1•lewq•35m ago•0 comments

Moonshot AI's Founder: His Pursuit of AGI and the Company's –. Business Model

https://aiproem.substack.com/p/moonshot-ais-founder-his-pursuit
1•totetsu•39m ago•0 comments

Kitesurfing

https://en.wikipedia.org/wiki/Kiteboarding
1•kaycebasques•41m ago•0 comments

Show HN: Vocalinux // 100% offline voice typing for Linux

https://vocalinux.com/
2•jatinkrmalik•42m ago•0 comments

Latin Competition – Google Translate vs. the BBC

https://medium.com/luminasticity/latin-competition-google-translate-vs-the-bbc-b1cbcc2d9266
1•bryanrasmussen•46m ago•0 comments

Amazon Wins $6M in Damages Against Pirated DVD Stores, Plus Domain Takeovers

https://torrentfreak.com/amazon-wins-6-million-in-damages-against-pirated-dvd-stores-plus-domain-...
2•gslin•47m ago•0 comments