frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: SurvivalIndex – which developer tools do AI agents choose?

https://survivalindex.org/
1•scalefirst•2h ago
We've been running coding agents against standardized repos with natural-language prompts — no tool names, no hints — and measuring what they actually choose.

Early finding: Claude Code picks Custom/DIY in 12 of 20 categories. Not because it can't use the tools (BFCL scores suggest it can) but because it doesn't reach for them. That's a different failure mode than capability benchmarks measure.

We score each tool on: agent visibility, pick rate vs Custom/DIY, cross-context breadth, expert human ratings, and implementation success rate. Tools above survival=1 persist. Below it, agents synthesize around them.

Methodology is at survivalindex.org/methodology. Very curious what people think of the measurement approach, especially the human coefficient variable.

Comments

scalefirst•2h ago
One thing I'd love input on: we use expert human ratings as a variable (H) to capture whether agent choices align with what experienced engineers would actually ship. Curious if people think this is the right signal or whether it introduces too much subjectivity.

AI Error May Have Contributed to Girl's School Bombing in Iran

https://thisweekinworcester.com/exclusive-ai-error-girls-school-bombing/
1•apolloartemis•30s ago•0 comments

How many options fit into a boolean?

https://herecomesthemoon.net/2025/11/how-many-options-fit-into-a-boolean/
1•luu•10m ago•0 comments

SK lays off nearly 1k workers at Georgia plant amid cooling automaker EV plan

https://apnews.com/article/georgia-electric-vehicle-battery-manufacturing-layoffs-workers-79a4ec7...
1•josephh•10m ago•0 comments

Ships in Gulf declare themselves Chinese to dodge attack

https://www.ft.com/content/eaecc3ce-ba4e-4e19-9d0a-6cf1536a5aed
4•KnuthIsGod•14m ago•0 comments

Doomscroll 14,333 cat pictures

https://cat.aadishv.dev/
2•aadishv•14m ago•2 comments

Unemployment Reasons, by Age and Education

https://flowingdata.com/2026/03/05/unemployment-reasons/
3•gslin•17m ago•0 comments

Using Rust and Postgres for everything: patterns learned over the years

https://kerkour.com/rust-postgres-everything
3•olalonde•19m ago•0 comments

Show HN: Quantum-PULSE – compress-then-encrypt vault for LLM training data

https://github.com/Naveenub/quantum-pulse
1•naveenub•20m ago•0 comments

You can get better code by exploiting model weights

https://kelvinfichter.com/pages/thoughts/train-harder/
1•kfichter•25m ago•0 comments

Show HN: BurnRate – Track what you spend on AI coding tools

https://getburnrate.io/
1•mwong17•28m ago•0 comments

Worming out molecular secrets behind collective behaviour

https://iisc.ac.in/events/worming-out-molecular-secrets-behind-collective-behaviour/
2•rainhacker•28m ago•0 comments

Show HN: Resend-CLI, unofficial Resend CLI built for AI agents and humans

https://github.com/shubhdeep12/resend-cli
1•Shubhdeep_12•28m ago•0 comments

Show HN: Rai – Add AI steps in your shell, scripts or CI/CD pipelines

https://appmakes.github.io/Rai/
1•perkit•29m ago•0 comments

Full-Text RSS site config files

https://github.com/fivefilters/ftr-site-config
1•microflash•29m ago•0 comments

Astronomers Spot a Cosmic Laser Halfway Across the Universe

https://www.universetoday.com/articles/astronomers-using-meerkat-spot-a-cosmic-laser-halfway-acro...
2•johnbarron•30m ago•0 comments

Trump has privately shown serious interest in U.S. ground troops in Iran

https://www.nbcnews.com/politics/white-house/trump-privately-shown-serious-interest-us-ground-tro...
7•johnbarron•31m ago•2 comments

Semi-formal reasoning helps agents reason about code without executing the code

https://arxiv.org/abs/2603.01896
1•dnw•33m ago•0 comments

Show HN: EdgeDox – Offline document AI on Android using Qwen3.5-0.8B

https://play.google.com/store/apps/details?id=io.cyberfly.edgedox&hl=en_US
1•cyberfly-labs•34m ago•0 comments

EA Javelin Anticheat job listing mentions future support for Linux and Proton

https://www.gamingonlinux.com/2026/03/ea-javelin-anticheat-job-listing-mentions-future-support-fo...
2•exploraz•42m ago•1 comments

Should AI web agents skip sponsored/ad results by default?

2•ilyasJosef•50m ago•3 comments

TCXO Failure Analysis

https://serd.es/2026/03/06/TCXO-failure-analysis.html
1•zdw•51m ago•0 comments

Google Workspace CLI Removes MCP Support

https://github.com/googleworkspace/cli/pull/275
1•waf•59m ago•0 comments

Armed robots take to the battlefield in Ukraine war

https://www.bbc.com/news/articles/c62662gzlp8o
3•dabinat•59m ago•0 comments

Show HN: CC Pocket – Control Claude Code/Codex from Your Phone

https://github.com/K9i-0/ccpocket
1•k9i•1h ago•0 comments

Readeck 0.22 Released

https://readeck.org/en/blog/202602-readeck-22/
1•Curiositry•1h ago•0 comments

Evolving Languages Faster with Type Tailoring (2024)

https://lambdaland.org/posts/2024-07-15_type_tailoring/
1•PaulHoule•1h ago•0 comments

Grammarly is using our identities without permission

https://www.theverge.com/ai-artificial-intelligence/890921/grammarly-ai-expert-reviews
7•LordAtlas•1h ago•1 comments

I compiled a list of why you should be excited about std:SIMD and C++26

https://old.reddit.com/r/cpp/comments/1rjld1s/i_compiled_a_list_of_6_reasons_why_you_should_be/
1•ibobev•1h ago•0 comments

Mars MIPS Simulator in the Browser

https://mars.nfiles.top/
2•nenepbl•1h ago•1 comments

Ask HN: Is SWE mostly just calling APIs?

1•humbleharbinger•1h ago•6 comments