frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

How to Make a Good Terminal Bench Task

https://twitter.com/neversupervised/status/2035455298417430911
2•neversupervised•1h ago

Comments

neversupervised•1h ago
I've been a contributor and reviewer for terminal bench since last August, and this post is about what I've learned designing and reviewing tasks. The guidance is broadly applicable to anyone building an agentic benchmark.I would love feedback from the HN community.

Viral DOGE Deposition Videos Can Remain Online, Judge Rules

https://www.bloomberg.com/news/articles/2026-03-23/viral-doge-deposition-videos-can-remain-online...
1•toomuchtodo•14s ago•0 comments

OpenAI CEO Sam Altman Exits Helion Energy's Board

https://www.reuters.com/sustainability/boards-policy-regulation/openai-ceo-sam-altman-exits-helio...
1•guidoiaquinti•26s ago•0 comments

Cloudflare Details Upgrade to EPYC Turin for 2x Throughput, 50% Better Perf/Watt

https://www.phoronix.com/news/Cloudflare-Gen13-Server-Turin
1•speckx•42s ago•0 comments

Crib: Just Enough Devcontainers

https://fabiorehm.com/blog/2026/03/20/crib-just-enough-devcontainers/
1•TheTaytay•1m ago•0 comments

Housing Advocates Don't Always Get Along

https://www.insidephilanthropy.com/home/housing-advocates-dont-always-get-along-funders-should-pu...
1•viajante1882•2m ago•0 comments

The Mac screenshot tool for builders

https://www.lazyscreenshots.com/
2•abouelatta•6m ago•0 comments

SpaceX hits back at Amazon in orbital datacenter dispute

https://www.theregister.com/2026/03/23/spacex_amazon_orbital_datacenters/
2•flyaway123•8m ago•0 comments

Programatically exploring Linux /proc filesystem

https://noke3.substack.com/p/programatically-exploring-linux-proc
1•sinlesschip•9m ago•0 comments

Lc command – combines ls, cat, and nano – useful when you don't have home/end

1•codingblink•9m ago•1 comments

Show HN: I Made an Open Source Swarm IDE

https://nbardy.github.io/unleashd/
1•nbardy•13m ago•0 comments

Markd. – Open annotation for research papers

https://markd-tawny.vercel.app/
1•ahusha•14m ago•1 comments

Show HN: AI That Controls Cloudflare WAF, Stripe, and Supabase in Plain English

https://flarite.com/
1•flarite•14m ago•1 comments

LUMINA: LLM-Guided GPU Architecture Exploration via Bottleneck Analysis

https://arxiv.org/abs/2603.05904
1•matt_d•16m ago•0 comments

What I'm Learning from Aviation About Incident Preparedness

https://uptimelabs.io/articles/what-im-learning-from-aviation-about-incident-preparedness/
1•sylvainkalache•16m ago•0 comments

Language as the Architecture of General Intelligence in Humans and LLMs

https://philarchive.org/rec/HUDTOS
2•fraggler•20m ago•0 comments

We analyzed 134,000 legal AI interactions. Lawyers still win

https://haqq.ai/whitepaper/legal-ai-index
3•ai_lawyer•23m ago•1 comments

'The Karpathy Loop': 700 experiments, 2 days

https://fortune.com/2026/03/17/andrej-karpathy-loop-autonomous-ai-agents-future/
1•msolujic•24m ago•1 comments

Show HN: Pglens – Postgres MCP server that lets agents look before they query

https://github.com/janbjorge/pglens
1•jeeybee•25m ago•0 comments

Most complex cloud service dependency chain you've seen?

1•rfmoz•27m ago•0 comments

Show HN: LLMs battle it out trading futures

https://arena.dbj.is/
3•retrofuturism•30m ago•0 comments

LLM Proxy for Agent Containers

https://github.com/calebfaruki/tightbeam
2•kalib_tweli•30m ago•1 comments

A pharmacist lifestyle blogger: The 'alarming' civilian cost of war in Iran

https://www.bbc.com/news/articles/c3v6ld7lv9no
3•tartoran•31m ago•0 comments

Vibecoders Can't Build for Longevity

https://blog.d11r.eu/theory-building/
2•dominicq•34m ago•4 comments

Metasystemic

https://metasystemic.xyz
1•gdss•35m ago•0 comments

KR Pres excludes officials with multiple homes from real estate policymaking

https://www.koreatimes.co.kr/economy/policy/20260322/lee-excludes-officials-with-multiple-homes-f...
3•Teever•36m ago•0 comments

Firefox Adds Tab Notes

https://blog.mozilla.org/en/firefox/tab-notes/
6•pentagrama•37m ago•1 comments

Unity of Paradigms

https://alexalejandre.com/programming/unity-of-paradigms/
2•tosh•37m ago•0 comments

Show HN: Simple Routing – Low Cost Vehicle Routing APIs

https://www.simplerouting.io
1•tristanmk•38m ago•0 comments

Seeing Everything, Understanding Nothing (The Context Trap)

https://cutlefish.substack.com/p/tbm-406-seeing-everything-understanding
1•donutshop•42m ago•0 comments

Qt 6.11 Released

https://www.qt.io/blog/qt-6.11-released
1•jandeboevrie•45m ago•0 comments