frontpage.

No, it doesn't cost Anthropic $5k per Claude Code user

https://martinalderson.com/posts/no-it-doesnt-cost-anthropic-5k-per-claude-code-user/
1•jnord•1m ago•0 comments

Love in the Time of A.I. Companions

https://www.newyorker.com/magazine/2026/03/16/love-in-the-time-of-ai-companions
1•petethomas•3m ago•0 comments

Helios: Real Real-Time Long Video Generation Model

https://www.alphaxiv.org/abs/2603.04379
2•tzury•4m ago•0 comments

PRX Part 3 – Training a Text-to-Image Model in 24h

https://huggingface.co/blog/Photoroom/prx-part3
1•gsky•5m ago•0 comments

Open-source software could be excluded from Colorado age verification bill

https://twitter.com/carlrichell/status/2031125624711164182
1•flaburgan•11m ago•0 comments

Show HN: Hacker News Focus Comments Reader

https://chromewebstore.google.com/detail/hn-focus-reader/ibhipggecnholemnbahigagpgifkphac
1•betimd•13m ago•0 comments

The emerging role of SRAM-centric chips in AI inference

https://gimletlabs.ai/blog/sram-centric-chips
1•gmays•13m ago•0 comments

Simradar21

https://simradar21.com/
1•sssilver•14m ago•0 comments

Amid wave of kids' online safety laws, age-checking tech comes of age

https://www.reuters.com/legal/litigation/amid-wave-kids-online-safety-laws-age-checking-tech-come...
1•petethomas•14m ago•0 comments

M5 Max: Chiplets, Thermals, and Performance per Watt

https://creativestrategies.com/research/m5-max-chiplets-thermals-and-performance-per-watt/
3•zdw•14m ago•0 comments

Agentis – An AI-native programming language where the LLM is the stdlib

https://github.com/Replikanti/agentis
1•ylohnitram•16m ago•1 comments

iOS 26.4's new setting lets you disable another Liquid Glass effect

https://9to5mac.com/2026/03/09/ios-26-4s-new-setting-lets-you-disable-another-liquid-glass-effect/
2•latexr•16m ago•1 comments

Show HN: Free AI resume tailor I built after a recent layoff (300+ users so far)

https://jobbi.app/
1•djrnz•16m ago•0 comments

Closing the verification loop, Part 2: autonomous optimization

https://www.datadoghq.com/blog/ai/fully-autonomous-optimization/
1•chrisra•18m ago•1 comments

From Tool to Employee: What Claude Code's /Loop Means

https://aieatingsoftware.substack.com/p/from-tool-to-employee-what-claude
1•sidsarasvati•19m ago•0 comments

Reversing Russian spyware I installed on my iPhone [video]

https://www.youtube.com/watch?v=XQvZ2mLnZVI
1•todsacerdoti•20m ago•0 comments

Agentic development environment extension taxonomy

https://droctothorpe.github.io/adeet/
1•droctothorpe•20m ago•1 comments

Worldwide Sidewalk Joy: Adding whimsy to neighborhoods

https://worldwidesidewalkjoy.com
3•NaOH•21m ago•1 comments

10K Curl Downloads per Year

https://daniel.haxx.se/blog/2026/03/09/10k-curl-downloads-per-year/
1•donutshop•21m ago•0 comments

Superpowers 5

https://blog.fsck.com/2026/03/09/superpowers-5/
2•arittr•25m ago•0 comments

Show HN: Git Trophy – 3D print your GitHub contribution graph

https://git-trophy.com/
1•Lukabuz•25m ago•0 comments

Trump is heading for a hard reckoning over Iran

https://spectator.com/article/trump-is-heading-for-a-hard-reckoning-over-iran/
4•leiftw•26m ago•1 comments

Reinforcement fine-tuning use cases

https://developers.openai.com/api/docs/guides/rft-use-cases/
1•teleforce•26m ago•0 comments

Bromure: An ephemeral browser that runs in a disposable virtual machine on macOS

https://github.com/rderaison/bromure
1•felineflock•26m ago•0 comments

QuickTERMINAL – A 10k-line single-file terminal emulator for macOS

https://github.com/LEVOGNE/quickTerminal
1•LEVOGNE•27m ago•1 comments

Sir Tony Hoare has died

http://lefenetrou.blogspot.com/2026/03/in-memoriam-tony-hoare.html
55•nextos•28m ago•14 comments

JavaScript with a native Rust host game engine. Built for vibe coding

https://github.com/Aura-Industry/auramaxx
1•chiubaca•29m ago•0 comments

Why right-wing media can't stop Candace Owens

https://www.salon.com/2026/03/04/why-right-wing-media-cant-stop-candace-owens/
1•tzs•32m ago•0 comments

How long do electric vehicle batteries last?

https://www.npr.org/2026/03/02/nx-s1-5706658/electric-vehicle-battery-lifespan
3•tzs•36m ago•0 comments

A Modular Computer That's Bringing Back Analog

https://www.hackster.io/news/a-modular-computer-that-s-bringing-back-analog-e02f07df7bf6
1•todsacerdoti•39m ago•0 comments

Show HN: LOAB – AI agents get decisions right but skip the process [pdf]

https://github.com/shubchat/loab/blob/main/assets/loab_paper_mar2026.pdf
1•shubh-chat•1h ago
LOAB is an open-source benchmark for evaluating whether AI agents can follow regulated lending processes, not just produce the right final answer.

The motivation is simple: in mortgage lending, regulators don't care if you got the right answer. They care whether you followed the right process. Skip a KYC check, pull a credit bureau report before getting privacy consent, or approve a loan without the required policy lookup, and that's a compliance failure even if the outcome was correct. Current AI benchmarks don't measure this. They evaluate what the agent decided, not how it got there.

LOAB simulates a fictional Australian lender with mock regulatory APIs, multi-agent roles mirroring real bank operations, and a five-dimension scoring rubric derived from actual lending law. A run only passes if the outcome is correct AND the process was correct.

The main finding: frontier models achieve 67-75% outcome accuracy, but only 25-42% when you also require process compliance. It's surprisingly hard to get AI to follow a prescribed sequence of steps even when it clearly "knows" the right answer.
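
To make the "outcome AND process" rule concrete, here is a minimal Python sketch of that kind of gate. It is not LOAB's actual scorer: the step names (`kyc_check`, `privacy_consent`, `credit_bureau_pull`, `policy_lookup`, `approve_loan`), the `Run` type, and both helper functions are hypothetical, and the five-dimension rubric is not modelled. It only encodes the three failure examples mentioned above.

```python
from __future__ import annotations

from dataclasses import dataclass

# Hypothetical step names for illustration; LOAB's real tool names may differ.
REQUIRED_STEPS = ["kyc_check"]

# (earlier, later): if `later` appears in the trace, `earlier` must precede it.
ORDER_RULES = [
    ("privacy_consent", "credit_bureau_pull"),
    ("policy_lookup", "approve_loan"),
]


@dataclass
class Run:
    trace: list[str]  # ordered steps the agent actually performed
    decision: str     # the agent's final decision
    expected: str     # the ground-truth decision


def process_violations(trace: list[str]) -> list[str]:
    """Return human-readable process violations found in a trace."""
    violations = []
    for step in REQUIRED_STEPS:
        if step not in trace:
            violations.append(f"missing required step: {step}")
    for earlier, later in ORDER_RULES:
        if later in trace:
            first_later = trace.index(later)
            # `earlier` must occur somewhere before the first `later`.
            if earlier not in trace[:first_later]:
                violations.append(f"{later} without prior {earlier}")
    return violations


def run_passes(run: Run) -> bool:
    """A run passes only if the outcome AND the process are both correct."""
    outcome_ok = run.decision == run.expected
    return outcome_ok and not process_violations(run.trace)


if __name__ == "__main__":
    # Right decision, but the bureau report was pulled before consent.
    bad = Run(
        trace=["kyc_check", "credit_bureau_pull", "privacy_consent",
               "policy_lookup", "approve_loan"],
        decision="approve",
        expected="approve",
    )
    print(run_passes(bad), process_violations(bad.trace))
```

Under these assumptions, a run with the correct decision still fails as soon as one ordering rule is broken, which is the gap between the outcome-only and outcome-plus-process numbers quoted above.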