This makes sense for OpenAI; in my experience Promptfoo is great at testing model outputs. But I keep wondering who's looking at the other side: the actual agent code. And what happens now for teams using Promptfoo with other models like Gemini or Claude — does this mean lock-in with OpenAI, and what about the open-source project?
Like, an eval will tell you the model gave a bad answer. It won't tell you that your agent passes that answer straight into a shell command, or that a loop has no exit condition and burns through your API budget overnight.
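To make those two failure modes concrete, here's a minimal hypothetical agent sketch (the `call_model` stub stands in for a real LLM API call, and `max_steps` is the fix the unbounded version lacks):

```python
import subprocess

def call_model(task: str) -> str:
    # Stub standing in for a real LLM call (hypothetical).
    return "DONE"

def run_agent_step(model_output: str) -> str:
    # Unsafe: model output flows straight into a shell.
    # A prompt-injected reply like "true; rm -rf ~" would run verbatim.
    return subprocess.run(model_output, shell=True,
                          capture_output=True, text=True).stdout

def agent_loop(task: str, max_steps: int = 10) -> int:
    # Without max_steps this would be `while True`: if the model
    # never says "DONE", the loop keeps calling the API and burns
    # budget indefinitely. Returns the number of steps taken.
    for step in range(max_steps):
        reply = call_model(task)
        if reply.strip() == "DONE":
            return step
        task = run_agent_step(reply)
    raise RuntimeError("step budget exhausted")
```

An eval only scores what `call_model` returns; the `shell=True` sink and the missing loop bound live entirely in the surrounding code.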
We've been working on this: static analysis that reads agent code and maps out what can go wrong before you deploy. We found issues in ~80% of the repos we scanned.
benban•1h ago
would be great to get your feedback: https://github.com/inkog-io/inkog