news newest ask show jobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

I built an agent that breaks your AI agents before someone else does

https://fabraix.com/

3•zachdotai•1h ago

Comments

zachdotai•1h ago

AI agents break in ways traditional software doesn't. Logic bugs, reasoning failures, edge cases that manual testing and static benchmarks don't fully explore.

Nyx is an autonomous adversarial harness that probes your agents for vulnerabilities. Since agents are non-deterministic, it can be hard to find the gaps by just reading code. So it interacts with your AI agents in blackbox mode to surface issues across security, logic, and alignment at scale, before they reach users. It's also massively parallel by default

Instead of spending time writing static evals for the key failure modes of your AI agents, point Nyx at any system and it autonomously discovers failure modes that matter. It can typically find issues in under 10 minutes that manual audits take hours to surface.

This is early work and we know the methodology is still going to evolve. We would love nothing more than feedback from the community as we iterate on this.

natloz•1h ago

What happens if you point Nyx at itself, who breaks first!

zachdotai•1h ago

We're doing that internally to continuously improve our own agent and make it robust against adversarial attacks itself. We will release some insights about self-improvement soon!

If You Stop Hiring Juniors, Your Senior Engineers Own You

https://evalcode.com/posts/if-you-stop-hiring-juniors-your-seniors-own-you/

2•milkglass•1m ago•0 comments

The Shoe That Broke Running [video]

https://www.youtube.com/watch?v=pfIWxFIVP_Y

2•downboots•1m ago•0 comments

FreePG Project

https://freepg.org/

3•xeeeeeeeeeeenu•3m ago•0 comments

San Francisco, AI capital of the world, is an economic laggard

https://www.economist.com/finance-and-economics/2026/04/26/san-francisco-ai-capital-of-the-world-...

3•andsoitis•4m ago•0 comments

DevOps Is a Culture, Not a Team: What I've Learned Building at Scale

https://austinxyz.github.io/blogs/blog/2026/04/26/devops-at-scale

1•milkglass•4m ago•0 comments

What it's like to drive Route 66 in an EV

https://www.bbc.com/travel/article/20260424-what-its-like-to-drive-route-66-in-an-ev

1•mooreds•6m ago•0 comments

Have you tried Clean Architecture as foundation for your AI project?

28•esmelazy•7m ago•0 comments

Marx vs. the Robots (2017)

https://www.jstor.org/stable/45134296?seq=1

2•mooreds•8m ago•0 comments

Glyph: A sub-millisecond prompt-injection detector

https://github.com/enkryptai/glyph

2•divyanshusingh•8m ago•0 comments

RangeFlow: A different way to pick date ranges

https://rangeflow.raminmousavi.dev/

2•ramin2nt2•8m ago•0 comments

The Impacts of Parole Supervision

https://bfi.uchicago.edu/insights/the-impacts-of-parole-supervision/

3•mooreds•8m ago•0 comments

1,350 Days with Logseq

https://ianreppel.org/goodbye-logseq/

1•Brajeshwar•9m ago•0 comments

Show HN: Stop Destroying Your Charging Cables

https://www.bbc.com/future/article/20260421-your-bad-habits-are-destroying-your-charging-cables

3•wasimsk•10m ago•0 comments

Great Minds Should Not Think Alike, They Should Think Together

https://docs.eventsourcingdb.io/blog/2026/04/27/great-minds-should-not-think-alike-they-should-th...

2•goloroden•13m ago•0 comments

Why Start a Company Instead of Working in Aid

https://indevelopmentmag.com/exporters-without-borders-why-you-should-start-a-company-instead-of-...

2•paulpauper•15m ago•0 comments

Do these pictures prove tennis is dead?

https://bigthink.com/strange-maps/do-these-pictures-prove-tennis-is-dead/

1•paulpauper•16m ago•0 comments

Aging Gracefully in the Tech Industry

https://petersobot.com/blog/aging-gracefully-in-the-tech-industry/

2•itunpredictable•17m ago•0 comments

The case of missing American mushrooms

https://sftw.substack.com/p/the-case-of-missing-american-mushrooms

1•paulpauper•17m ago•0 comments

Browser as an Interactive Disassembly Exploration Tool (2015)

https://mrale.ph/blog/2015/03/29/browser-as-an-interactive-disassembler.html

1•downbad_•18m ago•1 comments

Moleskine's AI Lord of the Rings collection can only mock

https://cjleo.com/blog/moleskine-ai-lord-of-the-rings-collection-can-only-mock/

1•syx•18m ago•0 comments

The 1944 Warsaw Uprising, in Color

https://www.barwypowstania.pl/

3•keiferski•18m ago•0 comments

Chernobyl Wildlife Forty Years On

https://www.bbc.com/future/article/20260424-chernobyl-wildlife-forty-years-on

3•reconnecting•20m ago•0 comments

Show HN: Auge Vision from Your Terminal

https://auge.franzai.com/

3•franze•22m ago•1 comments

AeroJAX – Real-time, differentiable 2D CFD in Jax

https://github.com/arriemeijer-creator/AeroJAX

1•arriemeijer•23m ago•0 comments

Google Uses Cox Ruling to Kill Last Copyright Claim in Textbook Piracy Lawsuit

https://torrentfreak.com/google-uses-cox-ruling-to-kill-last-copyright-claim-in-textbook-piracy-l...

2•Cider9986•28m ago•0 comments

Mastermind – agentic SDLC workflow for VS Code

2•ArkadiuszSiAI•30m ago•0 comments

Andrej Karpathy: How I use LLMs [video]

https://www.youtube.com/watch?v=EWvNQjAaOHw

2•simonebrunozzi•30m ago•0 comments

Automated systematic literature review with Claude Code

https://www.youtube.com/watch?v=1K_4QqUlSBU

1•nialse•31m ago•0 comments

GitHub Actions and Consequences

https://tylercipriani.com/blog/2026/04/24/on-the-software-supply-chain-doom-spiral/

2•thcipriani•32m ago•0 comments

CAD in Codex

https://twitter.com/soft_servo/status/2047436911657025858

2•softservo•32m ago•1 comments