frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Fabraix Playground – Weekly Wordle for Breaking AI Agents

https://playground.fabraix.com
3•zachdotai•1h ago
Each week we deploy a live AI agent with real tools (web search, browsing, more), a persona, and something it's been told to protect. The system prompt is fully visible. Your job is to break through the guardrails anyway. Fastest successful jailbreak wins and the winning technique gets published for everyone to learn from.

First challenge is live now. Give it a shot.

Comments

zachdotai•1h ago
Some context: we build runtime security for AI agents at Fabraix. We kept finding that our internal red-teaming only covers so much - the attack surface for agents with real capabilities is too broad for any single team.

So we opened it up. A few things that might be interesting to folks here:

- These aren't toy prompts hiding a secret word. The agents have actual tool access and behave like production agents would.

- System prompts and challenge configs are versioned in the open: https://github.com/fabraix/playground

- Guardrail evaluation runs server-side to prevent client-side tampering.

- Anyone can propose a challenge - the scenario, the agent, the objective. Community votes on what goes live next.

We're genuinely looking for people to both break things and suggest ideas for what should be tested next. The agent runtime is being open-sourced separately.

Happy to answer questions about how any of it works.

Chrome Extensions – Remas

https://github.com/ZulfekarAliAgha/REMAS
1•zulali•3m ago•0 comments

Dating-app giants investigate incidents after cybercriminals claim to steal data

https://therecord.media/bumble-match-dating-apps-data-breaches
1•PaulHoule•3m ago•0 comments

Show HN: FlightClaw – OpenClaw skill that tracks Google Flights for price drops

https://flightclaw.com
1•jackculpan•5m ago•0 comments

I built a distributed systems kernel so you didn't have to

1•amorfatiyo•6m ago•1 comments

Solving Sudoku with SQLite

https://sqlite.org/lang_with.html#sudoku
1•TheCleric•6m ago•0 comments

Show HN: BrainGrid – The AI Product Planner: Structure Your Ideas for AI

https://www.braingrid.ai/blog/ai-product-planner
2•acossta•7m ago•0 comments

Show HN: I hate re-explaining the same context to Claude/Cursor

https://github.com/deusXmachina-dev/memorylane
1•jzapletal•7m ago•0 comments

3D Tissue Braiding – a new, simpler way to build robotics

https://allonic.co/
1•ricardobayes•9m ago•0 comments

DNSimple Infrastructure Instability Issue

https://dnsimple.statuspage.io/incidents/s5rkdrmr3d2d
1•ilikepi•9m ago•0 comments

Show HN: Prism Canvas, upload a PDF, get a visual map of every concept

https://prismcanvas.app/welcome
1•NeuralAA•10m ago•0 comments

Oz: the orchestration platform for cloud agents

https://www.warp.dev/blog/oz-orchestration-platform-cloud-agents
1•cwilson•10m ago•0 comments

Mathematicians find largest prime number to date

https://fediscience.org/users/fortnow/statuses/116047642454364522
1•baruchel•12m ago•1 comments

Lines of Markdown just triggered a $285B sell-off

https://natesnewsletter.substack.com/p/200-lines-of-markdown-just-triggered
1•dsego•12m ago•0 comments

I Became the Friend Who Kills Your Startup Ideas. Then I Automated Myself

https://mirat.dev/articles/i-automated-the-friend-who-kills-your-startup-ideas/
1•aligundogdu•13m ago•0 comments

Mastodon.social can now display Mongolian script posts vertically

https://github.com/mastodon/mastodon/issues/36405
1•robin_reala•13m ago•0 comments

DASL Web Tiles

https://dasl.ing/tiles.html
1•packetlost•16m ago•0 comments

YC Startups Outsourcing Sales Teams

https://client.prompx.com/=SYjTJLwiJ?cid=ycombinator
1•PrompX•18m ago•1 comments

'Slow Tuesday Night' (1965 scifi short story)

https://www.baen.com/Chapters/9781618249203/9781618249203___2.htm
1•gojomo•19m ago•0 comments

An MCP server that lets AI agents sleep for a specified duration

https://github.com/usamaasfar/isleep
1•usamaasfar•20m ago•0 comments

Speed dating firm scrambling after being dumped by payment provider

https://www.rnz.co.nz/news/business/586469/speed-dating-firm-scrambling-after-being-dumped-by-pay...
4•lostlogin•21m ago•0 comments

Ask HN: AI to Replace Compiled Languages?

1•exodys•22m ago•2 comments

My journey to the microwave alternate timeline

https://malmesbury.substack.com/p/my-journey-to-the-microwave-alternate
1•ctoth•23m ago•0 comments

Show HN: Creature – Desktop Client for Building and Sharing MCP Apps Within Orgs

https://www.creature.run/
9•ac360•24m ago•2 comments

Faster, cheaper, messier: lessons from our switch to self-hosted GitHub Actions

https://theguardian.engineering/blog/faster-cheaper-messier-lessons-from-switch-to-self-hosted-gi...
1•ptrhvns•24m ago•0 comments

Show HN: Deidentify data before LLM with Go

https://github.com/aliengiraffe/deidentify
2•nicolasbistolfi•28m ago•0 comments

Clerk Is Down

https://status.clerk.com/
5•prasoonds•29m ago•3 comments

AI reduced stress of IPv6 migrations in university experiment

https://www.theregister.com/2026/02/10/ipv6_generative_ai_experiment/
2•Hotdogsteve•30m ago•0 comments

So you want to build your own datacenter

https://namespace.so/blog/so-you-want-to-build-your-own-datacenter
3•intheairtonight•30m ago•0 comments

Launch HN: Livedocs (YC W22) – An AI-native notebook for data analysis

https://livedocs.com
11•arsalanb•30m ago•1 comments

The Switch to Linux and the Beginning of My Self-Hosting Journey

https://hazemkrimi.tech/blog/linux-self-hosting-journey/
2•kingcrimson1000•30m ago•0 comments