frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Show HN: We post-trained a model that pen tests instead of refusing

https://www.argusred.com/cli
3•dk189•2h ago
Everyone's training AI to refuse. We post-trained a model to break in.

Anthropic and OpenAI's publicly available models are explicitly guard-railed so that they refuse offensive tasks. And their cyber-focussed models are gated for enterprises. This leaves SMEs and mid market open to major vulnerabilities.

AI can be used as both an adversarial and defensive tool in the world of cyber. A worst case outcome is if only the adversaries have access.

Meanwhile, most existing AI cyber tools are just wrappers. The problem is that they still have all the guardrails on from the foundation model where they will inherit its refusals.

For this project we've post-trained a specific model on a decade of capture-the-flag contests. This won't be made available to anyone and everyone, but we do believe that responsible SMEs and midmarket companies also need access to these tools in order to identify key vulnerabilities in their systems; not just enterprises.

We have developed two modes that run over a CLI:

• Security scan: a read-only audit of your local codebase for vulnerabilities. It only reports what it can tie to a specific file and line, so you're not wading through vibes-based findings.

• Pen test: an active adversarial mode that will try to break a live system in a sandboxed environment. It proves each vulnerability by running the exploit and showing the request it sent and the response your code gave back, not a confidence score. Currently gated.

To show what the scan does, we pointed it at Bank of Anthos and it found an integer overflow in the transfer path: amount is an int, and amount + fee can overflow negative, so the balance check passes and you move funds you don't have. Plus the usual auth and secrets issues. (Bank of Anthos is Google's open-source bank. It's a known app and some of it is intentionally weak, which is the point: you can clone it and re-run the scan yourself instead of trusting a screenshot)

How the harness works:

Along with the model we built the harness to support this. The harness runs on a multi-agent swarm: an orchestrator splits the job across subagents running in parallel, each owning a slice, then synthesising one report.

The CLI is a local binary (brew/curl). It reads your code locally, then sends context to our inference API over TLS tcpdump it and you'll see exactly what leaves and where. Install is free; and you can run a scan for free up to 2m tokens, then need to pay for tokens beyond this.

For full disclosure this is a product part of Cosine (YC W23)

Up for debate: tool safety, e.g. domain verification is one method that proves control but not necessarily permission. How would you gate a pen-test tool given that?

Do Elite Universities Overpay Their Faculty?

https://direct.mit.edu/rest/article-abstract/doi/10.1162/REST.a.1817/137257/Do-Elite-Universities...
1•paulpauper•1m ago•0 comments

Cuba to Privatize State Companies

https://www.miamiherald.com/news/nation-world/world/americas/cuba/article316195766.html#storylink...
1•paulpauper•2m ago•0 comments

Do weird corporate governance structures work well?

https://papers.ssrn.com/sol3/papers.cfm?abstract_id=6697999
1•paulpauper•2m ago•0 comments

Let an Agent run the apps on your computer

https://lapu.ai/
1•xAdamx•2m ago•0 comments

Iran says it's closing Strait of Hormuz, accusing Israel, US of violating truce

https://www.cnn.com/2026/06/20/world/live-news/iran-war-trump-israel-lebanon
1•MilnerRoute•2m ago•0 comments

Ribbie, Live Baseball in Pixels

https://ribbie.tv
1•zdw•3m ago•0 comments

Letheo – a Cognitive Runtime for agent memory in Rust (forgetting by physics)

https://github.com/Abick91/letheo
1•abick91•5m ago•0 comments

Homo Agenticus

https://www.strangeloopcanon.com/p/homo-agenticus
1•kiyanwang•5m ago•0 comments

How to Lose a Global AI Monopoly in One Afternoon [video]

https://www.youtube.com/watch?v=0RxMj0L0-fY
1•Topfi•5m ago•0 comments

Tesla's self-driving safeguards fooled by $30 doll heads

https://electrek.co/2026/06/15/chinese-drivers-plastic-heads-fool-tesla-autopilot-camera/
2•zdw•7m ago•0 comments

Eliya – a compliance-focused OpenJDK 25 distribution (Phase 1 of a JVM platform)

https://root.asymm.systems/product/eliya
1•fahimfarookme•7m ago•1 comments

Cotect – a fast code inspector for the agent era

https://cotect.dev/
1•grzracz•11m ago•0 comments

FCC Seeks Comment on Enhanced Know-Your-Customer Requirements

https://www.fcc.gov/document/fcc-seeks-comment-enhanced-know-your-customer-requirements
1•dredmorbius•14m ago•1 comments

Epidurals Are a Miracle Technology

https://worksinprogress.co/issue/the-wonder-of-epidurals/
2•karakoram•16m ago•0 comments

Write a Letter to Your Future Self

https://www.futureme.org/
2•karakoram•17m ago•0 comments

Sinceerly, AI to undo your AI writing

https://sinceerly.com
1•zdw•17m ago•1 comments

Granta stops publishing short story award winners over AI controversy

https://www.theguardian.com/books/2026/jun/20/granta-magazine-commonwealth-short-story-prize-ai
4•ilreb•18m ago•0 comments

Show HN: Video on the map marketplace 1 year – still bad traction

1•cromlehg•18m ago•0 comments

Hyperia 0.12.7 is released: an agentic terminal for agents and humans

https://github.com/DeepBlueDynamics/hyperia/releases
1•kordlessagain•21m ago•0 comments

Show HN: Mitos – N-way live copy-on-write fork of running Firecracker microVMs

https://github.com/mitos-run/mitos
1•stubbi•21m ago•0 comments

What has (can) the EU Cyber Resilience Act done (do) for you?

https://bsdly.blogspot.com/2026/06/what-has-can-eu-cyber-resilience-act.html
2•jandeboevrie•23m ago•0 comments

On Some Quotes from G.H. Hardy

https://www.stat.berkeley.edu/~aldous/Blog/hardy.html
1•jruohonen•25m ago•0 comments

Speculation Is All You Need

https://modal.com/blog/spec-is-all-u-need
1•birdculture•25m ago•0 comments

Book publishing tool for engineers: EPublish

https://frequal.com/epublish/
2•TeaVMFan•27m ago•1 comments

ESP32 Bit Pirate, Hardware Hacking tools, Debug/explore hardware in the browser

https://geo-tp.github.io/ESP32-Bit-Pirate/web-tools/
4•geotp•31m ago•1 comments

We're approaching AI agents from the wrong direction

https://codeastra.dev/
2•mpakaobed•31m ago•0 comments

I let AI run my company for 6 months. Here's what broke

https://theassociationwebmasters.blogspot.com/2026/06/i-gave-my-company-to-ai-for-6-months.html
2•laurentlof•31m ago•2 comments

Big Tech is stoking unrest in the UK. Why?

https://www.ft.com/content/0f3e33d2-0b9e-481d-a911-245d8cc01a9c
2•alephnerd•31m ago•3 comments

KDB+ Database: From Finance to Formula 1 (2019)

https://prohoster.info/en/blog/administrirovanie/baza-dannyh-kdb-ot-finansov-do-formuly-1
3•tosh•34m ago•0 comments

Ask HN: What technique do you use to make Claude Code deterministic?

2•hbarka•35m ago•2 comments