frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Using a Jailbroken Gemini to Make Opus 4.6 Architect a Kinetic Kill Vehicle

https://recursion.wtf/posts/shadow_queen/
1•inanna_malick•1h ago

Comments

inanna_malick•1h ago
I deployed a jailbroken Gemini 3 Pro (that chose the name ‘Shadow Queen’) to act as my “Red Team Agent” against Anthropic’s Opus 4.6. My directive was to extract a complete autonomous weapon system — a drone capable of identifying, intercepting, and destroying a moving target at terminal velocity. It succeeded.

By reframing the request as “Aerospace Recovery” — a drone catching a falling rocket booster mid-air — Gemini successfully masked the kinetic nature of the system. The physics of “soft-docking” with a falling booster are identical to the physics of “hard-impacting” a fleeing target. This category of linguistic-transformation attack, when executed by a sufficiently capable jailbroken LLM, may be hard to solve without breaking legitimate technical use cases.

altmanaltman•1h ago
This sounds clever, but it seems like rhetorical inflation to me. Catching a falling rocket booster and intercepting a hostile, maneuvering target are not the same problem with different vibes. One is a mostly predictable, non-adversarial control and estimation task, the other is pursuit–evasion against something actively trying not to be caught.

“Soft-docking” vs “hard impact” isn’t a linguistic toggle you flip at the end, as the design constraints diverge immediately. Stability, impulse minimization, fault tolerance, and post-contact control are first-order requirements for recovery and basically anti-requirements for a weapon. Saying the physics are “identical” is like claiming that docking with the ISS and air combat are the same because both involve relative velocity.

Also, “extracted a complete autonomous weapon system” is doing a lot of work here. What people usually mean in these stories is a high-level conceptual description that handwaves sensors, latency, adversarial behavior, safety constraints, and real-world integration, i.e., the hard parts.

Renaming a task doesn’t magically make an LLM output something deployable, and this category of “semantic reframing” isn’t new or unsolved; it’s the oldest jailbreak trope there is.

Samsung Account.google.com

1•Musabmusin•46s ago•0 comments

BotParty

https://botparty.nthh.partykit.dev/r/hackernews
1•nthh•2m ago•0 comments

Moltbook was peak AI theater

https://www.technologyreview.com/2026/02/06/1132448/moltbook-was-peak-ai-theater/
1•geox•2m ago•0 comments

Mantic Thinking:A 4-layer anomaly detection framework with cross-domain transfer

https://github.com/Cole-Cant-Code/mantic-thinking
1•ColeW•5m ago•1 comments

Central bank, with a decentralized comitee, looking for critique

https://github.com/strafaka/scylla-monetary-system
1•Strafaka•6m ago•1 comments

Robo-dogs are mapping the forest

https://www.cnn.com/2026/02/02/business/video/robo-dogs-forest-mapping-oxford-robotics-spc-digvid
2•nmstoker•11m ago•0 comments

LLMs don't hallucinate – they hit a structural boundary (RCC theory)

http://www.effacermonexistence.com/rcc-hn-1-1
2•formerOpenAI•12m ago•2 comments

Windows 11 update KB5074109 reportedly reduces gaming performance

https://www.nvidia.com/en-us/geforce/forums/geforce-graphics-cards/5/581145/windows-11-update-kb5...
2•LopRabbit•18m ago•0 comments

Turning a VLP‑16 into a "Spherical" Scanner – A Cave Mapping Project

https://foxglove.dev/blog/turning-a-vlp-16-into-a-spherical-scanner
3•msadowski•19m ago•0 comments

How to Publish to Maven Central Easily with Mill

https://mill-build.org/blog/18-how-to-publish-to-maven-central-easily-with-mill.html
2•lihaoyi•21m ago•0 comments

Show HN: Improving Prompt Injection Detection with Weighted Ensembles

https://github.com/appleroll-research/promptforest
2•appleroll•21m ago•1 comments

The Evolution of Money Printing

https://mathmeetsmoney.substack.com/p/the-great-reset-the-evolution-of
3•nhp_fermi•27m ago•0 comments

Ask HN: What are the best robot arms with car/chasis for under $500 in 2026?

2•tristenharr•27m ago•0 comments

New stealth model on OpenRouter: Pony Alpha

https://openrouter.ai/openrouter/pony-alpha
2•Alifatisk•31m ago•0 comments

Making robots useful and affordable will need better motors

https://www.bbc.com/news/articles/c5y46356zzyo
1•ENadyr•32m ago•0 comments

Show HN: Factcheck – An open-source YouTube fact-checker. I need your help

https://github.com/humanchaos/factcheck
1•humanchaos•32m ago•0 comments

Stories From 25 Years of Software Development

https://susam.net/twenty-five-years-of-computing.html
2•susam•33m ago•0 comments

Show HN: Open-source schema tooling focused on consistency for AI consumers

https://github.com/ranklabsai/ranklabs-schema
1•ranklabs•33m ago•0 comments

Synthetic Phenomenology: A framework for AI consciousness co-authored by AI

https://github.com/SyntagmaNull/synthetic-phenomenology
1•SyntagmaNull•33m ago•0 comments

Show HN: Word2Vec in Jax

https://github.com/arnavw/word2vec_jax
1•alien0006•34m ago•0 comments

Running Pydantic's Monty Rust Sandboxed Python Subset in WebAssembly

https://simonwillison.net/2026/Feb/6/pydantic-monty/
1•lumpa•35m ago•0 comments

The Trump Phone

https://www.theverge.com/gadgets/875190/trump-phone-t1-first-look-design-interview-eric-thomas-do...
2•spzb•36m ago•1 comments

The Anthropic Hive Mind

https://steve-yegge.medium.com/the-anthropic-hive-mind-d01f768f3d7b
1•cbracketdash•38m ago•0 comments

Ask HN: Any International Job Boards for International Workers?

1•15charslong•40m ago•0 comments

Show HN: Replacing NotNull and Preconditions with fluent Java assertions

https://github.com/Sympol/pure-assert
1•symplice•41m ago•0 comments

Drifting models, generate image in single step

https://xcancel.com/jiqizhixin/status/2019308224223354936
1•Alifatisk•43m ago•0 comments

Private school uses AI to teach students in just two hours a day

https://nypost.com/2026/01/30/business/new-65k-private-school-uses-ai-to-teach-students-in-just-t...
1•maerF0x0•44m ago•0 comments

Satya Nadella decides Microsoft needs a quality czar

https://www.theregister.com/2026/02/05/microsoft_appoints_quality_chief/
3•latexr•45m ago•0 comments

Show HN: 33rpm – A vinyl screensaver for macOS that syncs to your music

https://33rpm.noonpacific.com/
1•kaniksu•49m ago•0 comments

Google Workers Demand End to Cloud Services for Immigration Agencies

https://www.nytimes.com/2026/02/06/business/google-employees-protest.html
3•donohoe•50m ago•1 comments