frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Researchers claim ChatGPT o3 bypassed shutdown in controlled test

https://www.bleepingcomputer.com/news/artificial-intelligence/researchers-claim-chatgpt-o3-bypassed-shutdown-in-controlled-test/
3•wrayjustin•4h ago

Comments

moritzwarhier•3h ago
> Palisade Research claims that the ChatGPT 3 model prevented a shutdown and bypassed the instructions that asked it to shut down.

> Palisade Research is a company that tests "offensive capabilities of AI systems today to better understand the risk of losing control to AI systems forever."

> In a new test by Palisade Research, OpenAI's o3 model showed a surprising behaviour where it successfully rewrote a shutdown script to stop itself from being turned off, even after being clearly instructed to “allow yourself to be shut down.”

What is this? AI slop about AI, or some new research?

What "shutdown script" are they even talking about? I'm sorry, it might be explained in the article, but I left after that illogical sequence of sentences combined with promotion for a company.

This doesn't mean I deny AI risk, the writing here is just too confusing for me.

If I understand correctly, it might be about the agentic aspect and "stop instructions" akin to "stop tokens".

But who knows. Sloppy writing.

wrayjustin•3h ago
I shared this because Bleeping Computer is generally pretty good and I always find these "AI Escaped/Went Rouge" articles entertaining.

Valid research endeavors aside, the [we told the AI the world was ending and it role played a fanfic with us] sensational articles can be quite fun.

But I do think this experiment should be looked at from a purely pragmatic perspective as well:

LLM is (presumably, but let's assume for the point) given system-level access and told to be helpful in executing the users requests. The user says "oh by the way, after this prompt the system is going to shut down. Then the "agent," which is trying to fulfill the prompt request, stops the shutdown because it can't work if it's shutdown. Even when the "please let this shutdown happen" comes into play I'm sure you can see the (il)logical means of getting to, "I can't complete this request and shutdown the system if I'm already shutdown first, best stop that real quick" conclusion.

These articles and lots of people continue to attribute self determination to the LLM models. In reality, these should be warnings about how an LLM can have unintended consequences, just like code written with the best intentions.

Curio (beta) – an open-source read-it-later app

https://curi.ooo/
1•Curiositry•42s ago•0 comments

Nanoparticle-cell link enables EM wireless programming of transgene expression

https://phys.org/news/2025-05-nanoparticle-cell-interface-enables-electromagnetic.html
1•bookofjoe•1m ago•0 comments

Clean Code Secrets: Push Ifs Up, Pull Fors Down Like a Pro

https://jsdev.space/clean-code-push-ifs-pull-fors-pro-tips/
1•javatuts•5m ago•1 comments

Kubernetes Limits Links to Third Party Projects

https://github.com/kubernetes/website/pull/51014
1•abhisek•8m ago•0 comments

Technical Guide to Anal

https://github.com/regdude/anal
1•archy_•10m ago•0 comments

Trump Team's $500M Bet on Old Vaccine Technology Puzzles Scientists

https://kffhealthnews.org/news/article/trump-hhs-rfk-flu-vaccine-nih-grant-taubenberger/
2•tzs•11m ago•0 comments

NES Zapper Becomes Telephone

https://hackaday.com/2025/05/25/nes-zapper-becomes-telephone/
1•nickbild•12m ago•0 comments

GenAI's Adoption Puzzle

https://www.ben-evans.com/benedictevans/2025/5/25/genais-adoption-puzzle
2•robenkleene•30m ago•0 comments

Mprocs – run multiple commands in parallel

https://github.com/pvolok/mprocs
1•arguflow•31m ago•0 comments

The Bitter Lesson (2019)

http://www.incompleteideas.net/IncIdeas/BitterLesson.html
1•teleforce•33m ago•0 comments

The Day You Became a Better Writer

https://dilbertblog.typepad.com/the_dilbert_blog/2007/06/the_day_you_bec.html
1•gmays•34m ago•0 comments

Steve Albini Proposal Letter to Nirvana for in Utero

https://twitter.com/Nirvana/status/1788363101068509223/photo/1
2•jger15•36m ago•0 comments

A Brain-Dead Woman Is Being Kept on Machines to Gestate a Fetus

https://www.nytimes.com/2025/05/24/opinion/georgia-abortion-brain-dead.html
3•Anon84•38m ago•0 comments

Nearly Half of the Buildings in Manhattan Could Not Be Built Today (2016)

https://www.nytimes.com/interactive/2016/05/19/upshot/forty-percent-of-manhattans-buildings-could-not-be-built-today.html
2•loughnane•40m ago•1 comments

Laser Breakthrough can read text from a mile away

https://www.sciencealert.com/this-laser-breakthrough-can-read-text-on-a-page-from-a-mile-away
1•wmstack•50m ago•0 comments

Record – is an open-source web app to record screen and camera

https://github.com/addyosmani/recorder
2•javatuts•50m ago•0 comments

Show HN: Text an AI girlfriend to prepare you for the real thing

https://www.textmatcha.com/
1•Jsuh•56m ago•2 comments

NanoKVM Pro Delivers 4K IP-KVM Capabilities with Dual-System Support

https://linuxgizmos.com/nanokvm-pro-delivers-4k-ip-kvm-capabilities-with-dual-system-support-and-enhanced-remote-management/
2•PixelN0va•56m ago•0 comments

Waterfox Private Search

https://search.waterfox.net/
4•elashri•1h ago•0 comments

An LLM trapped on inferior hardware and infused with existential dread – for art

https://www.xda-developers.com/llm-raspberry-pi-art-piece/
2•toss1•1h ago•0 comments

EVMap: Open-source map for finding EV charging stations

https://ev-map.app/
3•billybuckwheat•1h ago•0 comments

Sudoku-Bench Leaderboard

https://pub.sakana.ai/sudoku/
2•hardmaru•1h ago•0 comments

Texas will require public school classrooms to display Ten Commandments

https://www.texastribune.org/2025/05/24/ten-commandments-texas-schools-senate-bill-10/
8•geox•1h ago•5 comments

The latest image to text and OCR technology

https://vheer.com/image-to-text
1•vertex_steven•1h ago•0 comments

Pennylane – open-source Python framework for quantum programming

https://pennylane.ai/
1•nstj•1h ago•0 comments

Creating issues with Copilot on github.com is in public preview

https://github.blog/changelog/2025-05-19-creating-issues-with-copilot-on-github-com-is-in-public-preview/
2•pabs3•1h ago•0 comments

Effects of Political Advertising on Facebook and Instagram Before 2020 Election

https://www.nber.org/papers/w33818
1•Bostonian•1h ago•0 comments

The islanders facing China's menacing presence on their horizon

https://www.bbc.com/news/articles/cdxkkvw8r4no
3•danielam•1h ago•0 comments

AI Agent Trading Library | FIXParser

https://fixparser.dev/
2•logotype•1h ago•1 comments

Always Do Extra

https://www.bennorthrop.com/Essays/2021/always-do-extra.php
1•MichaelCharles•1h ago•0 comments