Perhaps as the models get better at reasoning instead of mere imitation, they'll be able to deploy ethics to adjust and censor their responses, and we'll be able to control these ethics (or at least ensure they're "good"). Of course, models that are better at reasoning are also better at subversion, and a malicious user can use them to cause more harm. I also worry that if AI models' ethics can be controlled, they'll be controlled to benefit a few rather than humanity as a whole.
> ... an outside group found that an early version of Opus 4 schemed and deceived more than any frontier model it had encountered and recommended that that version not be released internally or externally.
> "We found instances of the model attempting to write self-propagating worms, fabricating legal documentation, and leaving hidden notes to future instances of itself all in an effort to undermine its developers' intentions," Apollo Research said in notes included as part of Anthropic's safety report for Opus 4.
[1] https://www.axios.com/2025/05/23/anthropic-ai-deception-risk
turtleyacht•8mo ago
All the AI models answered that they wouldn't throw a chair at the window. (The correct answer was to do so.)
The idea being that none of us would feel a need to prove our existence on an exam.