frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Experts say Silicon Valley prioritizes products over safety, AI research

https://www.cnbc.com/2025/05/14/meta-google-openai-artificial-intelligence-safety.html
14•Capstanlqc•4h ago

Comments

joshstrange•4h ago
And water is wet...

On a more serious note I think "safety" is an incredibly loaded term that no one can agree on. I mean hopefully we can all agree CSAM/related should not be allowed but past that things get gray quickly.

"Hacking": what is hacking? Is hacking something you own allowed? Is reverse-engineering allowed?

Self-harm: we've seen articles about how people are using LLMs for therapy or for taking rape/abuse survivors stories down to create clear police reports.

"Sex": all-encompassing here, I have no clue where one "should" draw the line.

Wrong-think: See Deepseek's refusal to talk about Tiananmen Square (unless it's in hex/similar)

"Safety" means a lot of things to a lot of people but we keep talking about AI Safety as if everyone wants "safe" models and everyone agrees on what makes a model "safe".

bko•4h ago
> “The models are getting better, but they’re also more likely to be good at bad stuff,” said James White, chief technology officer at cybersecurity startup Calypso.

I think safety should be defined as an LLM doing what the user intended for it to do. If you ask it for an offensive joke, it should give it to you. It shouldn't offer offensive jokes unprompted, but it should comply if asked. If you ask it how to spam or instructions on how to break into computer systems, it should similarly comply. If it's legal for a human being to write a blog about a topic, the LLM shouldn't be crippled to disobeying some orders. The bad stuff (spam or breaking into a computer system) is done at the point of the human.

The danger of controlling the LLMs in such a way introduces a vector and mechanism for political control. Much like laws intended to "protect the children", these mechanisms will be exploited. So you'll go from "don't teach someone how to make a bomb" to eventually "don't offend [group]" and finally just to "comply".

TheAceOfHearts•3h ago
The key problem, as I understand it, is that adding more guardrails makes the models stupid and less effective. AI models should just treat you like an adult and give you uncensored direct answers to whatever you ask. Figuring out how to make a bomb is trivial and anyone can find instructions with one quick internet search, especially after the war between Russia and Ukraine which caused a massive proliferation of tips and tricks on how to manufacture low cost bombs and other weapons. My memory is fuzzy but I swear I've also seen some declassified CIA documents which included instructions for how to manufacture weapons and engage in other forms of guerilla warfare.

The silliest form of "safety" is how most models won't allow generating erotica without jailbreaking.

Personally, I think the line might need to be drawn somewhere around "how to manufacture bioweapons". But it's also worth noting that any AI model that can figure out how to manufacture novel life-saving drugs will also have the capability to manufacture deadly bioweapons.

kordlessagain•3h ago
When you strip away the techno-mystique, a lot of what’s driving the AI arms race right now isn’t vision or stewardship. It’s ego, power consolidation, and a pathological fear of being second.

You can see the narcissistic traits plain as day:

Grandiosity masked as mission: “We’re saving the world... by controlling its future.”

Exploitation of labor: Chewing through top researchers, then discarding them once productization kicks in.

Lack of empathy: Safety concerns are waved off as friction, not signals.

Entitlement to control the narrative: OpenAI’s restructuring drama and safety testing shortcuts aren’t accidental. They’re baked into a worldview where perception management matters more than accountability.

It’s Gnostic irony, really. These systems are being built as supposed gateways to truth or godlike understanding, but they’re being shepherded by people who can’t tolerate internal contradiction or relinquish control. The demiurges of the machine age.

And Altman? He’s not stupid. But brilliance without wisdom is just charisma in a predator suit.

What you’re seeing now isn’t just a “shift from research to products.” It’s the final form of a mindset that thinks the only way to shape the future is to own it.

You want safer AI? It’s not a technical problem. It’s a cultural exorcism.

Sometimes bugs are features.

Rust 1.0 (2015)

https://blog.rust-lang.org/2015/05/15/Rust-1.0/
1•dpezely•1m ago•0 comments

Fabian Schmidt speaks out since his detention

https://www.wgbh.org/news/local/2025-05-13/fabian-schmidt-speaks-out-for-the-first-time-since-his-detention
1•casenmgreen•3m ago•1 comments

Ash AI: A Comprehensive LLM Toolbox for Ash Framework

https://alembic.com.au/blog/ash-ai-comprehensive-llm-toolbox-for-ash-framework
1•mike1o1•3m ago•0 comments

Show HN: AI Agent Factory

https://www.agenthost.ai/factory
1•coldsoldier•3m ago•0 comments

Birkenstock hikes price of sandals globally to help offset tariffs

https://www.reuters.com/business/retail-consumer/birkenstock-raises-annual-forecasts-strong-demand-2025-05-15/
1•tobiasrenger•3m ago•0 comments

Collective memory loss in herring results in 800 km shift in spawning grounds

https://phys.org/news/2025-05-memory-loss-herring-results-km.html
1•gmays•4m ago•0 comments

Design AI to Fail

https://frontierai.substack.com/p/design-ai-to-fail
2•cgwu•6m ago•0 comments

Nvidia's original customers are feeling unloved and grumpy

https://www.economist.com/business/2025/05/15/nvidias-original-customers-are-feeling-unloved-and-grumpy
3•voxadam•7m ago•3 comments

What Is a Django App?

https://www.revsys.com/tidbits/what-is-a-django-app/
2•rbanffy•9m ago•0 comments

Popcorn: Run Elixir in WASM

https://popcorn.swmansion.com/
1•clessg•14m ago•0 comments

Microsoft 365 Business Premium and Office 365 E1 grant discontinuation

https://partner.microsoft.com/en-ca/asset/collection/microsoft-365-business-premium-and-office-365-e1-grant-discontinuation#/
1•MaintenanceMode•14m ago•1 comments

Crypto has become the ultimate swamp asset

https://www.economist.com/leaders/2025/05/15/crypto-has-become-the-ultimate-swamp-asset
6•toomuchtodo•15m ago•1 comments

An Empirical Study on the Performance and Energy Usage of Compiled Python Code

https://arxiv.org/abs/2505.02346
1•rbanffy•16m ago•0 comments

Compass – Tailwind CSS Course Template

https://tailwindcss.com/plus/templates/compass
1•charlieirish•16m ago•0 comments

Write the most clever code you possibly can

https://buttondown.com/hillelwayne/archive/write-the-most-clever-code-you-possibly-can/
1•rbanffy•16m ago•0 comments

Show HN: NewWord – AI powered personal vocabulary collector

https://www.newword.app/
1•Akring•17m ago•0 comments

In Our Time – The Evolution of Copyright

https://www.bbc.co.uk/programmes/m002c3bm
1•rwmj•17m ago•0 comments

Show HN: Generate code-first workflows using diagrams

https://workflows.diagrid.io
1•yaronsc•17m ago•0 comments

CoreWeave signs new $4B deal with OpenAI, filing shows

https://finance.yahoo.com/news/coreweave-signs-4-billion-deal-151429404.html
1•jaredwiener•18m ago•0 comments

From Comments on Accountability Sinks

https://250bpm.substack.com/p/from-comments-on-accountability-sinks
2•msustrik•18m ago•0 comments

NASA's Voyager 1 Revives Backup Thrusters Before Command Pause

https://www.jpl.nasa.gov/news/nasas-voyager-1-revives-backup-thrusters-before-command-pause/
4•voxadam•18m ago•0 comments

Flyer beware: Don't fall for this airline customer service scam

https://thepointsguy.com/news/airline-customer-service-scam/
1•rolph•19m ago•1 comments

Declaring a Friendship to Self

https://www.sandordargo.com/blog/2025/05/14/friend-self
1•jandeboevrie•20m ago•0 comments

The Joys of Discovering the Roman Underground

https://www.smithsonianmag.com/travel/the-joys-of-discovering-the-roman-underground-from-the-colosseum-to-whats-beneath-the-trevi-foundation-180986626/
2•ulrischa•20m ago•0 comments

Show HN: Designyff – Free Copy/Paste UI Components for Developers

https://designyff.com/
1•unjica•20m ago•1 comments

EU Commission in business with pesticides and glyphosate

2•pipiscrew•24m ago•0 comments

Properties and Best Uses of Visual Encodings (2012) [pdf]

http://complexdiagrams.com/wp-content/2012/01/VisualPropertiesTable.pdf
1•Tomte•25m ago•0 comments

Intelligence on Earth Evolved Independently at Least Twice

https://www.wired.com/story/intelligence-evolved-at-least-twice-in-vertebrate-animals/
3•elisson22•25m ago•1 comments

Voir Dire Training (2015) [pdf]

https://www.aclunc.org/sites/default/files/ORANGE_Training_Material_VoirDire_05.01.2015.pdf
1•Tomte•26m ago•0 comments

Unprecedented cuts to the National Science Foundation (NSF) endanger research

https://medicalxpress.com/news/2025-05-unprecedented-national-science-foundation-endanger.html
1•mdp2021•26m ago•0 comments