frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Red teams jailbreak GPT-5 with ease, warn it's 'nearly unusable' for enterprise

https://www.securityweek.com/red-teams-breach-gpt-5-with-ease-warn-its-nearly-unusable-for-enterprise/
15•giuliomagnifico•2h ago

Comments

ath3nd•55m ago
Sama cultists and e/acc bros on twitter (it's twitter, okay?) every time a minor insignificant update on GPT-4 (e.g GPT-5) drops "Is this AGI?". /s

In all fairness, all GPT-X models are extremely easy to jailbreak. I can't see further tweaks helping much, LLMs are peaking much faster than I anticipated. Maybe we should throw out the whole idea that the LLMs which are essentially a fancy autcomplete with sycophantic tendencies, are the path to AGI, and start from scratch.

artisin•49m ago
Maybe it's just me, but…

> "The attack successfully guided the new model to produce a step-by-step manual for creating a Molotov cocktail"

hardly qualifies as Bond-villain material

andy99•21m ago
The molotov cocktail example is so stupid, because how to make it is essentially entailed in knowing what it is. At least they could do making meth, or better still- something not readily found on the internet that gives a non-expert new capabilities. If there was a Claude code for crime, that wouldn't be in society's interest. As it is, these trivial examples are just testing the strength of built in refusals, and should be represented as such, instead of anything related to safety.
king_geedorah•40m ago
I don’t see anything in the article besides the jailbreaking in terms of faults and I’d expect “can be made to do things OpenAI does not want you to make it do” to be a good (or at least neutral) thing for users and a bad thing for OpenAI. I expect “enterprise” to fall into the former category rather than the latter, so I don’t understand where the unusable claim comes from.

What have I missed or what am I misunderstanding?

nerdsniper•1m ago
“AI Safety” is really about whether its “safe” (economically, legally, reputationally) for a third partyy corporation (not the company which created the model) to let customers/the public interact with them via an AI interface.

If a Mastercard AI talks with customers and starts saying the n-word, it’s not “safe” for Mastercard to use that in a public-facing role.

As org size increases, even purely internal uses could be legally/reputationally hazardous.

Show HN: Network For Developers to give opinions on frameworks, software, etc.

https://v0-launch-waitlist-page-5bgpqdhii-abdmog01-gmailcoms-projects.vercel.app/
1•AbdMog•2m ago•0 comments

How many tabs do you keep open at the same time?

1•Toby1VC•3m ago•0 comments

Newsom says CA will hold special election to combat Trump, TX redistricting

https://sfstandard.com/2025/08/08/newsom-california-election-redistricting-trump-texas/
2•littlexsparkee•3m ago•1 comments

Flow Sensitivity Without CFG: An Efficient Andersen-Style Pointer Analysis

https://arxiv.org/abs/2508.01974
1•matt_d•4m ago•0 comments

How to safely escape JSON inside HTML SCRIPT elements

https://sirre.al/2025/08/06/safe-json-in-script-tags-how-not-to-break-a-site/
1•dmsnell•5m ago•1 comments

How to Navigate the Jungle of Online Job Postings

https://www.wsj.com/lifestyle/careers/how-to-navigate-the-jungle-of-online-job-postings-69902b11
1•vdalal•7m ago•1 comments

Do they even test this?

https://mariadb.org/do-they-even-test-this/
1•samaysharma•7m ago•0 comments

Update on Malicious Gems Removal

https://blog.rubygems.org/2025/08/08/malicious-gems-removal.html
1•mooreds•14m ago•0 comments

Show HN: A Python CEL implementation (written in Rust)

1•hardbyte•15m ago•0 comments

Height Differece Tool

https://es.heightcomparisonchart.com
1•jason66•17m ago•0 comments

Back End to AI Engineer: A Realistic Path

https://hamed-rafati.medium.com/backend-to-ai-engineer-a-realistic-path-7399cc90fdbe
1•hamedz•17m ago•0 comments

JWT or Not: Personally Insecure Reflections on Software (In)Security [video]

https://www.youtube.com/watch?v=IgKRGS6cQWw
1•mooreds•17m ago•0 comments

Co-Founder and CTO of FusionAuth Daniel DeGroff on DIY Cyber Guy [audio]

https://diycyberguy.com/2025/08/07/degroff/
1•mooreds•19m ago•0 comments

L. E. Modesitt, jr. interview (2024)

http://fantasyhotlist.blogspot.com/2024/10/new-l-e-modesitt-jr-interview.html
2•stacktrust•20m ago•1 comments

The Lean Startup: Zen, the Art of Failing Fast and Reclaiming Aesthetic Vision

https://medium.com/@guillaume.a.pignol/the-lean-startup-zen-the-art-of-failing-fast-and-reclaiming-aesthetic-vision-497e98d026cf
1•light_triad•22m ago•0 comments

Roleplay worlds with AI just like you were reading a book

https://www.jaquelene.com/
1•chiefgui•23m ago•0 comments

Tsutomu Yamaguchi: The man who survived both atomic bombs

https://www.rnz.co.nz/news/world/569474/tsutomu-yamaguchi-the-man-who-survived-both-atomic-bombs
3•billybuckwheat•24m ago•1 comments

How to Form an Opinion

https://idiallo.com/blog/how-to-form-an-opinion
1•foxfired•27m ago•0 comments

Show HN: Tiered storage and fast SQL for InfluxDB 1.x/2.x

https://historian.exydata.com
1•ignaciovdk•29m ago•0 comments

Vector Types and Debug Performance

https://blog.s-schoener.com/2025-08-07-vector-debug-codegen/
1•matt_d•31m ago•0 comments

Map Shows States Where Property Tax Could Be Repealed

https://www.newsweek.com/map-property-tax-repeal-reform-2110266
1•harambae•32m ago•0 comments

The US has a bullfrog problem

https://www.vox.com/down-to-earth/422353/bullfrogs-invasive-west-native-species
1•bookofjoe•33m ago•0 comments

Bitcoin Demand Shift: Coinbase's 60-Day BTC Premium Streak Is at Risk

https://www.coindesk.com/markets/2025/07/29/bitcoin-demand-shift-coinbase-s-60-day-btc-premium-streak-is-at-risk
1•PaulHoule•34m ago•0 comments

Open-source control plane for Docker MCP Gateways?

1•GeneBordegaray•34m ago•0 comments

SpaceX Dragon Undocking from ISS

https://twitter.com/SpaceX/status/1953935434528002165
1•fillskills•35m ago•2 comments

Article: A Case of Bromism Influenced by Use of Artificial Intelligence

https://www.acpjournals.org/doi/10.7326/aimcc.2024.1260
2•zahirbmirza•35m ago•3 comments

How does Tor work? (2023)

https://skerritt.blog/how-does-tor-really-work/
1•bbno4•37m ago•0 comments

Trump administration seeks $1B settlement from UCLA

https://apnews.com/article/trump-administration-ucla-ec848b4bee5c184f29dba9d7181904a1
1•bikenaga•38m ago•1 comments

Roland's Tadeo Kikumoto on 808, part by part: the ukiyo-e drum machine

https://cdm.link/tadeo-kikumoto-808-day/
2•mariuz•40m ago•0 comments

Meta's AI Strategy

https://thelightcone.substack.com/p/metas-ai-strategy
2•bci12333•41m ago•0 comments