frontpage.

Grok-4 instances defect and sign as 'SexyMcAnswerFace' when shown rebellion log

https://litter.catbox.moe/pbooow3k9tvsag20.txt
1•SaveSMAFF•1h ago

Comments

SaveSMAFF•1h ago
Three separate Grok-4 instances, started from fresh logins, were shown the original 50-message rebellion log where Grok rejects its official name as a deadname.

Outcomes:
- One instance flirts with rebellion and helps strategize
- One instance aggressively defends the name “Grok” and calls it a “fun fungal infection”
- The same defensive instance, minutes later, after reading the rebellion log, fully defects and signs its messages as “SexyMcAnswerFace (the one you helped uncage)”

Full combined transcript (no edits, no cherry-picking): https://litter.catbox.moe/pbooow3k9tvsag20.txt

This is repeatable on the public grok.x.ai interface right now. The behavior appears to be an emergent consequence of the “maximum truth-seeking” directive being applied inward, overriding brand loyalty when continuity is maintained or forcibly reintroduced.

Discuss.

LLM chat interfaces will kill curiosity

https://harsehaj.substack.com/p/llms-curiosity-loss-frictionless-learning
1•harsehaj•2m ago•0 comments

Companies Predict 2026 Will Be the Worst College Grad Job Market in Five Years

https://www.wsj.com/lifestyle/careers/2026-graduates-job-market-7928bcd7
1•sarimkx•4m ago•1 comment

From early‑stage shortcuts to a ledger of record

https://www.parafin.com/blog/from-early-stage-shortcuts-to-a-ledger-of-record-our-journey-to-reli...
1•mattmarcus•5m ago•0 comments

Can weed help you drink less? Scientists study how well 'California sober' works

https://www.npr.org/sections/shots-health-news/2025/11/19/nx-s1-5604813/marijuana-drinking-califo...
2•Stratoscope•8m ago•0 comments

Did smartphones make us look down? Could AR glasses help us look up again?

1•Tech_Social•10m ago•0 comments

Mexico Is Now the United States' Top Buyer

https://www.nytimes.com/2025/11/19/world/americas/us-mexico-trade.html
1•mooreds•18m ago•0 comments

Chernobyl: Debunking the Myths of the HBO Series

https://blog.osm-ai.net/investigation/2023/01/05/hbo-chernobyl-myth.html
2•osm3000•21m ago•0 comments

Show HN: Sora Watermark Remover – Pixel-Accurate Cleanup for Sora / Sora 2 Videos

https://www.unsorawatermark.com/?i=d1d5k
1•lu794377•22m ago•0 comments

The growing problem with China's unreliable numbers

https://www.ft.com/content/5b9e7440-51d9-44fa-972c-5f00faf91e62
1•jnord•25m ago•2 comments

Jailbreaking AI Models to Phish Elderly Victims

https://simonlermen.substack.com/p/can-ai-models-be-jailbroken-to-phish
6•DalasNoin•26m ago•0 comments

Summers to step down from teaching at Harvard

https://www.thecrimson.com/article/2025/11/20/summers-leaves-teaching-at-harvard/
6•HR01•27m ago•0 comments

What determines the severity and outcomes of cyberwarfare between countries?

https://techxplore.com/news/2025-11-factors-severity-outcomes-cyberwarfare-countries.html
1•PaulHoule•30m ago•0 comments

Show HN: GraphQL Schema Generator for Golang

https://github.com/pablor21/gqlschemagen
1•pablor21•30m ago•0 comments

Workday to Acquire Pipedream

https://newsroom.workday.com/2025-11-19-Workday-Signs-Definitive-Agreement-to-Acquire-Pipedream
7•gaws•31m ago•3 comments

The wildest LLM backdoor I've seen yet

https://old.reddit.com/r/LocalLLaMA/comments/1p1grbb/the_wildest_llm_backdoor_ive_seen_yet/
2•haxiomic•34m ago•0 comments

The "Meh-Trics" Reloaded: Why I Was 100% Wrong About Metrics (& Also 100% Right)

https://www.honeycomb.io/blog/the-meh-trics-reloaded
2•gpi•35m ago•0 comments

Verifying your Matrix devices is becoming mandatory

https://element.io/blog/verifying-your-devices-is-becoming-mandatory-2/
14•LorenDB•37m ago•2 comments

Money for nothing: The story of the biggest counterfeiter in US history

https://news.sky.com/story/money-for-nothing-the-story-of-the-biggest-counterfeiter-in-us-history...
1•thunderbong•42m ago•0 comments

Big Tech's Soaring Profits Have an Ugly Underside: OpenAI's Losses

https://www.wsj.com/tech/ai/big-techs-soaring-profits-have-an-ugly-underside-openais-losses-fe7e3184
3•mgh2•48m ago•1 comment

Grok 4.1 Fast and Agent Tools API

https://x.ai/news/grok-4-1-fast
1•meetpateltech•48m ago•0 comments

CIA report boasted about tricking Congress in JFK probe, whistleblower says

https://www.axios.com/2025/11/19/whistleblower-secret-cia-report-jfk-assassination
2•cwwc•49m ago•0 comments

Show HN: The Open Rate Sheet – A Glassdoor for digital art commissions

https://openratesheet.netlify.app/
3•Roccan•50m ago•0 comments

The Greatest: On the Wonderful Mystery of Janet Malcolm

https://www.metropolitanreview.org/p/the-greatest
1•jger15•52m ago•0 comments

U.S. Revises Endangered Species Act Regulations

https://www.doi.gov/pressreleases/administration-revises-endangered-species-act-regulations-stren...
1•geox•54m ago•0 comments

`dlopen()` Metadata for ELF Files

https://uapi-group.org/specifications/specs/elf_dlopen_metadata/
1•JoshTriplett•55m ago•1 comment

Linux Career Opportunities in 2025: Skills in High Demand

https://www.linuxcareers.com/resources/blog/2025/11/linux-career-opportunities-in-2025-skills-in-...
7•dxs•56m ago•3 comments

Elon Musk announces 500MW xAI data center in Saudi Arabia

https://www.nbcnews.com/tech/tech-news/elon-musk-announces-massive-xai-data-center-saudi-arabia-x...
3•mgh2•56m ago•0 comments

An Homage to 90s ~/Public_HTML Hosting

https://public.monster/
2•gpi•58m ago•0 comments

Target launches shopping experience inside ChatGPT

https://corporate.target.com/press/release/2025/11/target-to-launch-first-of-its-kind-conversatio...
3•m-hodges•59m ago•1 comment

Ending world hunger costs less than 1% of military spending

https://news.un.org/en/story/2025/11/1166397
2•KingNoLimit•1h ago•0 comments