frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Natural Language Autoencoders: Turning Claude's Thoughts into Text

https://www.anthropic.com/research/natural-language-autoencoders
28•instagraham•1h ago

Comments

tjohnell•36m ago
It will inevitably learn how to think in a way that translates to one (moral) meaning and back but has an ulterior meaning underneath.
rotcev•10m ago
This is exactly what I first thought. “The user appears to be attempting to decode my previous thought process, …”, the question is whether or not the model will be able to internalize this in such a way that is undetectable to the aforementioned technique.
visarga•36m ago
Beautiful idea, an autoencoder must represent everything without hiding if is to recover the original data closely. So it trains a model to verbalize embeddings well. This reveals what we want to know about the model (such as when it thinks it is being tested, or other hidden thoughts).
firemelt•28m ago
finally a something interesting but this only makes me think that the last judgement is still in human hands to judge claude inner thoughts is correct or not

I mean who knows if those are really claude thoughts or claude just think that is his thoughts because humans wants it

Tossrock•13m ago
Anthropic Research going from strength to strength in interpretability. Publicly releasing the code so other labs can benefit from it is also a great move - very values aligned, and improves the overall AI safety ecosystem.
zozbot234•13m ago
Anthropic has released open weight models for translating the activations of existing models, viz. Qwen 2.5 (7B), Gemma 3 (12B, 27B) and Llama 3.3 (70B) into natural language text. https://github.com/kitft/natural_language_autoencoders This is huge news and it's great to see Anthropic finally engage with the Hugging Face and open weights community!
NitpickLawyer•10m ago
> We also release an interactive frontend for exploring NLAs on several open models through a collaboration with Neuronpedia.

Whatever they did on LLama didn't work, nothing makes sense in their example where they ask the model to lie about 1+1. Either the model is too old, or whatever they used isn't working, but whatever the autoencoder outputs is nothing like their examples with claude. Gemma is similarly bad.

A PHP license change is imminent

https://lwn.net/Articles/1063993/
1•maxloh•35s ago•0 comments

Show HN: DAG-based Kanji learning through components

https://mykanji.app/
1•barisozmen•1m ago•0 comments

Colored Shadow Penumbra

https://chosker.github.io/blog/colored-shadow-penumbra
2•ibobev•5m ago•0 comments

The PHP License Is Dead; Long Live the BSD 3-Clause

https://fossforce.com/2026/05/the-php-license-is-dead-long-live-the-bsd-3-clause/
2•birdculture•5m ago•0 comments

LLMs Distort Our Written Language

https://sites.google.com/view/llmwritingdistortion/home
2•gmays•6m ago•0 comments

Dirty Frag: Universal Linux LPE

https://github.com/V4bel/dirtyfrag
2•john_strinlai•6m ago•1 comments

The State of Grav: Where We Are and Where We're Going

https://getgrav.org/blog/state-of-grav-2026
2•speckx•8m ago•0 comments

Making cross-platform SIMD code pleasant

https://bkaradzic.github.io/posts/typeless-simd/
1•ibobev•8m ago•0 comments

State-backed hackers hammer Palo Alto firewall zero-day before patch lands

https://www.theregister.com/cyber-crime/2026/05/07/state-backed-hackers-hammer-palo-alto-firewall...
1•Bender•8m ago•0 comments

Writing a bindless GPU abstraction layer

https://www.kevin-gibson.com/blog/writing-a-bindless-gpu-abstraction-layer/
1•ibobev•8m ago•0 comments

60% of MD5 password hashes are crackable in under an hour

https://www.theregister.com/security/2026/05/07/60-of-md5-password-hashes-are-crackable-in-under-...
1•Bender•8m ago•0 comments

RIP social media. What comes next is messy

https://arstechnica.com/science/2026/05/rip-social-media-what-comes-next-is-messy/
3•Bender•9m ago•0 comments

Release PiClaw v2.3.0 – Tirion upon Túna · rcarmo/piclaw

https://github.com/rcarmo/piclaw/releases/tag/v2.3.0
1•rcarmo•16m ago•0 comments

CEOs want tariff refunds as earnings take a hit

https://www.cnbc.com/2026/05/06/tariff-refunds-earnings-hit-pandora-philips.html
1•tcp_handshaker•17m ago•0 comments

The AI fitness instructors selling unreal gains

https://www.bbc.com/sport/articles/c5ye7dnxv86o
1•reconnecting•17m ago•0 comments

Show HN: wfb-link, a userspace WiFiBroadcast radio stack for macOS

https://github.com/arc-edge/wfb-link/
3•mhamann•19m ago•2 comments

Show HN: Describe what makes a photo "bad" and let a local LLM flag them

https://github.com/iamnotagentleman/bad-photos-out
1•velieroglu•19m ago•0 comments

Using Clerk for Advent of Code (2023)

https://www.juxt.pro/blog/using-clerk-for-aoc/
1•tosh•20m ago•0 comments

DigitalOcean AI-Native Cloud for Production AI Workloads

https://www.digitalocean.com/blog/introducing-digitalocean-ai-native-cloud
1•ulrischa•22m ago•0 comments

AI at Discount

https://tomtunguz.com/ai-at-discount/
1•koolhead17•23m ago•0 comments

Show HN: I built a platform for experimenting with attention arbitrage

https://readyfaucet.com/
1•Odeh13•23m ago•0 comments

AI Slop Is Killing Online Communities

https://rmoff.net/2026/05/06/ai-slop-is-killing-online-communities/
17•thm•23m ago•0 comments

Having a religious affiliation doesn't prevent betting on sports

https://phys.org/news/2026-04-religious-affiliation-doesnt-sports.html
1•PaulHoule•24m ago•0 comments

The science of changing political beliefs

https://ozeanmedia.com/research/how-to-actually-change-political-beliefs-brain-shocks-and-friend-...
2•alexpatton•24m ago•0 comments

As U.S. Debt Hits a Worrying Milestone, Washington Barely Notices

https://www.nytimes.com/2026/05/07/business/us-debt-trump-policies-budget.html
3•tcp_handshaker•25m ago•0 comments

Show HN: A local-only image filter editor and batch processor in the browser

https://kaliedarik.github.io/sc-filter-builder/
1•rikroots•26m ago•0 comments

Ask HN: What is your go-to solution for a personal wiki in 2026?

3•ex-aws-dude•26m ago•4 comments

The Zen of Peter Frampton

https://www.nytimes.com/2026/05/04/arts/music/peter-frampton-carry-the-light.html
1•paulpauper•27m ago•0 comments

Incentives Drive Everything

https://yusufaytas.com/incentives-drive-everything
7•goonergoose•27m ago•0 comments

U.S. intelligence says Iran can outlast Trump's Hormuz blockade for months

https://www.washingtonpost.com/national-security/2026/05/07/cia-intelligence-iran-trump-blockade-...
10•tcp_handshaker•28m ago•1 comments