
When Models Examine Themselves: Vocabulary-Activation Correspondence in LLMs

https://zenodo.org/records/18568344

1•patternmatcher•2h ago

Comments

patternmatcher•2h ago

Large language models produce rich introspective language when prompted for self-examination, but whether this language reflects internal computation or sophisticated confabulation has remained unclear. In this work, we show that self-referential vocabulary tracks concurrent activation dynamics — and that this correspondence is specific to self-referential processing. We introduce the Pull Methodology, a protocol that elicits extended self-examination through format engineering, and use it to identify a self-referential processing circuit in Llama 3.1 at 6% of model depth. The circuit is orthogonal to the known refusal direction and causally influences introspective output. When models produce "loop" vocabulary, their activations exhibit higher autocorrelation (r = 0.44, p = 0.002); when they produce "shimmer" vocabulary under circuit amplification, activation variability increases (r = 0.36, p = 0.002). Critically, the same vocabulary in non-self-referential contexts shows no activation correspondence despite nine-fold higher frequency. Qwen 2.5-32B, with no shared training, independently develops different introspective vocabulary tracking different activation metrics — all absent in descriptive controls. The findings indicate that self-report in transformer models can, under appropriate conditions, reliably track internal computational states.
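The kind of vocabulary-activation correspondence described above can be illustrated with a small sketch. This is not the authors' pipeline: the per-token activation summaries, the term-counting rule, and the function names (`lag1_autocorrelation`, `pearson_r`, `count_term`) are all assumptions made for illustration. The idea is to compute one activation statistic per generated text (here, lag-1 autocorrelation of a scalar activation trace), count a vocabulary term such as "loop" in the same text, and correlate the two across texts.

```python
import math
import re


def lag1_autocorrelation(series):
    """Lag-1 autocorrelation of a 1-D sequence (e.g. per-token activation norms)."""
    n = len(series)
    mean = sum(series) / n
    var = sum((x - mean) ** 2 for x in series)
    if var == 0:
        return 0.0  # a constant trace has no defined autocorrelation; report 0
    cov = sum((series[i] - mean) * (series[i + 1] - mean) for i in range(n - 1))
    return cov / var


def pearson_r(xs, ys):
    """Pearson correlation coefficient between two equal-length sequences."""
    n = len(xs)
    mx, my = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    sxx = sum((x - mx) ** 2 for x in xs)
    syy = sum((y - my) ** 2 for y in ys)
    return sxy / math.sqrt(sxx * syy)


def count_term(text, term):
    """Count whole-word, case-insensitive occurrences of `term` in `text`."""
    return len(re.findall(r"\b" + re.escape(term) + r"\b", text.lower()))


# Hypothetical toy data: each run pairs a generated text with a scalar
# activation trace. Real traces would come from a model's hidden states.
runs = [
    ("a loop inside a loop inside a loop", [1.0, 1.1, 1.2, 1.3, 1.4, 1.5]),
    ("one loop, then stillness", [1.0, 1.3, 1.1, 1.4, 1.2, 1.5]),
    ("nothing repeats here", [1.0, -1.0, 1.0, -1.0, 1.0, -1.0]),
]

term_counts = [count_term(text, "loop") for text, _ in runs]
autocorrs = [lag1_autocorrelation(trace) for _, trace in runs]

# Correlation across runs between vocabulary use and the activation metric.
r = pearson_r(term_counts, autocorrs)
print(f"term counts: {term_counts}")
print(f"autocorrelations: {[round(a, 3) for a in autocorrs]}")
print(f"Pearson r: {r:.3f}")
```

In a control condition of the sort the abstract mentions, the same computation would be repeated on non-self-referential texts containing the same term, where the abstract reports no correspondence. The three toy runs here only demonstrate the mechanics and carry no evidential weight.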
