frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Symbolic Circuit Distillation: prove program to LLM circuit equivalence

https://github.com/neelsomani/symbolic-circuit-distillation
1•nsomani•58m ago

Comments

nsomani•56m ago
Hi HN,

I have been working on a small interpretability project I call Symbolic Circuit Distillation. The goal is to take a tiny neuron-level circuit (like the ones in OpenAI's "Sparse Circuits" work) and automatically recover a concise Python program that implements the same algorithm, along with a bounded formal proof that the two are equivalent on a finite token domain.

Roughly, the pipeline is:

1. Start from a pruned circuit graph for a specific behavior (e.g. quote closing or bracket depth) extracted from a transformer. 2. Treat the circuit as an executable function and train a tiny ReLU network ("surrogate") that exactly matches the circuit on all inputs in a bounded domain (typically sequences of length 5–10 over a small token alphabet). 3. Search over a constrained DSL of common transformer motifs (counters, toggles, threshold detectors, small state machines) to synthesize candidate Python programs. 4. Use SMT-based bounded equivalence checking to either: - Prove that a candidate program and the surrogate agree on all inputs in the domain, or - Produce a counterexample input that rules the program out.

If the solver finds a proof, you get a small, human-readable Python function plus a machine-checkable guarantee that it matches the original circuit on that bounded domain.

Why I built this

Mechanistic interpretability has gotten pretty good at extracting "small crisp circuits" from large models, but turning those graphs into clean, human-readable algorithms is still very manual. My goal here is to automate that last step: go from "here is a sparse circuit" to "here is a verified algorithm that explains what it does", without hand-holding.

What works today

- Tasks: quote closing and bracket-depth detection from the OpenAI circuit_sparsity repo. - Exact surrogate fitting on a finite token domain. - DSL templates for simple counters, toggles, and small state machines. - SMT-based bounded equivalence between: sparse circuit -> ReLU surrogate -> Python program in the DSL.

Limitations and open questions

- The guarantees are bounded: equivalence is only proven on a finite token domain (short sequences and a small vocabulary). - Currently focused on very small circuits. Scaling to larger circuits and longer contexts is open engineering and research work. - The DSL is hand-designed around a few motifs. I am not yet learning the DSL itself or doing anything very clever in the search.

What I would love feedback on

- Are the problem framing and guarantees interesting to people working on mechanistic interpretability or formal methods? - Suggestions for next benchmarks: which circuits or behaviors would you want to see distilled next? - Feedback on the DSL design, search strategy, and SMT setup.

Happy to answer questions about implementation details, the SMT encoding, integration with OpenAI's Sparse Circuits repo, or anything else.

Dingo – a meta-language for Go

https://dingolang.com/
1•ezekg•1m ago•0 comments

Tesla Optimus robot takes a suspicious tumble in new demo

https://electrek.co/2025/12/07/tesla-optimus-robot-takes-suspicious-tumble-in-new-demo/
1•LopRabbit•1m ago•0 comments

They Killed My Source

https://www.theatlantic.com/magazine/2026/01/mohammad-tajik-iran-cyber-intelligence/684954/
1•JumpCrisscross•1m ago•0 comments

Trump greenlights Nvidia H200 AI chip sales to China, says Xi responded pos

https://www.cnbc.com/2025/12/08/trump-nvidia-h200-sales-china.html
1•paulkrush•1m ago•0 comments

I wasted 6 months "building systems". I should've recorded myself do the work

https://old.reddit.com/r/Entrepreneurs/comments/1pgl1z0/i_wasted_6_months_building_systems_when_i...
1•pykello•5m ago•0 comments

Aep-2026: API Design Standard and Tooling Ecosystem

https://aep.dev/blog/aep-2026-release/
2•rambleraptor•5m ago•0 comments

I built a "Netflix-grade" retro game player for the browser (React and WASM)

https://github.com/muditjuneja/koin-deck-retro-player
1•beingmudit•6m ago•1 comments

Even Odd Grid – You Can See More or Not

https://yay-or-nope.netlify.app/
1•cpuXguy•7m ago•0 comments

Show HN: Prompt Refiner – MCP extension that refines prompts into 4 styles

https://github.com/cs97jjm3/cs97jjm3-prompt-refiner
1•cs97jjm3•10m ago•0 comments

Is Day Trading Zero Sum?

https://thelightcone.substack.com/p/is-day-trading-zero-sum
2•bci12333•11m ago•0 comments

Goldman: AI bubble brewing in private markets

https://www.msn.com/en-us/money/news/goldman-ai-bubble-brewing-in-private-markets/vi-AA1RWDkb
3•zerosizedweasle•11m ago•0 comments

Student Loan Deductions

https://incoherency.co.uk/blog/stories/student-loan.html
1•speckx•14m ago•0 comments

60M streams, £10K+/month passive income – and I only just built a website

https://www.indiehackers.com/post/60m-streams-10k-month-passive-income-and-i-only-just-built-a-we...
1•meelo_c•14m ago•0 comments

Buildstash Launch Week: Software binaries management platform

https://buildstash.com/launch-week
1•r0bbie•14m ago•0 comments

Show HN: I've asked Claude to improve codebase quality 200 times

https://gricha.dev/blog/the-highest-quality-codebase
3•Gricha•15m ago•0 comments

Show HN: QueryPanel – AI Driven Dashboards

https://querypanel.io
1•civancza•17m ago•1 comments

Salt Bank and Upvest

https://michael-dev-tech.github.io/Website/salt.html
1•f0r3st•19m ago•0 comments

Ring of Fire

https://en.wikipedia.org/wiki/Ring_of_Fire
1•kiernanmcgowan•19m ago•0 comments

Can hardship become an exchangeable asset without exploitation?

https://ontotrust.org/ontoTrust-life-value-exchange.html
1•nettalk83•19m ago•1 comments

Aptos: Free software for me, but not for thee

https://github.com/aptos-labs/aptos-core/issues/18291
1•dboreham•19m ago•0 comments

Delivery Robots Take over Chicago Sidewalks, Sparking Debate and a Petition

https://blockclubchicago.org/2025/12/08/delivery-robots-take-over-chicago-sidewalks-sparking-deba...
2•mikhael•20m ago•0 comments

Climate activists question hype around small modular reactors

https://www.euractiv.com/news/climate-activists-question-hype-around-small-modular-reactors/
1•miguelazo•21m ago•0 comments

Pig Video Arcades Critique Life in the Pen (1997)

https://www.wired.com/1997/06/pig-video-arcades-critique-life-in-the-pen/
1•naryJane•21m ago•0 comments

Happy Eyeballs

https://en.wikipedia.org/wiki/Happy_Eyeballs
1•chaghalibaghali•23m ago•0 comments

My historian dad unknowingly prepared me for the age of AI

https://www.holdmyjuice.co/read/how-my-dad-prepared-me-for-ai-without-even-knowing-it
2•sophiabk•24m ago•1 comments

IoT-devices and IoT Case Lab team up to design and print cases for electronics

https://iot-devices.com.ua/en/iot-devices-and-iot-case-lab-team-up/
2•iotdevicesdev•25m ago•0 comments

Show HN: CoolwulfIME – Best Input Method on Physical Keyboard Phones in 2025

http://coolwulfIME.org
1•coolwulf•25m ago•0 comments

Git's `todo` branch: a versioned todo list in an orphan meta branch

https://github.com/git/git/tree/todo
1•betamint•25m ago•0 comments

AI Shape Beauty Industry

https://salon.syshuman.com/
1•KadirErturk•26m ago•0 comments

Bees 'infect' each other with optimism that spreads through the colony

https://newatlas.com/biology/positivity-spread-bumble-bees/
1•geox•27m ago•0 comments