frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Show HN: 83K lines of C++ – cryptocurrency written from scratch, not a fork

https://github.com/Kristian5013/flow-protocol
1•kristianXXI•3m ago•0 comments

Show HN: SAA – A minimal shell-as-chat agent using only Bash

https://github.com/moravy-mochi/saa
1•mrvmochi•3m ago•0 comments

Mario Tchou

https://en.wikipedia.org/wiki/Mario_Tchou
1•simonebrunozzi•4m ago•0 comments

Does Anyone Even Know What's Happening in Zim?

https://mayberay.bearblog.dev/does-anyone-even-know-whats-happening-in-zim-right-now/
1•mugamuga•5m ago•0 comments

The last Morse code maritime radio station in North America [video]

https://www.youtube.com/watch?v=GzN-D0yIkGQ
1•austinallegro•7m ago•0 comments

Show HN: Hacker Newspaper – Yet another HN front end optimized for mobile

https://hackernews.paperd.ink/
1•robertlangdon•8m ago•0 comments

OpenClaw Is Changing My Life

https://reorx.com/blog/openclaw-is-changing-my-life/
1•novoreorx•16m ago•0 comments

Everything you need to know about lasers in one photo

https://commons.wikimedia.org/wiki/File:Commercial_laser_lines.svg
1•mahirsaid•18m ago•0 comments

SCOTUS to decide if 1988 video tape privacy law applies to internet uses

https://www.jurist.org/news/2026/01/us-supreme-court-to-decide-if-1988-video-tape-privacy-law-app...
1•voxadam•19m ago•0 comments

Epstein files reveal deeper ties to scientists than previously known

https://www.nature.com/articles/d41586-026-00388-0
1•XzetaU8•26m ago•0 comments

Red teamers arrested conducting a penetration test

https://www.infosecinstitute.com/podcast/red-teamers-arrested-conducting-a-penetration-test/
1•begueradj•34m ago•0 comments

Show HN: Open-source AI powered Kubernetes IDE

https://github.com/agentkube/agentkube
1•saiyampathak•37m ago•0 comments

Show HN: Lucid – Use LLM hallucination to generate verified software specs

https://github.com/gtsbahamas/hallucination-reversing-system
1•tywells•40m ago•0 comments

AI Doesn't Write Every Framework Equally Well

https://x.com/SevenviewSteve/article/2019601506429730976
1•Osiris30•43m ago•0 comments

Aisbf – an intelligent routing proxy for OpenAI compatible clients

https://pypi.org/project/aisbf/
1•nextime•43m ago•1 comments

Let's handle 1M requests per second

https://www.youtube.com/watch?v=W4EwfEU8CGA
1•4pkjai•44m ago•0 comments

OpenClaw Partners with VirusTotal for Skill Security

https://openclaw.ai/blog/virustotal-partnership
1•zhizhenchi•45m ago•0 comments

Goal: Ship 1M Lines of Code Daily

2•feastingonslop•55m ago•0 comments

Show HN: Codex-mem, 90% fewer tokens for Codex

https://github.com/StartripAI/codex-mem
1•alfredray•58m ago•0 comments

FastLangML: FastLangML:Context‑aware lang detector for short conversational text

https://github.com/pnrajan/fastlangml
1•sachuin23•1h ago•1 comments

LineageOS 23.2

https://lineageos.org/Changelog-31/
2•pentagrama•1h ago•0 comments

Crypto Deposit Frauds

2•wwdesouza•1h ago•0 comments

Substack makes money from hosting Nazi newsletters

https://www.theguardian.com/media/2026/feb/07/revealed-how-substack-makes-money-from-hosting-nazi...
4•lostlogin•1h ago•0 comments

Framing an LLM as a safety researcher changes its language, not its judgement

https://lab.fukami.eu/LLMAAJ
1•dogacel•1h ago•0 comments

Are there anyone interested about a creator economy startup

1•Nejana•1h ago•0 comments

Show HN: Skill Lab – CLI tool for testing and quality scoring agent skills

https://github.com/8ddieHu0314/Skill-Lab
1•qu4rk5314•1h ago•0 comments

2003: What is Google's Ultimate Goal? [video]

https://www.youtube.com/watch?v=xqdi1xjtys4
1•1659447091•1h ago•0 comments

Roger Ebert Reviews "The Shawshank Redemption"

https://www.rogerebert.com/reviews/great-movie-the-shawshank-redemption-1994
1•monero-xmr•1h ago•0 comments

Busy Months in KDE Linux

https://pointieststick.com/2026/02/06/busy-months-in-kde-linux/
1•todsacerdoti•1h ago•0 comments

Zram as Swap

https://wiki.archlinux.org/title/Zram#Usage_as_swap
1•seansh•1h ago•1 comments
Open in hackernews

Solving the Issue of Interpretability of AI

4•mikeai686•6mo ago
# Making AI Thoughts Understandable Through Separate Translator Models

I want to propose a new approach to the problem of AI opacity.

## The Core Problem

Modern AI systems work as "black boxes" - we can't see how they think. Recently, leading researchers warned that we might soon lose even the small transparency we currently have. Here's the difficulty: if we force AI to "think aloud" in human language, it reduces efficiency, but if we allow it to use efficient mathematical representations, we don't understand what's happening.

## Proposed Solution: A Modular System with Translators

I propose dividing the system into four parts:

*1. Free Internal Thinking* Let AI use any mathematical representations that are most efficient for solving tasks. We don't limit its thinking methods.

*2. Multiple Specialized Translator Models* We use several separate models trained to translate AI's internal representations into human-understandable language. Each translator can: - explain the logical structure of reasoning - highlight the main concepts the model is working with - explain how confident the model is in its conclusions Each function is performed by several different translators so results can be cross-checked.

*3. Contradiction Resolution Mechanisms* When translators give different explanations, we: - Highlight areas where they agree (high reliability) - Emphasize discrepancies (likely complex or ambiguous reasoning) - Explain why different interpretations arose If translator results don't contradict each other, we combine non-contradictory aspects into a unified explanation.

*4. Ethics Verification* We use "constitutional AI" (a special rule system, like in Claude.ai) to check: - Compliance with ethical standards - Logical consistency - Alignment with human values

## Main Advantages

- *No delays*: The model can think and produce results without delays (especially important in verbal dialogue), while explanations can be generated in parallel for quality control and, if necessary, future corrections. - *Moderation*: For critically important decisions requiring human moderation, we can wait for the translation and for the human moderator's decision - *Different perspectives*: Different translators show different aspects of thinking - *Transparency of complexities*: When translators disagree, we know the reasoning is complex - *Ethical safety*: An additional verification layer ensures alignment with values

## Open Questions

1. How do we train translators without "correct answers" from humans? 2. How many translators is optimal to use? 3. What to do if all translators cannot clearly explain the reasoning? 4. How to prove that translators accurately reflect internal thinking?

## Next Steps

I would like to: - Create a simple example of such a system working - Develop methods to verify translation accuracy - Combine this approach with existing tools

I would appreciate community feedback, especially regarding potential problems and practical challenges.

Comments

ijk•6mo ago
It sounds like you're proposing doing this operation on the tokens in the reasoning. While it would be interesting to know if allowing it to choose arbitrary tokens, the biggest issue is that there's quite a bit of evidence that the tokens it prints have only a loose relationship with the internal model processes.

I question your premise; first demonstrate that having it think aloud in "efficient mathematical representations" is a useful efficiency. Then you can demonstrate that you can do any interpretatability work on the output.