Show HN: TXT OS – Open-Source AI Reasoning, One Plain-Text File at a Time

https://github.com/onestardao/WFGY/tree/main/OS

9•TXTOS•7h ago

Hi HN,

I'm excited to share TXT OS — an open-source AI reasoning engine that runs entirely inside a single `.txt` file.

- No installs, no signup, no hidden code — just copy-paste the file into any LLM chat window (GPT, Claude, Gemini, etc.). - +22.4% semantic accuracy, +42.1% reasoning success, and 3.6× more stability (benchmarked on GSM8K and Truthful-QA). - Features Semantic Tree Memory, Hallucination Shield, and fully exportable logic. - MIT Licensed, zero tracking, zero ads.

Why did I build this? I wanted to prove that advanced reasoning and memory could be made open, portable, and accessible to anyone — just with pure text, no software or setup.

A note: I'm from China, and English is not my first language. This post and the docs were partly assisted by AI, but I personally reviewed and approved every line of content. All ideas, design, and code are my own work. If anything is unclear or could be improved, I really welcome your feedback!

I'm the author, and happy to answer any questions or suggestions here!

Comments

ultimateking•7h ago

Really cool project! Quick questions:

1. How does TXT OS store its “Semantic Tree Memory” between sessions? 2. When `kbtest` detects a hallucination, what happens next? 3. Any idea of the speed impact on smaller models like LLaMA-2-13B?

Thanks for sharing—excited to try it out!

TXTOS•7h ago

Semantic Tree Memory

We actually serialize the tree as a compact JSON-like structure right in the TXT file—each node gets a header like #NODE:id and indented subtrees. When you reload, TXT OS parses those markers back into your LLM’s memory map. No external DB needed—just plain text you can copy-paste between sessions.

--- When kbtest Fires

Internally it tracks our ΔS metric (semantic tension). Once ΔS crosses a preset threshold, kbtest prints a warning and automatically rolls you back to the last “safe” tree checkpoint. That means you lose only the bad branch, not your entire session. Think of it like an undo button for hallucinations.

--- Performance on LLaMA-2-13B

Benchmarks were on GPT-4, but on a 13B model you’ll see roughly a 10–15% token-generation slow-down thanks to the extra parsing and boundary checks. In practice that’s about +2 ms per token, which most folks find an acceptable trade-off for the added stability.

Hope that clears things up—let me know if you hit any weird edge cases!

brown2000•5h ago

interesting! Quick question:

does TXT OS work equally well with open-source models, or is it optimized more for models like GPT-4 or Claude?

TXTOS•5h ago

Hey, good question!

I've actually tested TXT OS with about 10 different AIs already—you can check out the full rundown on my repo. Generally, ChatGPT, Grok, Claude, and Perplexity gave the smoothest and best experience. The others still work fine, but some, like Gemini, have minor quirks (Gemini randomly adds a weird parameter during initial setup, but it sorts itself out after the first step).

So, long story short, if you want a hassle-free experience, go with ChatGPT, Grok, Claude, or Perplexity!

yyhhooq•2h ago

could you explain the four math a little bit? Why it can activate AI ?

TXTOS•2h ago

Sure — it’s not about activating AI like magic, it’s about steering its reasoning process.

Each formula plays a role in making the LLM more stable, coherent, and logically self-aware:

• = I - G + mc² defines semantic residue — how far the current output strays from meaning. • BigBig(G) recombines context & error to steer output back toward intent. • BBCR detects collapse and triggers reset → rebirth (like fail-safe logic). • BBAM models attention decay — restoring continuity over multiple steps.

Together, this makes the LLM act less like autocomplete… and more like a self-guided reasoner.

Ask HN: Could the C64 startup screen have encouraged more users to learn BASIC?

Show HN: An open-source, Android app for discovering privacy-respecting software

Show HN: Make Led Scroller Message

What are we missing out on when we think Transformer is unreasonable in biology?

Entering a Nuclear Power Plant – Smarter Every Day [video]

Five Lagrange Points Every Project Manager Should Know

Show HN: Super Launch, a clean and minimal product launch platform

Show HN: Urban Pipeline for Citywide Insights

Show HN: Wordrops – A competitive writing platform inspired by League of Legends

Easy dynamic dispatch using GLIBC Hardware Capabilities

Earth's Inner Core Is Solid – Not Liquid – Even Though It's Blistering Hot

Pixel Piranhas

Things Not to Learn as an AI Engineer – By Paul Iusztin

Iranian official claims Israel used 'occult and supernatural spirits' during war

Decrypting Crypto: Digital Assets and Web3 Explained

Tandy Corporation, Part 3 – By Bradford Morgan White

Why I got rid of all my Neovim plugins

A closer look at the Model Context Protocol

State of the Vibes: A Slice of Vibe Coding in June 2025

Water levels and temperatures across Canada

Halo's Future Will Be Revealed in October, Says Xbox

Moon Landing Will Take More Than Rocket Science

Commodore 64 Ultimate is the company's first hardware release in over 30 year

Cadence IP for LPDDR6 Launched

Show HN: Clu3 – Team up with GPTs in a 2v2 game of codenames

Lab-grown meat is focus of new laws and legislation

The Case of the Ghostly Photographs: A. C. Doyle and spirit-photography industry

Build your own Anycast Network in Nine Steps (2016)

Inequality, Part VII: Crypto

Tim Cook Isn't Going Anywhere Soon, but an Apple Shake-Up Looms