frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

We Built a Language Model 14,000,000x Smaller Than GPT3 and Formally Verified It

https://github.com/dkypuros/atomic-lang-model
7•katosteven•6h ago

Comments

katosteven•6h ago
For the last few years, the AI world has been dominated by a single idea: bigger is better. But what if the future of AI isn't just about scale, but about precision, efficiency, and accessibility?

This is the story of the Atomic Language Model (ALM), a project that challenges the "bigger is better" paradigm. It’s a language model that is not just millions of times smaller than the giants, but is also formally verified, opening up new frontiers for AI.

The result of our work is a capable, recursive language model that comes in at under 50KB.

This project is led by David Kypuros of Enterprise Neurosystem, in a vibrant collaboration with a team of Ugandan engineers and researchers: myself (Kato Steven Mubiru), Bronson Bakunga, Sibomana Glorry, and Gimei Alex. Our ambitious, shared goal is to use this technology to develop the first-ever language architecture for a major Ugandan language.

https://github.com/dkypuros/atomic-lang-model/tree/main

From "Trust Me" to "Prove It": Formal Verification Modern LLMs are opaque black boxes validated empirically. The ALM is different. Its core is formally verified using the Coq proof assistant. We have mathematically proven the correctness of its recursive engine. This shift from experimental science to mathematical certainty is a game-changer for reliability.

The Team and the Mission: Building Accessible AI This isn't just a technical exercise. The ALM was born from a vision to make cutting-edge AI accessible to everyone, everywhere. By combining the architectural vision from Enterprise Neurosystem with the local linguistic and engineering talent in Uganda, we are not just building a model; we are building capacity and pioneering a new approach to AI development—one that serves local needs from the ground up.

Unlocking New Frontiers with a Lightweight Architecture A sub-50KB footprint is a gateway to domains previously unimaginable for advanced AI:

Climate & Environmental Monitoring: The ALM is small enough to run on low-power, offline sensors, enabling sophisticated, real-time analysis in remote locations. 2G Solutions: In areas where internet connectivity is limited to 2G networks, a tiny, efficient model can provide powerful language capabilities that would otherwise be impossible. Space Exploration: For missions where power, weight, and computational resources are severely constrained, a formally verified, featherweight model offers unparalleled potential. Embedded Systems & Edge Devices: True on-device AI without needing a network connection, from microcontrollers to battery-powered sensors. A Pragmatic Hybrid Architecture The ALM merges the best of both worlds:

A formally verified Rust core handles the grammar and parsing, ensuring correctness and speed. A flexible Python layer manages probabilistic modeling and user interaction. What's Next? This project is a testament to what small, focused, international teams can achieve. We believe the future of AI is diverse, and we are excited to build a part of that future—one that is more efficient, reliable, and equitable.

We've launched with a few key assets:

The Research Paper: For a deep dive into the theory , we are working on it. The GitHub Repository: The code is open-source. We welcome contributions! A Live Web Demo: Play with the model directly in your browser (WebAssembly). We'd love to hear your thoughts and have you join the conversation.

NitpickLawyer•6h ago
Could you add a link for the web demo? Couldn't find it in the repo.
dkypuros•4h ago
We’re working on it. Great feedback
icodar•5h ago
The next token prediction appears to be predicted based on fixed grammatical rules. However, modern LLMs learn the rules themselves. Did I misunderstand?
dkypuros•4h ago
We use a deliberately small, hand‑written grammar so that we can prove properties like grammaticality, aⁿbⁿ generation, and bounded memory. The price we pay is that the next‑token distribution is limited to the explicit rules we supplied. Large neural LMs reverse the trade‑off: they learn the rules from data and therefore cover much richer phenomena, but they can’t offer the same formal guarantees. The fibration architecture is designed so we can eventually blend the two—keeping symbolic guarantees while letting certain fibres (e.g. embeddings or rule weights) be learned from data.
dkypuros•4h ago
We’re eventually headed toward completely externalized data that feeds into the system

Coca-Cola will roll out cane sugar Coke in US after Trump push

https://www.bbc.com/news/articles/cev0rj193j3o
1•tartoran•21s ago•0 comments

Show HN: Scrybble sync – Sync files from reMarkable to Obsidian

https://scrybble.ink
1•azeirah•1m ago•0 comments

Stablecoin as a Service

https://www.bitgo.com/stablecoin-as-a-service/
1•wslh•3m ago•0 comments

Show HN: Cursor Stats – See your Cursor usage like GitHub contributions

https://chromewebstore.google.com/detail/cursor-stats/fdlfealdpfjfjnadhdobllmondagkcal
1•alexerm•3m ago•0 comments

Silicon Valley engineer admits theft of US missile tech secrets

https://www.theregister.com/2025/07/22/engineer_admits_trade_theft/
4•rntn•6m ago•0 comments

Why Your Enterprise Needs a Headless CMS – Now More

1•imcasy•8m ago•0 comments

New approach allows drone swarms to autonomously navigate complex environments

https://techxplore.com/news/2025-07-approach-drone-swarms-autonomously-complex.html
1•Brajeshwar•10m ago•0 comments

New Discovered 'Infinity Galaxy' Could Prove How Supermassive Blackholes Formed

https://www.wired.com/story/newly-discovered-infinity-galaxy-could-prove-how-ancient-supermassive-black-holes-formed/
1•Brajeshwar•10m ago•0 comments

AI-Designed Drugs Can Target 'Undruggable' Proteins in Cancer and Alzheimer's

https://singularityhub.com/2025/07/21/ai-designed-drugs-can-now-target-previously-undruggable-proteins-in-cancer-and-alzheimers/
1•Brajeshwar•11m ago•0 comments

Show HN: ModelMashup – Chat with Multiple LLMs Simultaneously

https://modelmashup.vercel.app/
1•martianmanhunt•13m ago•0 comments

Show HN: OLLS – A Vendor-Neutral Standard for LLM Inputs and Outputs

1•gpt4o•13m ago•0 comments

A book for non-tech leaders on venture studios and AI (based on real stories)

https://www.venturespeed.ai
1•aszig•13m ago•1 comments

Show HN: Create your color palettes in context, not isolation

https://colorpal-sage.vercel.app/
1•mazahermuraj•13m ago•0 comments

Ask HN: Mentor for Transition from Software Engineer to Sales Engineer

2•pedrodelfino•14m ago•0 comments

Vibecoding a High Performance System

https://andrewkchan.dev/posts/systems.html
5•reasonableklout•17m ago•0 comments

In Indonesia, a startup captures coolants to stop global warming

https://www.japantimes.co.jp/environment/2025/07/10/climate-change/indonesia-startup-coolants-global-warming/
1•PaulHoule•18m ago•0 comments

AI Market Clarity

https://blog.eladgil.com/p/ai-market-clarity
5•todsacerdoti•20m ago•0 comments

Engineering wisdom from a year of Talking Postgres

https://techcommunity.microsoft.com/blog/adforpostgresql/bits-of-wisdom-from-a-year-of-talking-postgres/4434622
2•clairegiordano•21m ago•0 comments

Hierarchical Reasoning Model

https://arxiv.org/abs/2506.21734
3•fofoz•21m ago•0 comments

Honda Has Entered a New Business – Launching Rockets into Space

https://www.slashgear.com/1918718/honda-first-successful-rocket-launch/
4•Bluestein•21m ago•1 comments

What If the Real Power of MRI Isn't in the Image at All?

https://medium.com/@jensenbox/what-if-the-real-power-of-mri-isnt-in-the-image-at-all-401c9e00ae1d
3•jensenbox•21m ago•0 comments

Visualizing multi-dimensional data: score distributions in English football

https://blog.engora.com/2025/07/visualizing-multi-dimensional-data.html
2•Vermin2000•22m ago•1 comments

How to dismantle windmill blades [video]

https://www.bilibili.com/video/BV1ma3BzNEoR/
1•markus_zhang•22m ago•1 comments

The Case for Sabotage

https://collectiveactionintech.substack.com/p/the-case-for-sabotage
2•chobeat•24m ago•0 comments

Citizen will share crime videos with the NYPD

https://www.theverge.com/news/711146/citizen-app-nyc-verified-partners-alerts-surveillance
5•gametorch•24m ago•2 comments

First Hubble Telescope Images of Interstellar Comet 3I/Atlas

https://bsky.app/profile/astrafoxen.bsky.social/post/3luiwnar3j22o
2•jandrewrogers•28m ago•0 comments

Gemini North Discovers Long-Predicted Stellar Companion of Betelgeuse

https://noirlab.edu/public/news/noirlab2523/
4•layer8•28m ago•0 comments

TDD Guard: Enforcing Test-Driven Development with Claude Code Hooks

https://nizar.se/tdd-guard-for-claude-code/
1•Nizoss•29m ago•0 comments

Trust Deterministic Execution to Scale and Simplify Your Systems [video]

https://www.youtube.com/watch?v=siEtKc6Sq2Y
1•todsacerdoti•29m ago•0 comments

Chess Llama

https://lazy-guy.github.io/chess-llama/
1•amrrs•30m ago•0 comments