As I previously mentioned, the backend is completely Python, but I have also built an Electron-based frontend and released a fully built macOS app. If you are on Windows, you can clone the repository and follow the README for instructions. All model components are fully configurable: depth, width, number of heads, training parameters, and generation settings. You can train locally using the Electron app, or on a remote GPU using the CLI. Note that when using the CLI, you will have to modify the source code to change these components.
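To give a rough idea of what "configurable" means here, this is a sketch of the kinds of knobs involved. The field names below are illustrative placeholders, not the exact keys Nous uses:

```python
# Illustrative config sketch; the actual option names in Nous may differ.
from dataclasses import dataclass

@dataclass
class ModelConfig:
    n_layers: int = 12       # depth
    d_model: int = 512       # width
    n_heads: int = 8         # attention heads

@dataclass
class TrainConfig:
    learning_rate: float = 3e-4
    batch_size: int = 32
    max_steps: int = 10_000

@dataclass
class GenerateConfig:
    temperature: float = 0.8
    top_k: int = 40
    max_new_tokens: int = 256
```

In the Electron app these are exposed as UI fields; with the CLI you edit the equivalent values directly in the source.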
Nous would be useful for experimenting with specific architectures, running controlled training setups, or teaching/learning how modern LLMs work (the README also explains how every part of the backend works, so it could be quite helpful to follow along with while reading the codebase).
I originally wanted to keep it NumPy-only, but switched fully to JAX after running into performance problems.
Currently, the app ships with a pre-trained model with 76.9M parameters, trained to a final loss of ~0.6. It also includes a Byte-Pair Encoding (BPE) tokenizer implementation, so you can train a tokenizer on a dataset of your choice.
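For anyone unfamiliar with BPE, here is a minimal byte-level training loop in plain Python that shows the core idea: repeatedly merge the most frequent adjacent pair of tokens until the vocabulary hits a target size. This is just a sketch of the concept, not the actual implementation or API in the repo:

```python
# Minimal byte-level BPE training sketch (illustrative, not the Nous API).
from collections import Counter

def train_bpe(text: str, vocab_size: int = 512) -> dict:
    ids = list(text.encode("utf-8"))   # base vocabulary: bytes 0..255
    merges = {}                        # (left, right) -> new token id
    next_id = 256
    while next_id < vocab_size:
        pairs = Counter(zip(ids, ids[1:]))
        if not pairs:
            break
        best = max(pairs, key=pairs.get)      # most frequent adjacent pair
        merges[best] = next_id
        # Replace every occurrence of the best pair with the new token id.
        merged, i = [], 0
        while i < len(ids):
            if i + 1 < len(ids) and (ids[i], ids[i + 1]) == best:
                merged.append(next_id)
                i += 2
            else:
                merged.append(ids[i])
                i += 1
        ids = merged
        next_id += 1
    return merges

# Example: learn a handful of merges from a tiny corpus.
merges = train_bpe("low lower lowest low low", vocab_size=264)
```

The real implementation in the repo handles encoding/decoding and saving the learned merges, but the training loop follows this same pattern.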
Everything is open-source. Repo link is above.