It sounds simple, and I'm not going to claim it was the most complicated thing ever, but there were quite a few steps involved in getting it right. Getting LLMs to do the cleanup task consistently is surprisingly hard. You wouldn't think it, but there are often multiple valid ways to break down a sentence.
An interesting part was structuring the model output so it could use the exact same tokens as the input. Most tokens carry a leading space, so you want the model's "desired output" to keep that leading space on each word too. That makes the task much easier, because the model doesn't have to learn a mapping between the space-prefixed and unprefixed versions of every word. Doing that instantly made my models perform much better.
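Here's a minimal sketch of what I mean, using tiktoken's GPT-2 encoding as a stand-in for whatever tokenizer you're actually training with (the sentence and target are made up for illustration):

```python
# Sketch: why keeping the leading space lets the target reuse the input's tokens.
# Assumes tiktoken is installed; any BPE-style tokenizer shows the same effect.
import tiktoken

enc = tiktoken.get_encoding("gpt2")

sentence = "The quick brown fox jumps over the lazy dog."
sentence_ids = enc.encode(sentence)

# Most words inside the sentence encode as tokens that include their
# leading space, e.g. " fox" rather than "fox".
for tok_id in sentence_ids:
    print(tok_id, repr(enc.decode([tok_id])))

# The same word with and without the leading space maps to different token IDs,
# so a target written without the space can't simply copy the input's tokens:
print(enc.encode("fox"))   # unprefixed -> different ID(s)
print(enc.encode(" fox"))  # space-prefixed -> the ID(s) seen inside the sentence

# If the desired output keeps its leading spaces, its token IDs line up with
# the input's, so the model copies tokens instead of translating between
# prefixed and unprefixed forms of every word.
target = " fox jumps over the lazy dog."
print(set(enc.encode(target)) <= set(sentence_ids))  # True
```

The point is just that "fox" and " fox" are different tokens to the model, so writing the training targets with the spaces stripped quietly turns a copy task into a translation task.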