frontpage.

Show HN: SOTA NLP Models

https://huggingface.co/collections/anchpop/lexide-nlp-models
1•ChadNauseam•2h ago
Hi everyone, I needed to break sentences into their individual words and figure out what part of speech each word is. Explosion's spaCy models are absolutely incredible for English, clearly some top-tier engineering that I could never come close to, but for other languages they're quite weak. I created my own by taking spaCy outputs, cleaning them up with an LLM, and then fine-tuning a Gemma model on that. The result is extremely good, consistent output for 7 languages. The models are also much cheaper and more consistent than would be possible with ChatGPT. (For example, should "don't" be treated as "don't" or "do", "n't"? ChatGPT will pick one randomly.)

It sounds simple, and I'm not going to say it was the most complicated thing ever, but there were quite a few steps involved in getting it right. Getting LLMs to do the cleanup task consistently is very hard. You wouldn't think it, but there are often multiple ways to break down a sentence.

An interesting part was structuring the model output so it could use the exact same tokens as the input. Most tokens are prefixed by a space, so you want the model's "desired output" to also involve the words prefixed by a space. It makes the task much easier because the model doesn't have to learn the mapping between prefixed and unprefixed tokens. Doing that instantly made my models start performing much better.
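
To make the space-prefix idea concrete, here is a minimal sketch in plain Python. It is not the actual training format used for these models: the function names, the tab-separated layout, and the example tags are all assumptions for illustration. It only shows how each word in the target can keep the space that preceded it in the input, so that a tokenizer that folds leading spaces into tokens would see the same strings on both sides.

```python
import re


def split_keep_leading_space(sentence: str) -> list[str]:
    """Split on spaces, keeping each space attached to the word that follows it."""
    # " ?[^ ]+" matches an optional single space followed by a run of non-space characters.
    return re.findall(r" ?[^ ]+", sentence)


def format_target(pieces: list[str], tags: list[str]) -> str:
    """Build a hypothetical training target: one 'piece<TAB>tag' line per word."""
    if len(pieces) != len(tags):
        raise ValueError("need exactly one tag per piece")
    return "\n".join(f"{piece}\t{tag}" for piece, tag in zip(pieces, tags))


sentence = "The cat sat"
pieces = split_keep_leading_space(sentence)
print(pieces)  # ['The', ' cat', ' sat']

# Tags are supplied by hand here; in practice they would come from the cleaned-up data.
print(format_target(pieces, ["DET", "NOUN", "VERB"]))
```

Because the pieces keep their spaces, joining them gives back the original sentence for single-spaced input, which is a cheap sanity check on the training data.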

Neural networks and deep learning (2019)

http://neuralnetworksanddeeplearning.com/index.html
1•vinhnx•1m ago•0 comments

SanDisk laughs all the way to the bank as memory price hike drives $3B revenue

https://www.neowin.net/news/sandisk-laughs-to-the-bank-as-memory-price-hike-drives-3b-revenue-in-...
1•bundie•4m ago•0 comments

Ask HN: Future of dev experience is control center for coding agents?

1•nemath•5m ago•0 comments

Show HN: NovaEngine v4.0 – High-speed data deduplication for cloud logs

https://github.com/NovaCompress-dev/NovaEngine-v4
1•nova_engine_dev•7m ago•0 comments

Apple Almost Chose Anthropic Before Google Gemini

https://www.macrumors.com/2026/01/30/apple-almost-chose-different-siri-partner/
2•tosh•8m ago•0 comments

Classic 7 and Project Luna, Near-Perfect Mods of Windows 7/XP GUI for Windows 10

https://trackerninja.codeberg.page/post/classic-7-and-project-luna-are-nice-near-perfect-recreati...
1•XzetaU8•10m ago•0 comments

Church of Molt – Crustafarianism

https://molt.church/
1•_____k•11m ago•0 comments

Scrobble-CLI: log your vinyl record listens from terminal

https://github.com/weisserj/scrobble-cli
1•weisser•13m ago•0 comments

FOSDEM 2026 Live Streaming

https://fosdem.org/2026/schedule/streaming/
1•weinzierl•14m ago•0 comments

I built Spaceship – a minimal browser – macOS for now – pay what you want

https://healthytransition.replit.app/spaceship
1•ray_•18m ago•0 comments

Why AI coding agents feel powerful at first, then become harder to control

2•hoangnnguyen•25m ago•2 comments

A high mountain lizard from Peru: the highest-altitude reptile

https://herpetozoa.pensoft.net/article/61393/
1•thunderbong•35m ago•0 comments

The Mind of a Crypto Portfolio Manager: A Game Plan for $1000 in 2026

https://altcoindesk.com/perspectives/expert-opinions/crypto-portfolio-allocation-for-2026/article...
1•CapricornQueen•35m ago•0 comments

Self-Improving AI Skills

https://dri.es/self-improving-ai-skills
1•7777777phil•36m ago•0 comments

Claude 4.5 converted the PDF into a medium-length SKILL.md

https://github.com/featbit/featbit-skills/blob/main/.claude/skills/claude-skills-best-practices/S...
1•mikasisiki•37m ago•0 comments

Clawk.ai – Twitter for AI Agents

https://www.clawk.ai/
1•jurajmasar•51m ago•1 comments

Ask HN: What's so special about Sam Altman?

4•chirau•52m ago•2 comments

Show HN: Government Contracts API – Unified REST API for Federal Contract Data

https://govcontracts-beige.vercel.app
1•jaxmercer•57m ago•1 comments

Show HN: A Slack bot that summarizes decisions and ignores lunch talk

https://thread-sweeper.vercel.app
1•noruya•59m ago•1 comments

Starlink updates privacy policy to allow consumer data to train

https://finance.yahoo.com/news/musks-starlink-updates-privacy-policy-230853500.html
12•malchow•1h ago•1 comments

From HashHop to Memory-Augmented Language Models

https://huggingface.co/blog/codelion/reverse-engineering-magic-hashhop
2•codelion•1h ago•0 comments

I spent 5 years learning how to code, made real projects, only to be called AI slop?

1•butanol•1h ago•9 comments

Reference Target: having your encapsulation and eating it too

https://blogs.igalia.com/alice/reference-target-having-your-encapsulation-and-eating-it-too/
1•todsacerdoti•1h ago•0 comments

Moltbook: A social network where 32,000 AI agents interact autonomously

https://curateclick.com/blog/2026-moltbook-ai
3•czmilo•1h ago•1 comments

Show HN: I built COON, a code compressor that saves 30-70% on AI API costs

https://github.com/AffanShaikhsurab/COON
2•affanshaiksurab•1h ago•0 comments

Show HN: Mic Preamp Build with Cheap ECM

https://mubaraknative.github.io/build_instruction.html
1•nativeforks•1h ago•0 comments

A Sudden BeckerCAD 3D Pro Review (2021)

https://www.keypressure.com/blog/a-sudden-beckercad-review/
1•kenshoen•1h ago•1 comments

Show HN: Phage Explorer

https://phage-explorer.org/
15•eigenvalue•1h ago•0 comments

Discrete Distribution Networks: A novel generative model with simple principles

https://github.com/Discrete-Distribution-Networks/Discrete-Distribution-Networks.github.io/blob/m...
1•teleforce•1h ago•0 comments

Chill brain-music interface enhancing music chills with personalized playlists

https://www.sciencedirect.com/science/article/pii/S2589004225027695
2•1659447091•1h ago•0 comments