frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

The Cult of LK99

https://www.youtube.com/watch?v=fj3WwMxUDZ8
1•EwanG•1m ago•1 comments

Norway Bans AI for Kids 6-13 in Schools

https://yipzap.com/norway-bans-ai-for-kids-6-13-in-schools-a-bold-move-against-tech-dependency/
1•noida•2m ago•0 comments

Exhaustive and Definitive Ranking of All NES Games Released in North America

https://8bitnintendo.science/
2•CharlesW•3m ago•0 comments

Fast-tracked power plants fuel AI boom, with little public scrutiny

https://www.reuters.com/business/energy/fast-tracked-power-plants-fuel-ai-boom-with-little-public...
1•JumpCrisscross•4m ago•0 comments

Freelance Hiring, Without the Chaos

https://hight.ai
1•YinkaIyiola•4m ago•0 comments

Futuristic Japanese Warship Is on the Market and Winning Fans

https://www.wsj.com/world/asia/this-futuristic-japanese-warship-is-on-the-market-and-winning-fans...
2•JumpCrisscross•5m ago•0 comments

Bill that would mandate AI chip location tracking gains industry support

https://www.nbcnews.com/tech/tech-news/chips-security-act-gains-industry-support-letter-rcna350500
3•elliotbnvl•12m ago•0 comments

Welcome to America, World Cup visitors. Don't forget to tip

https://www.axios.com/2026/06/21/world-cup-service-fees-tipping-culture
3•rguiscard•15m ago•0 comments

The Wheel of Life

https://dsernst.com/writing/2026/wheel-of-life
1•dsernst•15m ago•0 comments

Good results fine tuning a local LLM like Qwen 3:0.6B to categorize questions

https://www.teachmecoolstuff.com/viewarticle/fine-tuning-a-local-llm-to-categorize-questions
2•dev-experiments•19m ago•0 comments

The Revolution Will Not Be Digitized

https://lareviewofbooks.org/article/social-codes-tech-workers-class-identity-digital-capitalism/
1•colinb•21m ago•0 comments

Simple hard way to conjugate Japanese verbs

https://underreacted.leaflet.pub/3mmevu6woys27
2•valzevul•21m ago•0 comments

Show HN: Real-Life Deception Detection Without Uploading Video

https://github.com/WhissleAI/lie_detection_binary
1•ksingla025•24m ago•0 comments

Evaluating sugar-sweetened beverage tax effects

https://pmc.ncbi.nlm.nih.gov/articles/PMC12669988/
1•luu•27m ago•0 comments

I Canceled My French Tutor and Built an LLM Tool That Does It Better

https://alshe.substack.com/p/i-canceled-my-french-tutor-and-built
2•Anon84•28m ago•0 comments

PebbleOS

https://github.com/coredevices/PebbleOS
1•arbayi•29m ago•0 comments

Robust Jobserver

https://codeberg.org/mlugg/robust-jobserver/src/branch/main/spec.md
1•birdculture•31m ago•0 comments

The Deadly Rise of Giant Trucks and S.U.V.s

https://www.nytimes.com/interactive/2026/06/21/us/trucks-suv-pedestrian-crashes.html
8•xnx•32m ago•1 comments

Migrating from Claude to DeepSeek without breaking everything

https://blog.firetiger.com/migrating-from-claude-to-deepseek-without-breaking-everything/
2•eric_khun•34m ago•0 comments

Compass – guardrails and a hard budget cap for AI coding agents

https://github.com/dshakes/compass
1•chandu1221•36m ago•0 comments

The Crusade of Hormuz

https://www.historytoday.com/archive/history-matters/crusade-hormuz
2•Thevet•41m ago•0 comments

Real-Time GPS Tracking Station

https://www.gps-satellites.com/
1•gnabgib•45m ago•0 comments

Show HN: Fold-logging.nvim – fold logging and debug-print statements in Neovim

https://github.com/markosnarinian/fold-logging.nvim
1•markosn•45m ago•0 comments

Swift, Gay and Pope's season in the sun

https://www.theguardian.com/books/2026/jun/12/the-twitnam-summer-by-hester-grant-review-swift-gay...
1•Petiver•54m ago•0 comments

I just released Mango Launcher

https://www.mangolauncher.com/
2•PinyaApps•55m ago•0 comments

Street Traffic Regulation (1909)

https://rosap.ntl.bts.gov/view/dot/16295
1•willmeyers•56m ago•1 comments

I Gave an AI a Civilization to Run. It Built a Nuke – Launching CivBench

https://www.lwilko.com/blog/i-gave-an-ai-a-civilization
3•LiamWilko•58m ago•2 comments

AI Is Discovering the Doorman Fallacy [video]

https://www.youtube.com/watch?v=f-QzIum9bNU
1•CHB0403085482•1h ago•1 comments

Show HN: Tunr – Expose your local server in 3 seconds

https://github.com/ahmetvural79/tunr
2•ahvural•1h ago•0 comments

Improvements to Std:Format in C++26

https://mariusbancila.ro/blog/2026/06/19/improvements-to-stdformat-in-c26/
1•rbanffy•1h ago•0 comments
Open in hackernews

Apertus – Open Foundation Model for Sovereign AI

https://apertvs.ai/
96•T-A•1h ago

Comments

yreg•1h ago
previous thread: https://news.ycombinator.com/item?id=45108401
trvz•1h ago
The previous version of this model has been pretty bad, but claimed to adhere to copyright laws. However, based on my testing, that's not true either. So in my view this is completely useless.
embedding-shape•1h ago
As long as the following remains true, this release ends up a bigger contribution to science at large than most other models trained "behind closed doors":

> Fully open model: open weights + open data + full training details including all data and training recipes

coder543•42m ago
Is a recipe useful if no one likes it?

There are equally open, much more useful models out there: https://artificialanalysis.ai/?models=nvidia-nemotron-3-ultr...

simonw•53m ago
It uses fineweb, which is derived from Common Crawl, which is an unlicensed scrape of web pages.
throwaw12•1h ago
Looks like their instruct models are Llama3.1 fine tune from last year. Is there any progress on new models?

My last hope for soverign AI is from Chinese open models

kordlessagain•1h ago
Sovereign AI is not about using just one model. It's about using the right model for the right job, and getting them to talk through the solution TOGETHER before presenting the answer.

If you want to mix models like this, check out https://github.com/deepbluedynamics/nemesis8

atemerev•1h ago
I use it extensively. It is not ready for agentic use, but as a generic driving model for RAG use cases, it is pretty competent. You can build useful software with it.
MASNeo•35m ago
I use Apertus including as the driver for an agent, not a coding agent. Find it useful enough. What was your Challenge?
maxloh•1h ago
Other fully open LLMs include Allen AI's OLMo 3.1 and MBZUAI's K2 Think V2, both of which have released their full training pipelines and datasets.

Nvidia Nemotron is also an open training source model, though a portion of its dataset remains proprietary.

Quoting lambda's comment:

> Note that the Nemotron models are generally stronger than Olmo and K2 Think V2 (according to Artificial Analysis benchmarks), and there is a lot of overlap in their datasets (lots of datasets are based on the same sources with different filtering, Olmo and K2 Think V2 both have used some Nemotron datasets).

> But yeah, Nemotron is a modern and fairly capable LLM, even the 122b is more capable than Deepseek R1 (a 671b model) on most benchmarks, and there's also the recently released 550b Ultra now.

https://news.ycombinator.com/item?id=48492439

_pdp_•1h ago
I want to believe.
pferde•55m ago
For a model that claims to focus on many languages, it's quite unreliable when it comes to simple questions like "how to say X in language Y" or "how to conjugate verb X in language Y". It keeps hallucinating words that do not exist, and when corrected, it only hallucinates a new lie.
SwellJoe•53m ago
I like the idea, and it has become more pressing that everyone outside the US think about tech sovereignty because the US has become an unsafe place to keep your data, but the impression I get from Apertus is that it moves at the speed of a committee. I have no expectation they'll deliver a competitive model. At least, not competitive with current models. Maybe competitive with models a year ago (though they haven't even done that yet, right?).
nezuzen•37m ago
"the US has become an unsafe place to keep your data"

I empathize with this but curious what would make any other country a better safehaven for your data? I personally like the EU's approach to data safeguards, but are there other locales/data protections you have in mind that would keep your data "safe".

digitaltrees•21m ago
The rule of law exists in other countries in a way it does not in the US right now.
maxloh•51m ago
Great to see more fully open LLMs.

I think a problem with open-weight models is that while you can improve them, you are not going to create the next generation of LLMs by fine-tuning. We are at the mercy of frontier labs for access to SOTA LLMs. For example, Anthropic recently started requiring identity verification for Claude [0], same for OpenAI [1].

If one day China's distillation labs stop releasing their LLMs as open-weight, I doubt American labs will continue to release free LLM weights without that competition.

That's where fully open pipelines shine: they enable the community to create the next generation of SOTA LLMs. That is the only way LLMs truly become sovereign.

[0]: https://news.ycombinator.com/item?id=48618455

[1]: https://news.ycombinator.com/item?id=48618606

dofm•39m ago
> We are at the mercy of frontier labs for access to SOTA LLMs

I disagree with this use of SOTA, and this topic is why.

Anthropic and OpenAI have “cutting-edge” models. These are beyond the state of the art but they are closed, secretive, hard to quantify.

The “state of the art” is open source, open weights models that can be inspected, studied, shared and critiqued, because that is what is meant by “the art” —- it is the knowledge and principles and evidence and materials available to all. The “state of the art” is the highest point of that.

I wish we could make this distinction and stop blessing two secretive, unverifiable loss-making companies with so much power.

(Putting that aside, I suspect — without evidence, mind you - that the endless march to solving models by making them bigger is not the solution anyway.)

sockaddr•33m ago
Sorry but I think you’re requirement that something only be “the art” if any arbitrary person can critique it is off. The frontier labs are working on the state of the art but it’s just art that you aren’t allowed to see. Unfortunately.
dTal•21m ago
It's good that there is a movement for open LLMs, but it's not where the battleground is right now. The battleground is local vs service LLMs, and we are losing that battle badly despite all the software being here now and viable, entirely because UX sucks.

How many normal people do you know who use "ChatGPT"? A lot, probably.

How many even know what "Gemma" is, let alone have downloaded llama.cpp, a GGUF file from Hugginface, and run "llama-server" from a text console with all the correct command arguments? How many are thinking about this use case when speccing out their next computer? Where is the breathless marketing copy boasting x tok/s?

We are sleepwalking into slavery.

double0jimb0•17m ago
Yea, anyone who understands what makes products actually usable is opting to get paid for said skill.
idiotsecant•17m ago
Better UX does not buy you a datacenter farm to train state of the art cutting edge models. Right now the only people who can do that are the technobility class.
dTal•4m ago
It does not, but it might encourage more people to care. Worrying about training is a luxury when you are starting from a baseline of "OpenAI spies upon me and controls my access". Let's focus on getting every Tom, Dick and Harry 1) on board with LLMs, because they're happening, 2) habitually using local software.
wmf•10m ago
LM Studio
mrshu•9m ago
By far the most impactful product of the Apretus project are the people. To quote a memorable line from Dominique Paul (https://www.thisiscrispin.com/):

> What most people miss IMO is that this is not a team who is doing this for the fourth time like virtually any other LLM provider and who could learn from its own past experiences. I bet if the team would do another model training they could get way better results at one fourth of the costs.

anon373839•23m ago
> China's distillation labs

This notion that Chinese labs are merely distilling frontier models is quite an unwarranted slur. Those labs have published WAY more useful research than US labs on RL techniques, novel model architectures, training pipelines, etc. They have also hit intelligence-per-parameter densities that US labs have yet to attain.

Apart from that, merely training a model on outputs from another model, off policy and without the logits, doesn’t really work that well.

The Chinese labs know how to build frontier level models. GLM-5.2 shows that they no longer even need Nvidia chips to do it.