The mysterious Hy3 LLM is topping OpenRouter Model Rankings by a large margin

https://minimaxir.com/2026/05/openrouter-hy3/
92•freediver•22h ago

Comments

andai•21h ago
So basically, Hy3 is the cheapest decent model on OpenRouter, unless you use DeepSeek as the provider for DeepSeek V4 Flash, in which case DeepSeek's insane caching wins out. (And Hy3 is close-ish on the benchmarks.)
0xbadcafebee•19h ago
You need to use the DeepSeek API directly to gain the extra caching benefits. The DeepSeek provider on OpenRouter is only the 5th-cheapest for V4 Flash, so you have to specify the DeepSeek provider when calling OpenRouter. But DeepSeek's API discounts on its models only apply if you call DeepSeek directly. So anyone using OpenRouter to call DeepSeek models is actually losing quite a bit of money.
beacon294•19h ago
ZDR is also on by default and deepseek is not ZDR.
NitpickLawyer•16h ago
> The DeepSeek provider on OpenRouter is only the 5th-cheapest for V4 Flash

You might have the default settings on your account, which limit Deepseek as a provider. If you disable that feature you see them on openrouter as well (and they serve it at the same cost as their own API).

0xbadcafebee•16h ago
I just checked my settings and I have everything enabled. https://openrouter.ai/deepseek/deepseek-v4-flash?sort=price (per-1M price) shows DeepSeek provider as #5. https://openrouter.ai/deepseek/deepseek-v4-flash/pricing?sor... (effective price) shows them as #3. The effective price will change your total cost since each provider has a different price for input vs output vs cache, so what's #1 and #5 for one person could be #5 and #1 for somebody else, depending on their workload.

However, I just double checked, and OpenRouter's pricing page for Flash v4 with DeepSeek provider shows a cache hit rate of $0.0028, which is the same as on DeepSeek's official API pricing page ($0.0028), so they do seem to be the same price (assuming DeepSeek is able to pin your specific OpenRouter requests to the same DeepSeek server). OpenRouter adds 5% to that cost, but it still might be cheaper than the other providers.
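
To make the "effective price depends on your workload" point concrete, here is a minimal sketch. Everything in it is an assumption for illustration: the `Pricing` fields, the provider names, and all dollar figures are made up, and the `markup` parameter only models the 5% surcharge as described in this comment, not any documented billing rule.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Pricing:
    """Per-1M-token prices in USD (hypothetical figures only)."""
    input_miss: float  # uncached input tokens
    input_hit: float   # input tokens served from the provider's cache
    output: float      # generated tokens


def effective_cost(p: Pricing, input_tokens: int, output_tokens: int,
                   cache_hit_ratio: float, markup: float = 0.0) -> float:
    """Total USD cost of one workload, with an optional router markup."""
    hit = input_tokens * cache_hit_ratio
    miss = input_tokens - hit
    base = (miss * p.input_miss + hit * p.input_hit
            + output_tokens * p.output) / 1_000_000
    return base * (1.0 + markup)


# Hypothetical providers, not real quotes.
PROVIDERS = {
    "cheap_input": Pricing(input_miss=0.10, input_hit=0.10, output=0.30),
    "cheap_cache": Pricing(input_miss=0.14, input_hit=0.01, output=0.30),
}


def rank(input_tokens: int, output_tokens: int, cache_hit_ratio: float) -> list:
    """Provider names ordered from cheapest to most expensive."""
    costs = {name: effective_cost(p, input_tokens, output_tokens, cache_hit_ratio)
             for name, p in PROVIDERS.items()}
    return sorted(costs, key=costs.get)
```

With these invented numbers, a workload with no cache hits ranks `cheap_input` first, while a workload with a 90% hit ratio ranks `cheap_cache` first, which is the reordering described above.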

Also just found out OpenRouter has a new feature "Response Caching" where they can cache identical requests and return them immediately with no billing. The entire request must be identical, though, not just a prefix, and you have to enable this feature. I don't know who would need to send multiple identical requests, but it's better than nothing?
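
The difference between whole-request caching and prefix caching can be sketched as follows. This is not OpenRouter's implementation: `request_key` and `ExactMatchCache` are hypothetical names, and the only assumption taken from the comment is that the entire request must be identical for a hit.

```python
import hashlib
import json


def request_key(payload: dict) -> str:
    """Hash the whole request; any differing field yields a different key."""
    canonical = json.dumps(payload, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256(canonical.encode("utf-8")).hexdigest()


class ExactMatchCache:
    """Returns a stored response only when the full request matches."""

    def __init__(self):
        self._store = {}
        self.hits = 0
        self.misses = 0

    def get_or_call(self, payload: dict, call):
        key = request_key(payload)
        if key in self._store:
            self.hits += 1  # identical request: no upstream call needed
            return self._store[key]
        self.misses += 1
        result = call(payload)  # upstream call, which would be billed
        self._store[key] = result
        return result
```

Under this scheme a request that merely shares a long prefix with an earlier one (for example, the same system prompt followed by a new user message) is a miss, which is why it helps far less than provider-side prefix caching for typical agent workloads.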

NitpickLawyer•15h ago
Interesting, it seems we have some providers offering dsv4-flash cheaper than ds themselves. For the full model it's the other way around, all 3rd party providers are 2x+ more expensive.
0xbadcafebee•5h ago
The cheaper ones are fp4 and fp8 whereas I assume DeepSeek provider is unquantized, so that probably accounts for it. DeepSeek also doesn't necessarily have the cheapest hardware, other providers could be using it as a loss leader, etc
Aurornis•20h ago
> Two new models are now beating LLM darling Claude in terms of token usage and by more than 50%?

Time for a reminder that OpenRouter leaderboards only show tokens sent through OpenRouter, which most Anthropic API users don’t use.

svantana•12h ago
I would think that's true for all the models on OR. The data is skewed for sure, but it's interesting nonetheless.
killingtime74•10h ago
Are you next going to say YouTube rankings don't take into account videos that aren't on YouTube and Spotify rankings don't take into account songs that aren't on Spotify?
0xbadcafebee•20h ago
> it makes sense that a cheaper model would prevail, but only if it offered similar quality

You're trying to think logically, which has no place in an AI discussion. :) People just jump to whatever the latest model is. Plenty of people also prefer price to "quality" (which is very subjective). It's new, it's cheap, so people use it. It's likely people will stop using it when something else is cheaper and/or newer.

olmo23•12h ago
Since my employer pays for it, I just select the latest and greatest.
vessenes•19h ago
Since there’s only one inference provider it could be a recycling/ad experiment. The similar usage between trial and paid periods would be explained by this as well.
simonw•18h ago
First model I've tried that gave me back HTML with a "Change Pelican Color" button: https://static.simonwillison.net/static/2026/hy3-preview-pel...

(Transcript: https://gist.github.com/simonw/c2a0d8ecd3056a2681319eae8fc3f...)

fragmede•17h ago
Haha does it get bonus points for the extra button, or does it fail because html != SVG?
dodslaser•17h ago
Any bonus points for the color are immediately subtracted because the "animate wheels" button leaves the wheels stationary and makes the sun rotate.
MostlyStable•17h ago
I wonder if it is actually animating the wheels as well, but just managed to match up the spin rate to the gap size.
fragmede•17h ago
Hy3 is a Scandinavian model, and is leaking that out via Norse mythology about Sol being a wheel!
cicko•16h ago
simonw•18h ago
OpenRouter rankings frustrate me, because they show the total number of tokens but they provide no indication of how many unique users a model has.

Which means if a surprise model tops the leaderboard one week we can never be sure if it was because a single whale user pushing billions of tokens a day switched to it, or if it represents a genuine community trend towards that model.
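
As an illustration of why a token total alone cannot distinguish the two cases, here is a small sketch. The per-user breakdown is exactly the data the rankings do not publish, so the input and the function name `concentration` are hypothetical.

```python
def concentration(tokens_by_user: dict) -> dict:
    """Summarise how concentrated token usage is across users."""
    total = sum(tokens_by_user.values())
    if total == 0:
        return {"total": 0, "unique_users": 0, "top_share": 0.0}
    active = {u: t for u, t in tokens_by_user.items() if t > 0}
    return {
        "total": total,
        "unique_users": len(active),
        # Fraction of all tokens that came from the single largest user.
        "top_share": max(active.values()) / total,
    }
```

Two made-up populations with the same total, one dominated by a single whale and one spread over a hundred small users, produce identical leaderboard numbers but very different `unique_users` and `top_share` values.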

senordevnyc•14h ago
Agreed. My little solo dev SaaS app’s production pipelines push almost two billion tokens a day.
senordevnyc•7h ago
Haha, I never tire of the AI haters downvoting stuff like this.

Down with reality!!

svantana•13h ago
Also, while we're pitching new features to openrouter, I'd like to see a "$ spent" chart, which would remove all these huge freebie spikes. It looks like it would be pretty much dominated by claude.
numlocked•35m ago
(openrouter co-founder here)

Yeah we should do something to indicate cardinality. I can share that there can often (I'm talking generally; not related to this model in particular) be e.g. a very large app that can be pushing a lot of volume. But in almost all cases that app has a large number of end users. Hypothetically, for instance, would Cursor be considered one user, or millions?

Will think about it! Thanks for the feedback.

bandrami•17h ago
For the life of me I will never understand the thought process that leads you to say "we don't really know who developed this LLM but I'm going to feed all of my business's data to it"
ddalex•17h ago
what can it do ? it's just a big set of numbers, if you trust the host that's good enough
what266262•17h ago
If you are ok with everything being fed into it being stored forever I guess it’s no problem. I don’t see how you trust them if you don’t know them.
Dylan16807•17h ago
Who is "them" here? The developers and the hosts are not the same.
bandrami•16h ago
(And either one is a threat vector)
ddalex•11h ago
where would it be stored ? it's just a big set of numbers.
Mashimo•17h ago
If you Code open source projects anyway, might give it a spin.
freakynit•17h ago
This was originally a 400+B param model which was later reduced to 295B, which was considered the "optimal zone".

https://www.mdshare.online/s/uend0pj3og_A_rgcxzINf

zone411•17h ago
I’ve tested this model on four of my benchmarks:

https://github.com/lechmazur/buyout_game 10th out of 36.

https://github.com/lechmazur/pact/ 14th out of 25.

https://github.com/lechmazur/nyt-connections/ 60th out of 81.

https://github.com/lechmazur/debate 16th out of 29.

CamperBob2•51m ago
Would be interesting to see the 27B dense Qwen 3.6 model thrown into the mix.
baxtr•12m ago
Good stuff!

Is there a reason you change the leaderboard graphs for the third and fourth one?

Also: would be great to have an overview page with a summary over all tests, like a total score or similar.

thot_experiment•16h ago
Tried this extensively in OpenCode and haven't used it once since Gemma 4 came out. It got into thought loops and did stupid edits I didn't ask for more often than the local 31b model. One of the worst "frontier" models I've ever tried.
cicko•16h ago
How is it a "mysterious" model? It's Tencent's Hy3?
theanonymousone•16h ago
My question as well. Isn't Tencent a very well-known company? Maybe the mystery is in the model itself?
alecco•15h ago
PSA: Don't use OpenRouter for DeepSeek V4 as it messes up your caching. Use DeepSeek API directly and you'll get 2x to 3x more cached tokens.
numlocked•35m ago
Can you share more? I'm with OpenRouter and we would love to address this! We don't see this in our own testing, I don't believe -- but will share this feedback and dig in.
lithiumii•15h ago
What's so mysterious? Isn't it from Tencent?
gmerc•12h ago
Very mysterious: https://huggingface.co/tencent/Hy3-preview
segmondy•9h ago
High token usage cuz it's free doesn't count
That depends on the perspective. If you're on the Sun, the wheels rotate around you.
postepowanieadm•15h ago
ROTFL
Garlef•15h ago
Judging from the dotted trajectory lines, it even "thought" about giving the bike a wobble.

(But maybe that's just my interpretation based on something else going wrong in the animation)

preek•12h ago
It actually rendered an SVG inline in the HTML page. I just tested the SVG and it renders itself just fine, including colors. So, tbh, I'd say the task has been properly achieved.
embedding-shape•8h ago
Maybe I'm just extremely nitpicky, but I'd consider that a failure, as the prompt is asking for SVG, not HTML.

Bit like asking for CSS and then getting a HTML file back with the CSS embedded, that was not what I was asking for!

prmph•32m ago
Welcome to the ambiguities of natural language
cwmoore•10h ago
But…and I’m sure I’m not alone here…that is a snowman, and what it is on is not a bicycle.

What do we think we are doing with this life?

PunchyHamster•39m ago
...the animate wheels button makes sun start to spin
martinald•31m ago
Hi! Big fan of OpenRouter and the data you provide. It'd be awesome if you would consider providing volume of tokens per hour, mostly for my own curiosity as to quite how peaky demand is.

Thanks!

simonw•27m ago
I'd consider Cursor one user because it's one entity that made an editorial decision about which model to make available to their own community.

If you treated Cursor as millions of users it might look like millions of people independently chose a new model when actually it was Cursor making the choice for them - and the thing I care most about is how many choices were made that selected a model and put it above the others.

est•17h ago
> I'm going to feed all of my business's data to it

Your business data is probably worthless, even considered harmful for the pretrain corpus.

Your interactions and decision making process are the most valuable parts of the whole business.

bandrami•16h ago
I assure you my business's data is not remotely worthless which is why there are pretty strict laws and regulations about what we can do with it
TZubiri•15h ago
>Your business data is probably worthless

please tell me you are not in charge of the data of any business I'm a client of

est•15h ago
to clarify, probably worthless to AI vendors, but might be useful for third-parties.
TZubiri•15h ago
Third parties that can be clients of the AI vendor...
selcuka•12h ago
If it's worthless to AI vendors, they won't include it in the training corpus, so third parties won't have access to it.
estearum•11h ago
They're alluding to something more like espionage or just selling the interesting stuff you put in the text box.
TZubiri•4h ago
Wow I thought this was quite obvious, apparently not, so I'll explain.

An LLM provider sells usage of their model. You use it to write code. Other clients use it to write code as well. If the LLM provider trains with user data, then the usage benefits other users. If you pay the company to generate code, then by definition it is useful, and it is highly likely that other customers care about it.

Replace writing code with anything, a lawyer, a psychologist, a confessional. The IO is inherently useful to users of the same category.

That is to say nothing of adversarial use, that is, being useful because a counterparty might find it useful, so an attacker might find common code patterns, a lawyer might see what the opposition might be advised, a boy might see what a girl asks or gets advised, etc..

If this sounds too complex to you, just think of training on data as exfiltration with added steps, because that's what it is

estearum•4h ago
Oh well this is a bad argument. I made a mistake by assuming you made a good argument instead.
bandrami•9h ago
The worry is direct exfiltration, not training
TZubiri•4h ago
But it isn't worthless because the user is paying for that, and third parties are paying for that as well. Unless the input output is completely different, which it's not because you are human, and I bet you have a profession which other humans have, and many other qualities which you share with other humans.

In any case, relying on the chance that the LLM inference won't train on your data because of its presumably low value is as good a strategy as crossing your fingers or venerating the god of rain. You should be relying on contractual clauses at least when including professional and client data.

elpocko•13h ago
Could be! Let's check. I just need your name and address, your SSN, a list of businesses you are a client of, and a DNA sample.
kirtivr•15h ago
You don't need to know who developed the LLM - whether it was Google or OpenAI.

What you need to know is who is the provider for the LLM, and whether their endpoints are zero data retention enabled and opted out of training. OpenRouter gives you an easy way to control this.

koiueo•15h ago
How can openrouter control what LLM provider does with your data on their side?
kirtivr•15h ago
OpenRouter and the provider sign a contract clearly specifying how input data is to be handled.

It's the same way we trust OpenAI to not train on our data if we've opted out although there is no control on whether they can retain the data indefinitely.

lmf4lol•15h ago
I really don't want to be a cynic, but those guys gave a flying f””” about copyright while scraping the whole internet. How can I ever trust them to respect the opt-out setting? I can't. Thieves be thieves.

And even if they don't train on the data, who guarantees us they don't let another AI model analyse all the data, exfiltrating all kinds of intelligence and using it? I can only imagine what OpenAI and Anthropic know….

astrange•14h ago
Scraping the internet isn't a copyright violation. Using it for LLM training is much more transformative than Google and Internet Archive, which are legal.
alfiedotwtf•13h ago
To be honest, this is the first time someone has spelt it out in a nicely succinct paragraph.

And just like that, I totally agree with you

estearum•11h ago
Except it ignores the entire premise of copyright which is to protect incentives to create original work, which Google does not destroy and which LLMs (very loudly and proudly) try to do.

There are several components of the Fair Use test, "transformation" is just one of them. The most important dimension is the effect on the market, i.e. the effect on incentives.

You probably shouldn't base your legal analysis on pithy internet comments regardless of how succinct or agreeable they are to you.

jazzyjackson•8h ago
You're right, scraping is legally protected. It's reproducing verbatim text that's a violation, which is why LLMs still clumsily refuse to produce song lyrics. They are capable of copyright violations and have to be 'aligned' not to get their providers sued.
koiueo•14h ago
Contracts mean shit if they are not enforceable.

Ask yourself

1. How would you know the provider has violated the contract?

2. How could you prove it?

3. Why would OpenRouter take your side in this (unlike your example with OpenAI, you're not a signing party)?

4. How would OpenRouter enforce the contract after all three above are somehow resolved in your favor?

IANAL, but IMO it's all a legal theater.

EDIT: formatting

lmf4lol•15h ago
This is not entirely true and ignoring a couple of potential attack vectors like Data Poisoning: https://arxiv.org/abs/2408.12798

It's of course highly dependent on the use case and the environment, but simply saying that the only important part is to know where the data goes is too simple.

WithinReason•13h ago
It's from Tencent, says it in the article:

https://hy.tencent.com/research/hy3

bandrami•13h ago
Right but Tencent is a massive half-state-controlled holding company so that's not really helpful.
minraws•12h ago
OpenAI & Anthropic are deeply in bed with US govt, and they need US govt approval before model releases, and all US Companies under various acts need to share data with the govt.

I mean sure there are investors and a little more openness, but with the example of Mythos we don't even know if the public will get access to the "good" stuff because it's too dangerous.

If your only opinion on trusting these companies more than one based in China is, they are Chinese then good luck, all the best.

bandrami•12h ago
Well, I mean, just as a legal question I'm not allowed to use Chinese software at work, so yeah that's kind of definitive for me
nl•12h ago
> and they need US govt approval before model releases

This isn't the case (yet).

irthomasthomas•11h ago
It is for models trained with 10^26 flops. Anthropic confirmed Mythos was less than this. You could estimate the upper bound on model size from this.
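
A rough way to turn a compute ceiling into a size bound, assuming the common approximation that training a dense transformer costs about 6 FLOPs per parameter per token (C ≈ 6·N·D). That rule of thumb and the token count in the usage note are assumptions brought in for illustration; neither comes from this thread, and for mixture-of-experts models N would be closer to the active parameter count.

```python
def max_params(compute_flops: float, training_tokens: float) -> float:
    """Upper bound on parameter count N from C ~= 6 * N * D."""
    if training_tokens <= 0:
        raise ValueError("training_tokens must be positive")
    return compute_flops / (6.0 * training_tokens)
```

For example, `max_params(1e26, 20e12)` gives the bound for a hypothetical 20-trillion-token training run; fewer training tokens would permit a larger model under the same compute ceiling, so the bound is only as good as the token estimate.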
nl•9h ago
That's the Biden executive order. It's notify only - the company must tell the government but the government doesn't approve or allow the release.
irthomasthomas•8h ago
Ah yeah that sounds right.
estearum•11h ago
The difference is "the various acts" in the US are things that are largely very hard to do, extremely limited in scope, and companies who dispute the government's propriety can (and do) go to court to fight it.

Sure "China bad, US good" is naive, but certainly not more naive than suggesting that companies and individuals have similar rights and protections as each other.

> and they need US govt approval before model releases

This is just not true and it would be a gigantic legal battle to make it true against the model companies' wishes, which is indicative of your entire misunderstanding here.

adrian_b•10h ago
There was recently some announcement from the US govt itself (after the Mythos announcement) that they were pondering about allowing model releases from now on only after approving them.

So it may not be strictly true for the moment, but it is certainly something that the current US govt can mandate at any time.

estearum•9h ago
The US government just saying they were pondering something is:

1) Far from them actually trying to do it

2) Very, very far from them actually doing it successfully

The US government absolutely cannot "just tell" private entities what products they're allowed to create and sell, and the fact that LLMs are arguably a form of expression will make these particular products extremely hard to regulate – especially as a broad "government checkpoint" on incremental product updates.

In China, it really is as simple as the government deciding that it doesn't like your products and ta-da, you can no longer sell them.

It's beyond naive to act like these are similar in any meaningful sense.

Danox•6h ago
Nonsense, the genie is out of the bottle worldwide and it isn’t going back in. Due to the activity of the current US government, America’s standing is declining. Going into the future, most countries are going to hedge against the United States and whatever it says; the good old days (goodwill/the small benefit of the doubt) are gone.

The AI oligarchs have no loyalty when it comes to making money, and they will drop the king at their first opportunity, and the king in return will do the same.

throawayonthe•10h ago
but we know who they are? how is this relevant
st3fan•10h ago
How do you “feed data into a model” ? Use the correct terminology and concepts please. It is important.