Sakana Fugu

https://sakana.ai/fugu/
54•Finbarr•2h ago

Comments

nickandbro•1h ago
Very interesting. I wonder if it functions kinda similarly to how OpenRouter's Fusion API does. Hopefully it doesn't take too long to respond.
stygiansonic•1h ago
From a brief reading of what Fusion does: https://openrouter.ai/docs/guides/features/plugins/fusion

Looks like Fusion calls a bunch of models, uses an LLM to synthesize the results, and then passes that to another model for the final output.

Fugu looks like it's doing something different? Using an LLM earlier on in the flow as an orchestrator to decide which other LLMs to call. More coordinator than simply synthesizing results, and more "agentic".

It's interesting because it's all exposed behind a single OpenAI-compatible endpoint (Responses API?), so presumably someone could use this for one of their single agents. Now you have agent-of-agents, nested in some sense. The token usage increases accordingly!
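
A rough sketch of the fan-out-then-synthesize pattern described above, going only by this comment and not by OpenRouter's actual implementation. `model_a`, `model_b`, `synthesize`, and `fuse` are hypothetical stand-ins; a real setup would call actual LLM endpoints instead of these string stubs.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import Callable, List

# Hypothetical stand-ins for real model calls; each just tags the prompt.
def model_a(prompt: str) -> str:
    return f"A says: {prompt.upper()}"

def model_b(prompt: str) -> str:
    return f"B says: {prompt.lower()}"

def synthesize(prompt: str, drafts: List[str]) -> str:
    # Stand-in for an LLM synthesis step: here we simply join the drafts.
    return " | ".join(drafts)

def fuse(prompt: str, models: List[Callable[[str], str]]) -> str:
    # Fan the same prompt out to every model in parallel...
    with ThreadPoolExecutor(max_workers=len(models)) as pool:
        drafts = list(pool.map(lambda m: m(prompt), models))
    # ...then merge the drafts in one extra synthesis call.
    return synthesize(prompt, drafts)

result = fuse("Hello", [model_a, model_b])
print(result)
```

Note that every request costs N model calls plus one synthesis call, which is where the extra token usage comes from.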

ljlolel•59m ago
Yeah, similar, possibly even more steps / slower. I put together an all-open-source fusion at 1/3 the price of Fable: https://trustedrouter.com/blog/open-fusion-beats-fable-5

We open sourced it all

and will be releasing a similar orchestrator next week on TrustedRouter

ed_mercer•1h ago
So basically... openrouter?
alasano•46m ago
OpenRouter Fusion is basically ask N models + synthesizer step.

This is: ask a special orchestrator they built, which sits in front of a bunch of models, which model would suit the request best.

Regular Fugu seems to be just "pick the best model and route the request there"

Fugu Ultra can generate like a little mini workflow/plan instead to achieve a result

1. Ask GPT to derive the math.
2. Ask Opus to check for implementation/security issues.
3. Ask Gemini to synthesize or resolve disagreement.
4. Return final answer.

I could be wrong, but that's what it seems to be at a glance, so I think it's more dynamic than OpenRouter Fusion.
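
To make the difference concrete, here is a toy sketch of the two modes as this comment describes them: "route to one model" versus "run a small plan of steps". Everything here is an assumption for illustration: the `MODELS` stubs, the keyword-based `route`, and `run_plan` are hypothetical, and Sakana's orchestrator is presumably a learned model, not a keyword match.

```python
from typing import Callable, Dict, List

# Hypothetical model stubs; each one just tags whatever text it receives.
MODELS: Dict[str, Callable[[str], str]] = {
    "math": lambda text: f"[math] {text}",
    "review": lambda text: f"[review] {text}",
    "synth": lambda text: f"[synth] {text}",
}

def route(prompt: str) -> str:
    """Toy orchestrator: pick exactly one model name by keyword."""
    if "derive" in prompt or "prove" in prompt:
        return "math"
    if "security" in prompt or "bug" in prompt:
        return "review"
    return "synth"

def run_routed(prompt: str) -> str:
    # "Regular" mode in this sketch: one routing decision, one model call.
    return MODELS[route(prompt)](prompt)

def run_plan(prompt: str, plan: List[str]) -> str:
    # "Ultra"-style mode in this sketch: each step sees the previous output.
    out = prompt
    for name in plan:
        out = MODELS[name](out)
    return out

routed = run_routed("derive the bound")
planned = run_plan("derive the bound", ["math", "review", "synth"])
print(routed)
print(planned)
```

In the routed case the request touches a single model; in the planned case it passes through all three in order, so latency and token usage grow with the length of the plan.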

runeblaze•17m ago
links to two papers with at least enough apparent quality and novelty to get into ICLR 2026

> So basically... openrouter

:skull:

i now really wonder how many people of the public understood my thesis defense lol

ljlolel•1h ago
I’ve also developed and open-sourced a Mythos-level model using fusion/synthesis on TrustedRouter

https://trustedrouter.com/blog/fusion-evals-open-source

eevmanu•1h ago
Reminds me of <https://github.com/irthomasthomas/llm-consortium>
eevmanu•47m ago
Fugu Ultra <https://console.sakana.ai/models#fugu-ultra> sounds similar to GPT-5.5 Pro or Gemini 3.1 Deep Think.

Is there any official source that could confirm whether Fable (or Mythos) is parallelized test-time compute (like GPT-5.5 Pro) or a sparse Mixture-of-Experts (MoE) transformer combined with a multi-agent, inference-time compute scaling architecture (like Gemini 3.1 Deep Think)?

embedding-shape•57m ago
> Frontier-level performance without single-vendor dependency. [...] Plug collective intelligence directly into your workflows today with a single API.

Do multiple vendors run this "single API"? If not, how is this not replacing one single-vendor dependency with another?

bprasanna•38m ago
Isn't this what perplexity is?
adamnemecek•33m ago
Seems kinda underwhelming considering they raised like $400M.
nixosbestos•32m ago
AI noob question, is this like Amp? I just use Amp, I ask it to do neat stuff and it does it. I desperately need to invest in my AI skills but every day I open two new tabs and add it to "AI stuff" folder, and then go back to drowning in work to do.
GolfPopper•24m ago
This is a joke, right?
puttycat•17m ago
Can someone explain this in layman's terms? I don't understand any of it.
david_shi•11m ago
It's similar to this: https://openrouter.ai/blog/announcements/fusion-beats-fronti...

Basically, if you combine a bunch of near-frontier models (like GPT 5.5, etc) you can get performance that sometimes surpasses top line models like Claude's Fable.

Sakana seems to have a separate approach using a domain specific model to perform the model routing step.

david_shi•13m ago
Their research around building a domain-specific model is pretty cool. It's kind of like Karpathy's autoresearch, but pointed at deciding the optimal model to use at each step of the inference.

If cost becomes an even bigger problem, being able to choose "best performance possible" or "strong but cost effective" will be useful.

https://arxiv.org/pdf/2512.04695
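
A minimal sketch of what choosing between "best performance possible" and "strong but cost effective" could look like in a router. This is not taken from the linked paper; the `Candidate` records, their quality scores and costs, and the `min_quality` threshold are all made-up placeholders.

```python
from dataclasses import dataclass
from typing import List

@dataclass(frozen=True)
class Candidate:
    name: str
    quality: float  # made-up score in [0, 1], not a real benchmark
    cost: float     # made-up price per request

# Hypothetical model pool; the numbers are placeholders.
CANDIDATES: List[Candidate] = [
    Candidate("big", 0.95, 10.0),
    Candidate("mid", 0.90, 2.0),
    Candidate("small", 0.70, 0.5),
]

def pick(candidates: List[Candidate], mode: str, min_quality: float = 0.85) -> Candidate:
    if mode == "best":
        # Ignore cost entirely and take the highest-quality model.
        return max(candidates, key=lambda c: c.quality)
    if mode == "cost_effective":
        # Cheapest model that still clears the quality bar.
        good_enough = [c for c in candidates if c.quality >= min_quality]
        if not good_enough:
            raise ValueError("no candidate meets min_quality")
        return min(good_enough, key=lambda c: c.cost)
    raise ValueError(f"unknown mode: {mode}")

best = pick(CANDIDATES, "best")
cheap = pick(CANDIDATES, "cost_effective")
print(best.name, cheap.name)
```

With these placeholder numbers, the cost-effective mode gives up a little quality for a much lower price per request.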

prodigycorp•12m ago
ngl, I thought sakana.ai was doing cooler stuff than this. that said, the release of a product like this follows the natural intuition you get from using these models. The best way to use LLMs is to have at least two in your pocket, because the models do a good job at covering each other's assets and filling in obvious pockets of knowledge or coding styles that other models don't have.

it's interesting that they're offering it in the form of fixed-cost subscription plans too. My impression was that the first-party providers can do this because they have API inference margins to the tune of 80ish percent. Anyone else orchestrating on top of these models has to pass through these costs or eat them themselves.

holistio•10m ago
You pay $200/month to Anthropic, $200/month to OpenAI, $200/month to Cursor, $200/month to Google, and seeing that it didn't come to a nice round $1024/month, you pay $200/month to Sakana to coordinate it all, because why not.

While you're at it, feel free to send me $200 as well, I'll generate a crypto address ending with "AI".

holistio•6m ago
TIL: I just found out that base58 disallows I (capital i), l (lowercase L), O (capital o) and 0 (zero), so I could only generate GrxoJt4eNXE2QaQ55iPSa7hhiYdzCo8ZeAuokmh2Cai.

(don't send anything, sharing only because of the base58 fun fact I didn't know)
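
For anyone who wants to check the fun fact, here is a small sketch using the Bitcoin-style base58 alphabet. The `is_base58` helper is a hypothetical name, and it only checks that every character is in the alphabet; it does not validate the string as a real address.

```python
# Bitcoin-style base58 alphabet: digits and letters minus 0, O, I and l.
BASE58_ALPHABET = "123456789ABCDEFGHJKLMNPQRSTUVWXYZabcdefghijkmnopqrstuvwxyz"

def is_base58(text: str) -> bool:
    # Character-set check only; no checksum or length validation.
    return len(text) > 0 and all(ch in BASE58_ALPHABET for ch in text)

excluded = [ch for ch in "0OIl" if ch not in BASE58_ALPHABET]
addr = "GrxoJt4eNXE2QaQ55iPSa7hhiYdzCo8ZeAuokmh2Cai"

print(len(BASE58_ALPHABET))  # size of the alphabet
print(excluded)              # the four look-alike characters left out
print(is_base58("AI"))       # capital I is not allowed
print(is_base58(addr))       # lowercase "ai" is fine
```

Capital "AI" can never appear because of the missing "I", while a lowercase "ai" suffix passes the character check.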

Did my old job only exist because of fraud?

https://david.newgas.net/did-my-old-job-only-exist-because-of-fraud/
324•advisedwang•6h ago•148 comments

Apertus – Open Foundation Model for Sovereign AI

https://apertvs.ai/
266•T-A•6h ago•90 comments

Help I accidentally a wigglegram

https://lmao.center/blog/wiggle-accidents/
49•gregsadetsky•2d ago•5 comments

Sakana Fugu

https://sakana.ai/fugu/
56•Finbarr•2h ago•20 comments

Memory Safe Inline Assembly

https://fil-c.org/inlineasm
50•pizlonator•2d ago•8 comments

Everything is logarithms

https://alexkritchevsky.com/2026/05/25/everything-is-logarithms.html
150•E-Reverance•7h ago•30 comments

There is minimal downside to switching to open models

https://www.marble.onl/posts/cancel_claude.html
106•amarble•7h ago•64 comments

The Flat Curve Society

https://steve-yegge.medium.com/the-flat-curve-society-36c8b01eb33b
14•fbuilesv•1h ago•7 comments

1983 Northern Telecom Commodore Phone

https://www.oldtelephoneroom.ca/1983-northern-telecom-commodore-phone/
33•arexxbifs•3h ago•8 comments

Good results fine tuning a local LLM like Qwen 3:0.6B to categorize questions

https://www.teachmecoolstuff.com/viewarticle/fine-tuning-a-local-llm-to-categorize-questions
54•dev-experiments•5h ago•8 comments

How I play video games with spinal muscular atrophy

https://www.openassistivetech.org/how-i-actually-play-video-games-with-sma-the-tools-i-use-every-...
61•dannyobrien•3d ago•9 comments

Efficient C++ Programming for Modern C++ CPUs, Chapter 4/part 2

https://6it.dev/blog/infographics-operation-costs-in-cpu-clock-cycles-take-2-80736
9•birdculture•2d ago•0 comments

JSON-LD explained for personal websites

https://hawksley.dev/blog/json-ld-explained-for-personal-websites/
185•ethanhawksley•9h ago•55 comments

Identity verification on Claude

https://support.claude.com/en/articles/14328960-identity-verification-on-claude
629•bathory•15h ago•548 comments

PowerFox Browser

https://powerfox.jazzzny.me/
93•thisislife2•7h ago•30 comments

Beyond All Reason (Free Total Annihilation Inspired RTS)

https://www.beyondallreason.info
448•mosiuerbarso•16h ago•269 comments

Japanese verb conjugation the simple hard way

https://underreacted.leaflet.pub/3mmevu6woys27
50•valzevul•5h ago•65 comments

Prefer duplication over the wrong abstraction (2016)

https://sandimetz.com/blog/2016/1/20/the-wrong-abstraction
442•rafaepta•12h ago•301 comments

Show HN: HN Game Stories – mini-documentary of games that hit the front page

https://video.intellios.ai
5•coolwulf•1d ago•0 comments

Canadian government spent $46.8M on a secret Palantir contract

https://theijf.org/brief/canadian-palantir-contract-amendments-obd
10•logickkk1•2h ago•1 comments

From Combinatorial Mess to Linear Elegance: Architecting a Conversion Engine

https://blog.minimal.app/conversion-engine/
16•arthurofbabylon•4d ago•3 comments

HPV jabs cut risk of dying from cervical cancer before 30 to almost zero

https://www.theguardian.com/society/2026/jun/17/hpv-jabs-reduce-risk-dying-cervical-cancer-before...
204•toomuchtodo•4d ago•120 comments

Minecraft: Java Edition 26.2, the first version with Vulkan 1.2

https://www.minecraft.net/en-us/article/minecraft-java-edition-26-2
86•ObviouslyFlamer•4d ago•29 comments

The minimum viable unit of saleable software

https://brandur.org/minimum-viable-unit
142•brandur•11h ago•53 comments

Show HN: Recall – fully-local project memory for Claude Code

https://github.com/raiyanyahya/recall
87•mateenah•7h ago•61 comments

Show HN: Teach your kids perfect pitch

https://github.com/paytonjjones/bsharp
73•paytonjjones•15h ago•52 comments

Rent collections are down in New York

https://www.politico.com/news/2026/06/21/rent-collections-are-down-in-new-york-and-no-ones-sure-w...
47•JumpCrisscross•6h ago•113 comments

Wildcard (YC W25) is hiring an applied ML engineer

https://www.ycombinator.com/companies/wildcard/jobs/SEmo4di-founding-applied-ml-engineer
1•kaushikmahorker•11h ago

Show HN: Criterion Closet as a website – pull any of 1,247 films off the shelf

https://the-criterion-closet.vercel.app
63•olievans•1d ago•15 comments

FDA advisors unanimously vote to approve Moderna's mRNA after agency drama

https://arstechnica.com/health/2026/06/fda-advisors-unanimously-vote-to-approve-modernas-mrna-aft...
138•worik•6h ago•76 comments