frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: GoModel – an open-source AI gateway in Go; 44x lighter than LiteLLM

https://github.com/ENTERPILOT/GOModel/
47•santiago-pl•2h ago
Hi, I’m Jakub, a solo founder based in Warsaw.

I’ve been building GoModel since December with a couple of contributors. It's an open-source AI gateway that sits between your app and model providers like OpenAI, Anthropic or others.

I built it for my startup to solve a few problems:

  - track AI usage and cost per client or team
  - switch models without changing app code
  - debug request flows more easily
  - reduce AI spendings with exact and semantic caching
How is it different?

  - ~17MB docker image
    - LiteLLM's image is more than 44x bigger ("docker.litellm.ai/berriai/litellm:latest" ~ 746 MB on amd64)
  - request workflow is visible and easy to inspect    
  - config is environment-variable-first by default
I'm posting now partly because of the recent LiteLLM supply-chain attack. Their team handled it impressively well, but some people are looking at alternatives anyway, and GoModel is one.

Website: https://gomodel.enterpilot.io

Any feedback is appreciated.

Comments

anilgulecha•1h ago
how does this compare to bifrost - another golang router?
santiago-pl•1h ago
First of all, GoModel doesn't have a separate private repository behind a paywall/license.

It's more lightweight and simpler. The Bifrost docker image looks 4x larger, at least for now.

IMO GoModel is more convenient for debugging and for seeing how your request flows through different layers of AI Gateways in the Audit Logs.

anilgulecha•58m ago
That would be valuable if there's a commitment to never have a non-opensource offering under GoModel? If so, you can document it in the repo.
santiago-pl•54m ago
I would love to keep it open source forever, but I can't promise that for now. I've written a whole doc page about it if you're curious: https://gomodel.enterpilot.io/docs/about/license
Talderigi•1h ago
Curious how the semantic caching layer works.. are you embedding requests on the gateway side and doing a vector similarity lookup before proxying? And if so, how do you handle cache invalidation when the underlying model changes or gets updated?
giorgi_pro•48m ago
Hey, contributor here. That's right, GoModel embeds requests and does vector similarity lookup before proxying. Regarding the cache invalidation, there is no "purging" involved – the model is part of the namespace (params_hash includes the LLM model, path, guardrails hash, etc). TTL takes care of the cleanup later.
pjmlp•1h ago
Expectable, given that LiteLLM seems to be implemented in Python.

However kudos for the project, we need more alternatives in compiled languages.

santiago-pl•57m ago
Agree and thank you! Please let us know if you'd like to give it a try and if you miss any feature in GoModel.
tahosin•40m ago
This is really useful. I've been building an AI platform (HOCKS AI) where I route different tasks to different providers — free OpenRouter models for chat/code gen, Gemini for vision tasks. The biggest pain point has been exactly what you describe: switching models without changing app code.

One thing I'd love to see is built-in cost tracking per model/route. When you're mixing free and paid models, knowing exactly where your spend goes is critical. Do you have plans for that in the dashboard?

santiago-pl•30m ago
This comment looks like AI-generated.

However IIUC what you're asking for - it's already in the dashboard! Check the Usage page.

indigodaddy•28m ago
Any plans for AI provider subscription compatibility? Eg ChatGPT, GH Copilot etc ? (Ala opencode)
santiago-pl•3m ago
You are not the first person who has asked about it.

It looks like a useful feature to have. Therefore, I'll dig into this topic more broadly over the next few days and let you know here whether, and possibly when, we plan to add it.

rvz•17m ago
I don't see any significant advantage over mature routers like Bifrost.

Are there even any benchmarks?

Original GrapheneOS responses to WIRED fact checker

https://discuss.grapheneos.org/d/34369-original-grapheneos-responses-to-wired-fact-checker
78•ChrisArchitect•1h ago•43 comments

Laws of Software Engineering

https://lawsofsoftwareengineering.com
470•milanm081•5h ago•232 comments

As Oceans Warm, Great White Sharks Are Overheating

https://e360.yale.edu/digest/great-white-sharks-climate
86•speckx•1h ago•60 comments

Tim Cook's Impeccable Timing

https://stratechery.com/2026/tim-cooks-impeccable-timing/
164•hasheddan•5h ago•234 comments

Fusion Power Plant Simulator

https://www.fusionenergybase.com/fusion-power-plant-simulator
50•sam•2h ago•16 comments

John Ternus to become Apple CEO

https://www.apple.com/newsroom/2026/04/tim-cook-to-become-apple-executive-chairman-john-ternus-to...
2103•schappim•19h ago•1200 comments

Show HN: GoModel – an open-source AI gateway in Go; 44x lighter than LiteLLM

https://github.com/ENTERPILOT/GOModel/
48•santiago-pl•2h ago•13 comments

Show HN: VidStudio, a browser based video editor that doesn't upload your files

https://vidstudio.app/video-editor
162•kolx•4h ago•58 comments

Running a Minecraft Server and More on a 1960s Univac Computer

https://farlow.dev/2026/04/17/running-a-minecraft-server-and-more-on-a-1960s-univac-computer
92•brilee•3d ago•15 comments

A type-safe, realtime collaborative Graph Database in a CRDT

https://codemix.com/graph
98•phpnode•5h ago•29 comments

Tindie store under "scheduled maintenance" for days

https://www.tindie.com/
67•somemisopaste•3h ago•24 comments

Clojure: Transducers

https://clojure.org/reference/transducers
23•tosh•2d ago•4 comments

Your favorite brands got worse on purpose

https://www.worseonpurpose.com/p/your-favorite-brands-got-worse-on-purpose
131•neon_electro•2h ago•72 comments

MNT Reform is an open hardware laptop, designed and assembled in Germany

http://mnt.stanleylieber.com/reform/
189•speckx•1d ago•72 comments

Anthropic says OpenClaw-style Claude CLI usage is allowed again

https://docs.openclaw.ai/providers/anthropic
376•jmsflknr•12h ago•215 comments

Anthropic takes $5B from Amazon and pledges $100B in cloud spending in return

https://techcrunch.com/2026/04/20/anthropic-takes-5b-from-amazon-and-pledges-100b-in-cloud-spendi...
131•Brajeshwar•3h ago•130 comments

Leonardo, Borgia, and Machiavelli: A Fateful Collusion

https://www.historytoday.com/archive/leonardo-borgia-and-machiavelli-fateful-collusion
16•apollinaire•5d ago•0 comments

Slava's Monoid Zoo

https://factorcode.org/slava/monoids.html
32•luu•1d ago•6 comments

Kasane: New drop-in Kakoune front end with GPU rendering and WASM Plugins

https://github.com/Yus314/kasane
5•nsagent•38m ago•0 comments

Salmon exposed to cocaine and its main byproduct roam more widely

https://www.science.org/content/article/cocaine-pollution-gives-salmon-wanderlust
107•1659447091•11h ago•62 comments

A Roblox cheat and one AI tool brought down Vercel's platform

https://webmatrices.com/post/how-a-roblox-cheat-and-one-ai-tool-brought-down-vercel-s-entire-plat...
264•bishwasbh•12h ago•142 comments

The Beauty of Bonsai Styles

https://longwoodgardens.org/blog/2023-05-17/beauty-bonsai-styles
163•lagniappe•12h ago•30 comments

Less human AI agents, please

https://nial.se/blog/less-human-ai-agents-please/
85•nialse•9h ago•102 comments

Colorado River disappeared record for 5M years: now we know where it was

https://phys.org/news/2026-04-colorado-river-geological-million-years.html
4•wglb•23h ago•1 comments

High-Fidelity KV Cache Summarization Using Entropy and Low-Rank Reconstruction

https://jchandra.com/posts/hae-ols/
50•jchandra•2d ago•12 comments

Apple ignores DMA interoperability requests and contradicts own documentation

https://fsfe.org/news/2026/news-20260420-01.html
173•kirschner•5h ago•33 comments

Louis Zocchi, games industry pioneer, has died

https://icv2.com/articles/news/view/62176/r-i-p-louis-zocchi-the-godfather-dice
109•sgbeal•10h ago•48 comments

Qwen3.6-Max-Preview: Smarter, Sharper, Still Evolving

https://qwen.ai/blog?id=qwen3.6-max-preview
676•mfiguiere•1d ago•359 comments

How to make a fast dynamic language interpreter

https://zef-lang.dev/implementation
225•pizlonator•15h ago•44 comments

Vera C. Rubin Observatory has Discovered 11,000 New Asteroids

https://www.universetoday.com/articles/the-vera-c-rubin-observatory-has-discovered-11000-new-aste...
46•tcp_handshaker•3h ago•5 comments