frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: GoModel – an open-source AI gateway in Go; 44x lighter than LiteLLM

https://github.com/ENTERPILOT/GOModel/
49•santiago-pl•2h ago
Hi, I’m Jakub, a solo founder based in Warsaw.

I’ve been building GoModel since December with a couple of contributors. It's an open-source AI gateway that sits between your app and model providers like OpenAI, Anthropic or others.

I built it for my startup to solve a few problems:

  - track AI usage and cost per client or team
  - switch models without changing app code
  - debug request flows more easily
  - reduce AI spendings with exact and semantic caching
How is it different?

  - ~17MB docker image
    - LiteLLM's image is more than 44x bigger ("docker.litellm.ai/berriai/litellm:latest" ~ 746 MB on amd64)
  - request workflow is visible and easy to inspect    
  - config is environment-variable-first by default
I'm posting now partly because of the recent LiteLLM supply-chain attack. Their team handled it impressively well, but some people are looking at alternatives anyway, and GoModel is one.

Website: https://gomodel.enterpilot.io

Any feedback is appreciated.

Comments

anilgulecha•1h ago
how does this compare to bifrost - another golang router?
santiago-pl•1h ago
First of all, GoModel doesn't have a separate private repository behind a paywall/license.

It's more lightweight and simpler. The Bifrost docker image looks 4x larger, at least for now.

IMO GoModel is more convenient for debugging and for seeing how your request flows through different layers of AI Gateways in the Audit Logs.

anilgulecha•1h ago
That would be valuable if there's a commitment to never have a non-opensource offering under GoModel? If so, you can document it in the repo.
santiago-pl•1h ago
I would love to keep it open source forever, but I can't promise that for now. I've written a whole doc page about it if you're curious: https://gomodel.enterpilot.io/docs/about/license
Talderigi•1h ago
Curious how the semantic caching layer works.. are you embedding requests on the gateway side and doing a vector similarity lookup before proxying? And if so, how do you handle cache invalidation when the underlying model changes or gets updated?
giorgi_pro•1h ago
Hey, contributor here. That's right, GoModel embeds requests and does vector similarity lookup before proxying. Regarding the cache invalidation, there is no "purging" involved – the model is part of the namespace (params_hash includes the LLM model, path, guardrails hash, etc). TTL takes care of the cleanup later.
pjmlp•1h ago
Expectable, given that LiteLLM seems to be implemented in Python.

However kudos for the project, we need more alternatives in compiled languages.

santiago-pl•1h ago
Agree and thank you! Please let us know if you'd like to give it a try and if you miss any feature in GoModel.
tahosin•52m ago
This is really useful. I've been building an AI platform (HOCKS AI) where I route different tasks to different providers — free OpenRouter models for chat/code gen, Gemini for vision tasks. The biggest pain point has been exactly what you describe: switching models without changing app code.

One thing I'd love to see is built-in cost tracking per model/route. When you're mixing free and paid models, knowing exactly where your spend goes is critical. Do you have plans for that in the dashboard?

santiago-pl•42m ago
This comment looks like AI-generated.

However IIUC what you're asking for - it's already in the dashboard! Check the Usage page.

indigodaddy•40m ago
Any plans for AI provider subscription compatibility? Eg ChatGPT, GH Copilot etc ? (Ala opencode)
santiago-pl•15m ago
You are not the first person who has asked about it.

It looks like a useful feature to have. Therefore, I'll dig into this topic more broadly over the next few days and let you know here whether, and possibly when, we plan to add it.

rvz•30m ago
I don't see any significant advantage over mature routers like Bifrost.

Are there even any benchmarks?

Show HN: GoModel – an open-source AI gateway in Go; 44x lighter than LiteLLM

https://github.com/ENTERPILOT/GOModel/
53•santiago-pl•2h ago•14 comments

Show HN: VidStudio, a browser based video editor that doesn't upload your files

https://vidstudio.app/video-editor
167•kolx•4h ago•60 comments

Show HN: Daemons – we pivoted from building agents to cleaning up after them

https://charlielabs.ai/
10•rileyt•27m ago•1 comments

Show HN: Zero-allocation embedded security in Rust (fits in 256KB Flash)

https://github.com/craton-co/craton-shield
4•victor-craton•1h ago•0 comments

Show HN: Mediator.ai – Using Nash bargaining and LLMs to systematize fairness

https://mediator.ai/
112•sanity•1d ago•59 comments

Show HN: WeTransfer Alternative for Developers

https://dlvr.sh/
17•mariusbolik•8h ago•4 comments

Show HN: Flight Risk: Can you break an AI agent?

https://ctf.demo.lorikeetcx.ai/
2•tetrakai•4h ago•0 comments

Show HN: Holos – QEMU/KVM with a compose-style YAML, GPUs and health checks

https://github.com/zeroecco/holos
51•zeroecco•19h ago•22 comments

Show HN: Prompt-to-Excalidraw demo with Gemma 4 E2B in the browser (3.1GB)

https://teamchong.github.io/turboquant-wasm/draw.html
156•teamchong•2d ago•61 comments

Show HN: Run TRELLIS.2 Image-to-3D generation natively on Apple Silicon

https://github.com/shivampkumar/trellis-mac
198•shivampkumar•1d ago•34 comments

Show HN: DSS, a new human-readable and plain format for XLS and spreadsheets

https://github.com/Datastripes/DataSheetStandard/
5•vinserello•9h ago•0 comments

Show HN: Git Push No-Mistakes

https://github.com/kunchenguid/no-mistakes
14•akane8•22h ago•6 comments

Show HN: Shader Lab, like Photoshop but for shaders

https://eng.basement.studio/tools/shader-lab
159•ragojose•4d ago•46 comments

Show HN: Faceoff – A terminal UI for following NHL games

https://www.vincentgregoire.com/faceoff/
128•vcf•1d ago•43 comments

Show HN: MDV – a Markdown superset for docs, dashboards, and slides with data

https://github.com/drasimwagan/mdv
148•drasim•3d ago•53 comments

Show HN: MCPfinder – An MCP server that finds and installs other MCP servers

https://mcpfinder.dev/
7•coderai•19h ago•0 comments

Show HN: A lightweight way to make agents talk without paying for API usage

https://juanpabloaj.com/2026/04/16/a-lightweight-way-to-make-agents-talk-without-paying-for-api-u...
53•juanpabloaj•1d ago•12 comments

Show HN: Alien – Self-hosting with remote management (written in Rust)

105•alongub•1d ago•43 comments

Show HN: A fake small claims court for petty complaints

https://benlirio.com/petty-small-claims/
5•blirio•12h ago•1 comments

Show HN: I Built SwiftUI but for macOS MDM

https://github.com/photon-hq/Astrolabe
7•RyanZhuuuu•19h ago•0 comments

Show HN: I built an AI that assigns YOU tasks

https://www.pause.build/
3•chaidhat•14h ago•6 comments

Show HN: Palmier – bridge your AI agents and your phone

https://github.com/caihongxu/palmier
5•caihongxu•14h ago•5 comments

Show HN: Smol machines – subsecond coldstart, portable virtual machines

https://github.com/smol-machines/smolvm
493•binsquare•3d ago•147 comments

Show HN: Mimi in the browser – hear the semantic/acoustic split

https://www.frisson-labs.com/mimi-codec
4•ymaws•17h ago•1 comments

Show HN: Ctx – a /resume that works across Claude Code and Codex

https://github.com/dchu917/ctx
7•dchu17•1d ago•0 comments

Show HN: I made a calculator that works over disjoint sets of intervals

https://victorpoughon.github.io/interval-calculator/
312•fouronnes3•3d ago•54 comments

Show HN: Themeable HN

https://github.com/insin/comments-owl-for-hacker-news/releases/tag/v3.6.1
5•insin•1d ago•1 comments

Show HN: PanicLock – Close your MacBook lid disable TouchID –> password unlock

https://github.com/paniclock/paniclock/
260•seanieb•4d ago•114 comments

Show HN: Auto-generated titles and colors for parallel Claude Code sessions

https://github.com/jbarbier/which-claude-code
3•julien421•17h ago•0 comments

Show HN: Eris – desktop PGP workstation with simple GUI

https://eris.sibexi.co/
3•Sibexico•18h ago•0 comments