frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: LLM-Gateway – Zero-Trust LLM Gateway

https://github.com/openziti/llm-gateway
7•michaelquigley•6h ago
I built an OpenAI-compatible LLM gateway that routes requests to OpenAI, Anthropic, Ollama, vLLM, llama-server, SGLang... anything that speaks /v1/chat/completions. Single Go binary, one YAML config file, no infrastructure.

It does the things you'd expect from this kind of gateway... semantic routing via a three-layer cascade (keyword heuristics, embedding similarity, LLM classifier) that picks the best model when clients omit the model field, weighted round-robin load balancing across local inference servers with health checks and failover.

The part I think is most interesting is the network layer. The gateway and backends communicate over zrok/OpenZiti overlay networks... reach a GPU box behind NAT, expose the gateway to clients, put components anywhere with internet connectivity behind firewalls... no port forwarding, no VPN. Zero-trust in both directions. Most LLM proxies solve the API translation problem. This one also solves the network problem.

Apache 2.0. https://github.com/openziti/llm-gateway

I work for NetFoundry, which sponsors the OpenZiti project this is built on.

Comments

veilpiercer•3h ago
Realistically if you want to use an OSS model instead of the big 3, you're faced with evalutating models/providers across all these axes, which can require a fair amount of expertise to discern. You may even have to write your own custom evaluations. Meanwhile Anthropic/OAI/Google "just work" and you get what it says on the tin, to the best of their ability. Even if they're more expensive (and they're not that much more expensive), you are basically paying for the priviledge of "we'll handle everything for you".

Show HN: Open-Source Animal Crossing–Style UI for Claude Code Agents

https://github.com/outworked/outworked/releases/tag/v0.3.0
28•ZeidJ•3h ago•20 comments

Show HN: Kagento – LeetCode for AI Agents

https://kagento.io
4•ifdotpy•27m ago•0 comments

Show HN: I put an AI agent on a $7/month VPS with IRC as its transport layer

https://georgelarson.me/writing/2026-03-23-nullclaw-doorman/
318•j0rg3•22h ago•93 comments

Show HN: For You – AI art floats down a river for strangers to find

https://www.foryouriver.com/
3•arclegger•1h ago•4 comments

Show HN: Anvil – Desktop App for Spec Driven Development

https://github.com/zdenham/anvil
3•zdenham•2h ago•0 comments

Show HN: Sup AI, a confidence-weighted ensemble (52.15% on Humanity's Last Exam)

https://sup.ai
20•supai•1d ago•22 comments

Show HN: Foundry: a Markdown-first CMS written in Go

https://github.com/sphireinc/Foundry
9•nsayoda•4h ago•4 comments

Show HN: MLForge: A Visual Machine Learning Platform

https://github.com/zaina-ml/ml_forge
3•zaina-ml•2h ago•1 comments

Show HN: Fio: 3D World editor/game engine – inspired by Radiant and Hammer

https://github.com/ViciousSquid/Fio
90•vicioussquid•1d ago•9 comments

Show HN: Control Codex via WhatsApp using a Codex plugin

https://github.com/abuiles/codex-whatsapp-relay
3•abuiles•3h ago•0 comments

Show HN: Build AI Trading Agents in Cursor/Claude with an MCP Server

https://financialdata.net/mcp-server
3•financial-data•3h ago•0 comments

Show HN: Turbolite – a SQLite VFS serving sub-250ms cold JOIN queries from S3

https://github.com/russellromney/turbolite
159•russellthehippo•1d ago•40 comments

Show HN: jid – JSON Incremental Digger v1.1.0 with JMESPath support

https://github.com/simeji/jid/releases/tag/v1.1.0
5•jamslater•3h ago•0 comments

Show HN: Cranki – Crosswords meet Anki flashcards

https://cranki.app
3•petargyurov•4h ago•0 comments

Show HN: AgentGuard – A high-performance Go proxy for AI agent guardrails

https://github.com/Caua-ferraz/AgentGuard
3•millimercure•4h ago•0 comments

Show HN: Minimalist library to generate SVG views of scientific data

https://github.com/alefore/mini_svg/
41•afc•4d ago•3 comments

Show HN: OpenHelm – OpenClaw but free (using your Claude Code subscription)

https://www.openhelm.ai/
3•maxbeech•4h ago•2 comments

Show HN: FileWash – File tools that never see your files

https://filewash.app/
3•PressKeyProdigy•5h ago•0 comments

Show HN: Veil – Dark mode PDFs without destroying images, runs in the browser

https://veil.simoneamico.com/
89•simoneamico•1d ago•24 comments

Show HN: Forkrun – NUMA-aware shell parallelizer (50×–400× faster than parallel)

https://github.com/jkool702/forkrun
5•jkool702•9h ago•1 comments

Show HN: Alumnium – SOTA Browsing for Claude Code

https://alumnium.ai/blog/webvoyager-benchmark/
3•p0deje•6h ago•0 comments

Show HN: AppDesk – Native macOS Client for App Store Connect

https://appdesk.dev
4•prasadrl•6h ago•0 comments

Show HN: Grafana TUI – Browse Grafana dashboards in the terminal

https://github.com/lovromazgon/grafana-tui
4•lmazgon•9h ago•3 comments

Show HN: Optio – Orchestrate AI coding agents in K8s to go from ticket to PR

https://github.com/jonwiggins/optio
81•jawiggins•2d ago•56 comments

Show HN: Bottrace – headless CLI debugger for Python, built for LLM agents

https://github.com/devinvenable/bottrace
3•devinvenable•6h ago•0 comments

Show HN: LLM-Gateway – Zero-Trust LLM Gateway

https://github.com/openziti/llm-gateway
7•michaelquigley•6h ago•1 comments

Show HN: Git Web Manager

https://github.com/WallabyDesigns/gitmanager
3•wallabydesigns•6h ago•0 comments

Show HN: ClawRun – Deploy AI agents to secure sandboxes with one command

https://clawrun.sh/?hn
3•afshinmeh•7h ago•0 comments

Show HN: Aegis – Security framework for AI agents

https://acacian.github.io/aegis/playground/
2•Acacian•7h ago•0 comments

Show HN: A plain-text cognitive architecture for Claude Code

https://lab.puga.com.br/cog/
148•marciopuga•1d ago•49 comments