frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Show HN: I visualized the entire history of Citi Bike in the browser

https://bikemap.nyc/
20•freemanjiang•3h ago•8 comments

Show HN: An LLM response cache that's aware of dynamic data

https://blog.butter.dev/on-automatic-template-induction-for-response-caching
7•raymondtana•1h ago•0 comments

Show HN: I built a "Do not disturb" Device for my home office

https://apoorv.page/blogs/over-engineered-dnd
66•quacky_batak•4d ago•33 comments

Show HN: A to Z – A word game I built from a childhood road trip memory

https://a26z.fun/
7•jackhulbert•3h ago•2 comments

Show HN: Free and local browser tool for designing gear models for 3D printing

https://gears.dmtrkovalenko.dev
35•neogoose•14h ago•9 comments

Show HN: Flatagents: State machine orchestration with stateless LLM agents

https://github.com/memgrafter/flatagents
2•beacon294•1h ago•1 comments

Show HN: KeelTest – AI-driven VS Code unit test generator with bug discovery

https://keelcode.dev/keeltest
25•bulba4aur•9h ago•10 comments

Show HN: Tool for Testing MCP Servers

https://www.mcp-workbench.ai/
2•opiniateddev•1h ago•0 comments

Show HN: SMTP Tunnel – A SOCKS5 proxy disguised as email traffic to bypass DPI

https://github.com/x011/smtp-tunnel-proxy
122•lobito25•22h ago•40 comments

Show HN: bikemap.nyc – visualization of the entire history of Citi Bike

https://github.com/freeman-jiang/bikemap.nyc
6•freemanjiang•2h ago•3 comments

Show HN: Grammar of Graphics CLI tool made in Rust

https://github.com/williamcotton/gramgraph
2•williamcotton•2h ago•0 comments

Show HN: Comet MCP – Give Claude Code a browser that can click

https://github.com/hanzili/comet-mcp
25•hanzili•3d ago•25 comments

Show HN: VaultSandbox – Test your real MailGun/SES/etc. integration

https://vaultsandbox.com/
52•vaultsandbox•1d ago•9 comments

Show HN: Mantic.sh – A structural code search engine for AI agents

https://github.com/marcoaapfortes/Mantic.sh
73•marcoaapfortes•1d ago•34 comments

Show HN: 48-digit prime numbers every git commit

https://textonly.github.io/git-prime/
65•keepamovin•6d ago•52 comments

Show HN: Tylax – A bidirectional LaTeX to Typst converter in Rust

https://github.com/scipenai/tylax
16•democat•7h ago•2 comments

Show HN: Seapie – a Python debugger where breakpoints drop into a REPL

https://github.com/hirsimaki-markus/seapie
7•markushirsimaki•4h ago•1 comments

Show HN: The kissing number theorem predicts particle masses from sphere packing

https://colab.research.google.com/drive/1_zDIOONfs4WvnpG7GDEH6hzSM25Fsu93?usp=sharing
2•AlekseN•1h ago•3 comments

Show HN: Make audio loops online

https://makeloops.online/
66•bilalba•2d ago•22 comments

Show HN: Prism.Tools – Free and privacy-focused developer utilities

https://blgardner.github.io/prism.tools/
364•BLGardner•1d ago•99 comments

Show HN: Tailsnitch – A security auditor for Tailscale

https://github.com/Adversis/tailsnitch
271•thesubtlety•2d ago•28 comments

Show HN: Stash – Sync Markdown Files with Apple Notes via CLI

https://github.com/shakedlokits/stash
69•shuka•1d ago•21 comments

Show HN: I built a 3D World Map and multiplayer geography game using Three.js

https://www.mixora.xyz
2•qwrwenm•6h ago•0 comments

Show HN: DoNotNotify – Log and intelligently block notifications on Android

https://donotnotify.com/
339•awaaz•2d ago•163 comments

Show HN: llmgame.ai – The Wikipedia Game but with LLMs

https://www.llmgame.ai
24•jmcallister•1d ago•22 comments

Show HN: Jax-JS, array library in JavaScript targeting WebGPU

https://ss.ekzhang.com/p/jax-js-an-ml-library-for-the-web
79•ekzhang•1d ago•21 comments

Show HN: KektorDB – Lightweight, Embeddable Vector+Graph Database Written in Go

https://github.com/sanonone/kektordb
2•san0n•7h ago•1 comments

Show HN: Foundertrace – chain of YC startups founded by its employees

https://foundertrace.com/
38•loondri•3d ago•13 comments

Show HN: GPU Cuckoo Filter – faster queries than Blocked Bloom, with deletion

https://github.com/tdortman/cuckoo-filter
31•tdortman•1d ago•4 comments

Show HN: Can you hit replacement? A fertility SIM with cited sources

https://www.tfrsim.com/
3•joshuafkon•8h ago•0 comments
Open in hackernews

Show HN: Plano – Edge and service proxy with orchestration for AI agents

https://github.com/katanemo/plano
7•adilhafeez•1d ago
Hey HN — I’m Adil from Katanemo (with Salman, Shuguang, and Meiyu)

We previously shared an early version of this project as ArchGW. Based on customer feedback, the scope expanded from “LLM routing and model access” into something broader: delivery infrastructure for agentic applications. We renamed it to Plano and reworked the architecture accordingly.

The problem

On-the-ground AI practitioners will tell you that calling an LLM is not the hard part. The really hard part is delivering agentic applications to production quickly and reliably, then iterating without rewriting system code every time. In practice, teams keep rebuilding the same concerns that sit outside any single agent’s core logic:

This includes model agility — the ability to pull from a large set of LLMs and swap providers without refactoring prompts or streaming handlers. They need to learn from production by collecting signals and traces that tell them what to fix. They need consistent policy enforcement for moderation and jailbreak protection, rather than sprinkling hooks across codebases. And they need multi-agent patterns like handoff and specialization without turning their app into orchestration glue.

These concerns get rebuilt and maintained inside fast-changing frameworks and application code, coupling product logic to infrastructure decisions. It’s brittle, and pulls teams away from core product work into plumbing they shouldn’t have to own.

What Plano does

Plano moves core delivery concerns out of process into a modular proxy and dataplane designed for agents. It supports inbound listeners (agent orchestration, safety and moderation hooks), outbound listeners (hosted or API-based LLM routing), or both together.

Plano provides the following capabilities via a unified, protocol-native, framework-friendly dataplane:

- Orchestration: Low-latency routing and handoff between agents. Add or change agents without modifying app code, and evolve strategies centrally instead of duplicating logic across services.

- Guardrails & Memory Hooks: Apply jailbreak protection, content policies, and context workflows (rewriting, retrieval, redaction) once via filter chains. This centralizes governance and ensures consistent behavior across your stack.

- Model Agility: Route by model name, semantic alias, or preference-based policies. Swap or add models without refactoring prompts, tool calls, or streaming handlers.

- Agentic Signals™: Zero-code capture of behavior signals, traces, and metrics across every agent, surfacing traces, token usage, and learning signals in one place.

The goal is to keep application code focused on product logic while Plano owns delivery mechanics.

More on Architecture

Plano has two main parts:

Envoy-based data plane. Uses Envoy’s HTTP connection management to talk to model APIs, services, and tool backends. We didn’t build a separate model server—Envoy already handles streaming, retries, timeouts, and connection pooling. Some of us are core Envoy contributors at Katanemo.

Brightstaff, a lightweight controller written in Rust. It inspects prompts and conversation state, decides which upstreams to call and in what order, and coordinates routing and fallback. It uses small LLMs (1–4B parameters) trained for constrained routing and orchestration. These models do not generate responses and fall back to static policies on failure. The models are open sourced here: https://huggingface.co/katanemo

Plano runs alongside your app servers (cloud, on-prem, or local dev), doesn’t require a GPU, and leaves GPUs where your models are hosted.

Repo https://github.com/katanemo/plano + docs https://docs.planoai.dev/

Comments

dang•1d ago
> We previously shared an early version of this project as ArchGW

Looks like these are the previous threads, if anyone's curious:

Show HN: ArchGW – An intelligent edge and service proxy for agents - https://news.ycombinator.com/item?id=44546265 - July 2025 (15 comments)

Show HN: ArchGW – An open-source intelligent proxy server for prompts - https://news.ycombinator.com/item?id=43259862 - March 2025 (7 comments)

Show HN: archgw: open-source, intelligent proxy for AI agents, built on Envoy - https://news.ycombinator.com/item?id=42187132 - Nov 2024 (14 comments)