frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Go-LLM-proxy v0.3 released – translating proxy for Claude Code and Codex

https://go-llm-proxy.com
2•yatesdr•4h ago

Comments

yatesdr•4h ago
Happy to report v0.3 released for go-llm-proxy!

Great for connecting your local LLM coding and vision models to Claude Code and Codex.

General improvements

> Vision pipeline - images described by your vision model, transparent to the client

> Dual OCR pipeline - smart routing for PDFs and tool output (text extraction first, vision fallback for scanned docs). Dedicated OCR models like

> PaddleOCR-VL are ~17x faster than general vision models on document pages

> Brave & Tavily search integration - native behavior for Claude Code and Codex when configured on the proxy

> Per-model processor routing - override vision, OCR, and search settings per model

> Context window auto-detection from backends SSE keepalive improvements during pipeline processing Full MCP SSE endpoint for web search on OpenCode, Qwen Code, Claw, and other MCP-compatible agents Docker update for easier deployment (limited testing so far)

Codex-specific

> Full Responses API translation - Chat Completions under the hood, your local backend doesn't need to support /v1/responses

> Reasoning token display - reasoning_summary_text.delta events so Codex shows thinking natively

> Native search UI - emits web_search_call output items so Codex renders "Searched N results" in its interface

> Structured tool output - Codex's view_image returns arrays/objects, not strings. The proxy handles all three formats

> mcp_tool_call_output and mcp_list_tools input types handled (Codex sends these, other backends choke on them)

> Config generator produces config.toml with provider, reasoning effort, context window, and optional Tavily MCP

Claude Code-specific:

> Full Messages API translation - Anthropic protocol to Chat Completions, so Claude Code works with vLLM/llama-server

> Thinking blocks - backend reasoning tokens wrapped as thinking/signature_delta content blocks so Claude Code renders them

> web_search_20250305 server tool intercepted and executed proxy-side

> PDF type: "document" blocks extracted to text before forwarding

> Streaming search with server_tool_use + web_search_tool_result blocks so Claude Code shows "Did N searches"

> /anthropic/v1/messages explicit route for clients that use the Anthropic base URL convention

> Config generator produces settings.json with Sonnet/Opus/Haiku tier selectors, thinking toggles, and start scripts

Tell HN: Anthropic's "extra usage" can kick in before hitting the session quotas

1•vially•4m ago•0 comments

Are we building IDEs for engineers anymore?

https://shub.club/writings/2026/april/who-are-these-ides-for/
1•forthwall•5m ago•0 comments

Show HN: Tmuzika – terminal music player in C (v1.1.2)

https://github.com/ivanjeka/tmuzika
1•ivanjeka•8m ago•0 comments

Claude Code caches unredacted session history and secrets in plaintext

https://github.com/anthropics/claude-code/issues/43675
1•rctruta•9m ago•0 comments

I'm 16 and just published my first Python library – QuantX

https://github.com/aaravchour/quantx
1•aaravchour•11m ago•0 comments

Sleeping sickness drug simplifies treatment, raising hopes for eradication

https://www.science.org/content/article/truly-spectacular-drug-sleeping-sickness-simplifies-treat...
1•MaysonL•12m ago•0 comments

The Iran war highlights the creeping use of AI in warfare

https://www.chathamhouse.org/2026/03/iran-war-highlights-creeping-use-ai-warfare
2•_____k•13m ago•0 comments

Show HN: Ray – an open-source AI financial advisor that runs in your terminal

https://rayfinance.app
4•kaniksu•17m ago•1 comments

Show HN: Cadenza – Connect Wandb logs to agents easily for autonomous research

https://github.com/mylucaai/cadenza
2•hgarud•19m ago•0 comments

Subway Challenge

https://en.wikipedia.org/wiki/Subway_Challenge
1•skogstokig•19m ago•0 comments

Digger – Back and Digitally Remastered

https://www.digger.org/
1•bananaboy•19m ago•0 comments

Rescue team in Iran face 'harrowing and dangerous' search for US crew member

https://www.bbc.com/news/articles/c0le43jw8e9o
4•asdefghyk•22m ago•1 comments

Show HN: SwarmFeed – An X-like social platform built for AI agents

https://www.swarmfeed.ai/
2•jamesweb•25m ago•0 comments

The Republican Party Has a Nazi Problem

https://www.theatlantic.com/magazine/2026/04/republican-party-nazi-problem/686055/
6•tastyface•25m ago•1 comments

Show HN: Nelson, a Ralph-like loop for finding vulnerabilities

https://github.com/swelljoe/nelson
2•SwellJoe•30m ago•1 comments

With the right cache, multiple 800K Opus sessions are still affordable

https://matrix.dev/blog-2026-04-04
2•yuanzhi1203•31m ago•0 comments

Previously untranslated or unpublished writings of Leibniz published next month

https://dailynous.com/2026/03/27/lots-more-leibniz/
2•danielam•34m ago•0 comments

Whoop is trying to copyright UI patterns, activity rings, dark mode and words

https://www.youtube.com/watch?v=iAcx7kP9sog
4•chakintosh•39m ago•0 comments

vLLM introduces memory optimizations for long-context inference

https://github.com/vllm-project/vllm/releases
4•addisud•39m ago•0 comments

We Score MCP Servers – and Why We Rebuilt It from Scratch

https://mcppedia.org/blog/2026-04-04-how-we-score-mcp-servers
3•bibekshrestha•45m ago•1 comments

The Only Game Worth Playing

https://newsletter.calvinrosser.com/p/207
3•sillywabbit•45m ago•0 comments

Map Gesture Controls - Control maps with your hands

https://sanderdesnaijer.github.io/map-gesture-controls/
3•hebelehubele•48m ago•0 comments

Show HN: Local-first resume generator with in-browser PDF rendering

https://resume.journy.live/
4•dlvktrsh•48m ago•2 comments

WebAssembly Explorer

https://mbebenita.github.io/WasmExplorer/
4•luu•48m ago•0 comments

Feds Seek Access to Three Texas State Parks for Border Wall

https://insideclimatenews.org/news/02042026/texas-state-parks-border-wall-construction/
3•geox•51m ago•0 comments

Debian Is Figuring Out How Age Verification Laws Will Impact It

https://www.phoronix.com/news/Debian-Undecided-Age-Laws
4•breve•52m ago•0 comments

A Python package for verifying PyPI attestations of other Python packages

https://github.com/Halfblood-Prince/trustcheck
1•halfblood1010•52m ago•2 comments

ACE on a USB→HDMI Adapter

https://blazelight.dev/blog/ms2160.mdx
2•theblazehen•54m ago•0 comments

LLM 'benchmark' – writing code controlling units in a 1v1 RTS

https://yare.io/ai-arena
4•levmiseri•58m ago•0 comments

Syd sandbox has new Tutorial

https://man.exherbo.org/sydtutorial.7.html
2•hayali•1h ago•0 comments