Hi — I'm Raphael Mansuy. I built edgequake-litellm to provide a low-latency, Rust-backed drop-in replacement for LiteLLM. It exposes the same Python API (`completion()`, `acompletion()`, `stream()`, `embedding()`), supports provider/model routing (OpenAI, Anthropic, Gemini, Mistral, xAI, OpenRouter, Ollama, LM Studio, etc.), and ships as a single ABI3 wheel with zero Python runtime deps.
Quick migration:
```python
import edgequake_litellm as litellm # drop-in alias
```
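Once aliased, existing LiteLLM call sites should work unchanged. A minimal sketch, assuming the signatures mirror LiteLLM's (the model string and OpenAI-style response shape here are illustrative):
```python
import edgequake_litellm as litellm

# Provider-prefixed model routing, LiteLLM-style (model name is illustrative);
# expects the usual provider API key (e.g. OPENAI_API_KEY) in the environment.
response = litellm.completion(
    model="openai/gpt-4o-mini",
    messages=[{"role": "user", "content": "Say hello in one sentence."}],
)

# Assumes the OpenAI-compatible response objects LiteLLM returns
print(response.choices[0].message.content)
```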
Why build it? LiteLLM is excellent, but its pure-Python HTTP layer adds per-request SDK overhead. I moved the core into Rust (edgequake-llm) and wrapped it with PyO3 to cut latency and provide a robust, multi-arch wheel. This is v0.1: P0 compatibility is in place, but I'd love feedback on priorities: provider coverage, proxy features, billing/budgets, or tool-calling parity.
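Where per-request overhead matters most is fan-out under concurrency. A hedged sketch, assuming `acompletion()` follows LiteLLM's async semantics (model name and response shape are illustrative, as above):
```python
import asyncio
import edgequake_litellm as litellm

async def main():
    # Fan out several requests concurrently; lower per-call overhead
    # in the Rust HTTP core is where the latency savings should show up.
    tasks = [
        litellm.acompletion(
            model="openai/gpt-4o-mini",  # illustrative model string
            messages=[{"role": "user", "content": f"Summarize item {i}"}],
        )
        for i in range(5)
    ]
    responses = await asyncio.gather(*tasks)
    for r in responses:
        print(r.choices[0].message.content)

asyncio.run(main())
```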
Install:
```
pip install edgequake-litellm
```
Repo: https://github.com/raphaelmansuy/edgequake-llm
If you try it, please star the repo and open issues for features you want most — I'm actively iterating. Happy to answer technical questions here.