frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

We added TOON compression to our LLM gateway – compress prompts, saves tokens

https://github.com/toon-format/toon
1•raaihank•1h ago

Comments

raaihank•1h ago
Costbase is an LLM cost optimization proxy. We just shipped TOON (Token-Oriented Object Notation) compression.

TOON is an open format (not ours): https://github.com/toon-format/toon

It converts JSON like this:

    {"id": "cust_001", "name": "Acme", "mrr": 15000}
Into: id: cust_001 name: Acme mrr: 15000

We integrated it into our gateway to automatically compress JSON in tool results, user messages, and tool call arguments before they hit the LLM.

Benchmarks on real payloads:

- CRM query (10 records): 48% tokens saved - E-commerce orders (4 orders): 34% saved - API metrics (8 endpoints): 43% saved

Sub-100μs latency overhead. LLMs parse it correctly in our testing (GPT-4o, Claude, etc).

Not a silver bullet — works best on arrays of objects with uniform schemas. Deeply nested or irregular JSON sees less benefit.

Curious what strategies others use for token compression. We considered CSV for tabular data but it doesn't handle nested structures.

https://www.costbase.ai

How does OpenAI balance long-term research bets with product-forward research?

https://twitter.com/markchen90/status/2018779039205667046
1•tosh•37s ago•0 comments

Public Notice: I Am Your AIB and the Warning That Came True

1•rowanseerwald•2m ago•0 comments

voyage-multimodal-3.5: a new multimodal retrieval frontier with video support

https://blog.voyageai.com/2026/01/15/voyage-multimodal-3-5/
1•fzliu•4m ago•0 comments

Detecting and Monitoring OpenClaw (clawdbot, moltbot) in your environment

https://isc.sans.edu/diary/32678
1•Binary_Impact•8m ago•1 comments

Quantum Computing for Programmers

https://github.com/qcc4cp/qcc
1•altro•8m ago•0 comments

FireClaw: Personal OpenClaw assistant in a single binary, built on Firecracker

https://github.com/AFK-surf/fireclaw
2•Johnson8053•11m ago•1 comments

Melinda French Gates Appears to Confirm Divorce Was Related to Epstein

https://gizmodo.com/melinda-french-gates-appears-to-confirm-divorce-with-bill-was-related-to-epst...
2•petethomas•11m ago•0 comments

Show HN: Sentinel – a Pingora-based reverse proxy (inspired by River)

https://sentinel.raskell.io/
3•raskelll•15m ago•0 comments

Ask HN: Does anyone keep prompts and reasoning as part of dev cycle?

1•sshadmand•18m ago•1 comments

Russian spy spacecraft have intercepted Europe's key satellites

https://www.ft.com/content/cd08c49c-658e-49c9-9a15-234f2bfc2074
3•mraniki•19m ago•2 comments

The world is more equal than you think

https://economist.com/graphic-detail/2026/02/03/the-world-is-more-equal-than-you-think
4•andsoitis•21m ago•2 comments

A new nuclear arms race beckons

https://economist.com/international/2026/02/03/a-new-nuclear-arms-race-beckons
1•andsoitis•23m ago•0 comments

Rust Is Just a Tool

https://lewiscampbell.tech/blog/260204.html
2•LAC-Tech•23m ago•0 comments

Show HN: Augmenting developer docs into high-level interactive mental models

https://docmaps-web.vercel.app/
3•b_mutea•24m ago•0 comments

Elon Musk's mega-merger makes little sense business sense

https://www.economist.com/business/2026/02/03/elon-musks-mega-merger-makes-little-business-sense
2•andsoitis•25m ago•0 comments

How to squeeze a lexicon (2001) [pdf]

https://marcinciura.wordpress.com/wp-content/uploads/2019/10/lexicon.pdf
1•mci•26m ago•0 comments

Ask HN: At what point in the future will AIs stop working or not be correct?

2•roschdal•32m ago•1 comments

Show HN: TitanShell – Security-first desktop client for OpenClaw

https://github.com/DaguangZhou/TitanShell
1•snowwolf_zdg•33m ago•1 comments

Show HN: Gateway – An open-source proxy to securely handle BYOK keys

https://github.com/glueco/gateway
3•mumernisar•39m ago•1 comments

Echoes

https://thelehrhaus.com/culture/echoes/
2•barry-cotter•40m ago•0 comments

Manna by Marshall Brain

https://marshallbrain.com/manna1
2•jpmitchell•41m ago•0 comments

CUBO the Industrial-Grade Local RAG

https://github.com/PaoloAstrino/cubo
3•50kIters•41m ago•0 comments

Show HN: Astrolabe – Navigate Your Data Universe in Nextcloud

https://blog.coutinho.io/introducing-astrolabe-navigate-your-data-universe-in-nextcloud
2•cbcoutinho•48m ago•0 comments

"Superhuman": A 13 year old boy swims 2.5 miles to save family swept out to sea

https://www.cbsnews.com/news/boy-swims-hours-save-mom-siblings-swept-out-sea-superhuman/?ftag=YHF...
7•gurjeet•48m ago•0 comments

Oracle to raise $50B as AI debt piles up

https://www.marketwatch.com/story/oracles-monster-25-billion-debt-financing-points-to-anxieties-a...
2•zerosizedweasle•51m ago•0 comments

Show HN: Yutovo – visual online and desktop calculator inside a text editor

https://yutovo.com
3•denprog•51m ago•0 comments

Siemens Energy Bets $1B That A.I. Power Demand Will Last

https://www.nytimes.com/2026/02/03/business/energy-environment/siemens-energy-ai-power-demand.html
2•ChrisArchitect•58m ago•0 comments

Show HN: Ghidra MCP Server – 110 tools for AI-assisted reverse engineering

https://github.com/bethington/ghidra-mcp
2•xerzes•1h ago•1 comments

Western Digital doubles the performance of HDD with dual-actuator High-Bandwidth

https://www.tomshardware.com/pc-components/hdds/western-digital-doubles-the-performance-of-hard-d...
1•XzetaU8•1h ago•0 comments

"Virtual Twin Factory" is just a brute-force legacy model

https://tushar1qaz.substack.com/p/the-virtual-twin-is-a-brute-force
1•FuseGov•1h ago•0 comments