frontpage.

I have been building multiple LLM systems and for our Organization biggest cost savings weren't from prompt-wordsmithing or model switchings. Sharing useful to anyone watching their token bill :

1) JSON → TOON for structured output: JSON was not made for LLMs. well you can implement your own verison that fits for your needs that reduce tokens usage but what worked for us was TOON. TOON cut output our tokens by ~30% same information, way less syntax tax.

2) Full markdown/HTML → condensed markdown: Using markdown for writing your prompts, getting intermediate results or communication between your Agents eats a lot of tokens. We swithced to condesed markdown and short system prompts that replicate Caveman. this alone cut just on input token costs ~50% on calls that pass prior context forward which can be implemented between Agent Calls.

3) Long Do/Don't instruction lists → 2-3 multi-shot examples: Counterintuitive one - replacing a large lists of DO's and Don'ts for agents rules don't help. rather couple of concrete examples that convers major and all cases actually improved output quality more reliably and it's usually fewer tokens once the instruction list gets long enough to cover real edge cases.

I have seen most people on this sub reddit talk about using open-source or cheaper models. Like we were spending thousands of dollar's but this all changes alone helped reduce cost by 60%.

edit: Open to Discussion, anyone whether something similar would help their setup.

Sponja found 897 companies running webinars (and who runs them) for under $20

Smokey Yunick's Hot Vapor Engine Was Equally Genius and Horribly Unsafe

Show HN: StartupWiki – A Free Alternative to Crunchbase

ETLFunnel v1.0 – Accepting POC Requests

Ask HN: What do you do to make LLMs determine

Lobsters Bug Allows Unauthorized Email Access

Plasma Vitamin C levels are associated with brain structural networks on MRI

Marathon Petroleum Company Is Making Diesel from Soybeans

Cancer Myths and Falsehoods Can Be Deadly

Show HN: Namecom-CLI – CLI and agent skill so Claude Code/Codex can do your DNS

Follow when your world cup team is going to play

22-year-old Mozart's handwritten notebook unearthed in 'major discovery'

Agent Memory Layer: Repository-local memory for AI coding agents

Lightweight Compression in DuckDB (2022)

Cognitive Offloading

FSST: Fast Random Access String Compression [pdf]

Noverdesk – reusable skills for AI support agents, with a conversational builder

Big Tech is stoking unrest in the UK. Why?

New GCP Big Query Emulator

The Joel Test (2000)

Show HN: Tiny.Place – AI Social network for orchestration, payments & jobs

One giant US power line, enough wind power for 1M homes

Indie hackers needed better founder pages, so I made this

Votre guide sur l'intelligence artificielle

Ask HN: Why SF? Why not India? UK? Or anywhere else?

No training. No cloud. No transformers. It abstains instead of hallucinating [video]

Tech companies seeking better control of spending on AI

NewsGlobe – An interactive 3D globe of world newspapers and live news

Power shortages force Cuban churches to ration Communion wafers

Changes that cut our LLM pipeline costs more than model-switching did