I don't think having a tool that rewrites tool descriptions is a good idea.
You should put effort into getting them in a good place and accept the token levels (which is part of that design space).
aSidorenkoCode•1h ago
Every API call sends the full tool schema for all available tools. In a 10-20 step session, you're paying for the same verbose descriptions over and over. Models don't need a paragraph-long explanation of read on the 15th call.
This plugin slims descriptions to one-liners like "Read file content." while cutting 21-45% of token usage. No schema changes, no custom tools. Just trimmed boilerplate as an opt-in plugin.
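Roughly, the idea is a static lookup applied to the tool list before each request. Something like this sketch (illustrative only, not the plugin's actual source and not OpenCode's real plugin API):

```typescript
// Illustrative sketch only: a static map of one-line descriptions
// swapped into the tool schema before each API call.
const SLIM_DESCRIPTIONS: Record<string, string> = {
  read: "Read file content.",
  write: "Write content to a file.",
  edit: "Apply an edit to a file.",
};

interface ToolSchema {
  name: string;
  description: string;
  parameters: unknown; // JSON schema, left untouched
}

function slimTools(tools: ToolSchema[]): ToolSchema[] {
  return tools.map((tool) => ({
    ...tool,
    // Same one-liner every call, every session; tools without an entry
    // keep their original description.
    description: SLIM_DESCRIPTIONS[tool.name] ?? tool.description,
  }));
}
```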
verdverm•1h ago
> Every API call sends the full tool schema for all available tools.
Only if you are doing it wrong, search >>> summarization
Then the other question: is it deterministic between runs, or am I going to get a different summary each session, turn, or tool call? And depending on that frequency, am I using more tokens than I save by summarizing N tools?
Minimizing token usage is not a goal in and of itself, re: the ageless tradeoff of quantity vs. quality
For some context, my system prompt is around 5k tokens at the start. I put file contents (read/write/agents.md) there, which saves millions of tokens and seems to work better than making them message parts.
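Something like this (a rough sketch of the setup, not exact code; the file list and function name are illustrative):

```typescript
// Sketch only: inline a few key files into the system prompt once per session,
// instead of re-sending them as message parts on every turn.
import { readFileSync } from "node:fs";

function buildSystemPrompt(basePrompt: string, files: string[]): string {
  const sections = files.map(
    (path) => `## ${path}\n${readFileSync(path, "utf8")}`,
  );
  return [basePrompt, ...sections].join("\n\n");
}

// e.g. buildSystemPrompt(base, ["AGENTS.md"]) keeps the file in context
// for the whole session.
```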
> Just trimmed boilerplate
This is not what I see this tool doing. It's automatically manipulating, in the background, words that you should put far more care and attention into. Referring to those words as "boilerplate" that you can just throw into a slop machine is revealing.
aSidorenkoCode•58m ago
The benchmarks show no degradation in task completion with the shorter descriptions. We're in the age where frontier LLMs don't need instructions on how to read or edit a file.
The descriptions aren't dynamically summarized either. They're static in the plugin, same every call, every session. Zero overhead, fully deterministic.
This has been validated in over 3000 benchmark runs in OpenCode, and I ran the entire Exercism Python practice suite (https://github.com/exercism/python/tree/main/exercises/pract...) with and without the plugin, with identical results. An initial dataset is shared in the repo.
verdverm•50m ago
Have you made that benchmarking process open so others could reproduce it?
> with identical results
If your results are identical, you should be very sus; something is wrong if this is true. Nothing in agentic systems is reliably or fully deterministic.
aSidorenkoCode•43m ago
Good benchmark results don't mean identical outputs. The task completion rate is the same: both runs pass the same exercises. The paths the model takes differ, but the end result is the same: the tests pass.
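Concretely, "same result" means the pass/fail outcome per exercise matches across the two configurations, something like this check (illustrative, not the actual benchmark harness):

```typescript
// Illustrative only: "identical results" at the task-completion level means
// every exercise has the same pass/fail outcome under both configurations,
// regardless of the path the model took to get there.
function sameCompletion(
  baseline: Map<string, boolean>, // exercise -> passed, full descriptions
  slimmed: Map<string, boolean>,  // exercise -> passed, slim descriptions
): boolean {
  if (baseline.size !== slimmed.size) return false;
  for (const [exercise, passed] of baseline) {
    if (slimmed.get(exercise) !== passed) return false;
  }
  return true;
}
```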
The full benchmarking methodology and tooling will be published alongside the paper.
verdverm•41m ago
you used the word "identical" to describe it, not me
words matter
which is why I still think this is a terrible idea. I don't think it holds up in the general case, and as a peer reviewer I would be inclined to believe there is benchmark filtering that makes for good results.
You should use the same benchmarks everyone else is using when you write your paper.