frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Butter, a muscle memory cache for LLMs

https://docs.butter.dev
14•edunteman•1h ago
Hi HN, Erik here. Today we launch Butter, an OpenAI-compatible API proxy that caches LLM generations and serves them deterministically on revisit.

Since April, we’ve been working on this concept of “muscle memory,” or deterministic replay, for agent systems performing automations. You may recall our first post in May, launching a python package called Muscle Mem: https://news.ycombinator.com/item?id=43988381

Since then, the product has evolved entirely, now taking the form of an LLM Proxy. For a deep dive into this process, check out: https://blog.butter.dev/muscle-mem-as-a-proxy

The proxy’s killer feature is being template-aware, meaning it can reuse cache entries across structurally similar requests. Inducing variable structure from context windows is no easy task, which we cover in a technical writeup here: https://blog.butter.dev/template-aware-caching

The proxy is currently open-access and free to use so we can quickly discover and work through a slew of edge cases and template-induction errors. There’s much work to be done before it’s technically sound, but we’d love to see you take Butter for a spin and share how it went, where it breaks, if it’s helpful, if we're going down a dead end, etc.

Cheers!

Comments

ketan_around•55m ago
Exciting to see a product like this launch! There are obviously a host of ‘memory’ solutions out there that try to integrate in fancy ways to cache knowledge / save tokens, but I think there’s a beauty in simplicity to just having a proxy over the OpenAI endpoint.

Interested to see where this goes!

edunteman•42m ago
An interesting alternative product to offer is injecting prompt cache tokens into requests where they could be helpful; not bypassing generations but at least low hanging fruit for cost savings
tsvoboda•43m ago
looks pretty cool! How would you integrate this into production agent stacks like langchain, autogpt, even closed loop robotics?
edunteman•36m ago
Thanks! For langchain you can repoint your base_url in the client. Autogpt I'm not as familiar with. Closed loop robotics using LLMs may be a stretch for now, especially since vision is a heavy component, but theoretically the patterns baked into small language models running on-device or hosted LLMs at higher level planning loops, could be emulated by a butter cache if observed in high enough volume.
raymondtana•28m ago
For AutoGPT, there is the option to set a llamafile endpoint, which follows the Chat Completions API. So, theoretically, you should be able to use that to point to Butter's LLM proxy.

Our 2022 Pivot, aligning Hospitals and Researchers

https://merqur.io/2022/09/05/the-story-of-our-2022-pivot/
1•merqurio•2m ago•0 comments

Show HN: Learn Chinese by partially translating things you'd normally read

1•wayy•3m ago•0 comments

visionOS 26 Review: Keep moving toward the future

https://sixcolors.com/post/2025/10/visionos-26-review-keep-moving-toward-the-future/
1•CharlesW•4m ago•0 comments

Every Religion Is Based on One Word

1•phoenixhaber•5m ago•1 comments

Implementing /Usr Merge in Alpine

https://alpinelinux.org/posts/2025-10-01-usr-merge.html
1•rascul•5m ago•0 comments

We're all senior engineers now

https://theahura.substack.com/p/were-all-senior-engineers-now
1•theahura•9m ago•0 comments

Decoding gender dimorphism of the human brain using multimodal MRI data

https://www.sciencedirect.com/science/article/abs/pii/S1053811913000074
1•CGMthrowaway•11m ago•0 comments

The AirPods Pro 3 Are Everything I Wanted, and They're Crucially Flawed

https://aftermath.site/airpods-pro-3-review-heartrate-v-shaped-sound
1•sandbach•12m ago•0 comments

Show HN: Fuzz Forge, vulnerability discovery with AI and fuzzing

https://github.com/FuzzingLabs/fuzzforge_ai
9•unbalancedparen•12m ago•0 comments

Show HN: A Standalone Android AI Automation Agent

https://github.com/iamvaar-dev/heybro
1•iamvaar-dev•12m ago•0 comments

Mythbusting Illiteracy in the Middle Ages

https://www.medievalists.net/2023/11/mythbusting-illiteracy-in-the-middle-ages/
1•stconstantine•13m ago•0 comments

Nobody Would Edit Shakespeare, Right? Right?

https://blogs.loc.gov/loc/2025/09/nobody-would-edit-shakespeare-right-right/
1•CharlesW•14m ago•0 comments

Show HN: I built a StopScrolling Predictor 4 creators and brands to help friend

https://www.aisthetix.com/
1•the_mahala•15m ago•0 comments

Biodegradable riboflavin-containing polypeptide for energy storage

https://www.pnas.org/doi/10.1073/pnas.2509325122
1•PaulHoule•19m ago•0 comments

The Truth About the School "Replacing Teachers with AI"

https://danmeyer.substack.com/p/the-truth-about-2-hour-learning-and
4•simonebrunozzi•20m ago•0 comments

MuseScore Studio 4.6 is now available

https://musescore.org/en/4.6
2•promiseofbeans•20m ago•0 comments

Building a Datacenter (For Dummies) Part I – Crucible Capital

https://docsend.com/view/xskh32kw8hupge8y
1•ChrisArchitect•22m ago•0 comments

Blackout at Chornobyl after drone attack

https://kyivindependent.com/chornobyl-nuclear-plant-loses-power-after-russian-attack-on-nearby-to...
1•defly•25m ago•0 comments

First-ever guitar amp authenticated on the blockchain

https://www.guitarworld.com/gear/amps/synner-sg-22-universal-pedal-platform-amplifier
1•NikolaNovak•26m ago•1 comments

YouTube to pay $22M for White House ballroom to settle Trump lawsuit

https://www.cbsnews.com/news/youtube-settles-trump-lawsuit-white-house-ballroom/
3•anigbrowl•26m ago•1 comments

Evaluating the Impact of AI on the Labor Market: Current State of Affairs

https://budgetlab.yale.edu/research/evaluating-impact-ai-labor-market-current-state-affairs
2•ChrisArchitect•28m ago•0 comments

Prison guard, shot six times, hails FCC push to allow cell jamming behind bars

https://www.nbcnews.com/news/us-news/ex-prison-guard-shot-six-hails-fcc-push-allow-cellphone-jamm...
1•rmason•30m ago•0 comments

Two Weeks Makes a Difference: How Paternity Leave Can Increase/Decrease Divorce

https://www.governance.fyi/p/two-weeks-makes-the-difference-how
2•toomuchtodo•32m ago•1 comments

White House ballroom construction to continue through shutdown, official says

https://abcnews.go.com/Politics/live-updates/trump-admin-live-updates/?id=126029955
8•zzzeek•33m ago•0 comments

Apple's next-gen Vision Pro revealed in new regulatory filing

https://www.theverge.com/news/789801/apple-vision-pro-fcc-filing-leak
1•CharlesW•34m ago•0 comments

There is no industry standard as to which way a tap should turn

http://susan-stepney.blogspot.com/2025/09/there-is-no-industry-standard-as-to.html
1•speckx•34m ago•0 comments

Denounce.vercel.app

https://denounce.vercel.app/
2•patrikcsak•35m ago•2 comments

Monkey Selfie Copyright Dispute

https://en.wikipedia.org/wiki/Monkey_selfie_copyright_dispute
1•thunderbong•35m ago•0 comments

Claude Code 2 and the hidden cost of slow coding assistants: context switching

https://coding-with-ai.dev/posts/context-switching/
3•imasl42•35m ago•1 comments

OpenAI researcher posts fake CCTV footage of a real person shoplifting

https://twitter.com/GabrielPeterss4/status/1973120058907041902
13•jsheard•35m ago•5 comments