The core idea: a .chat file IS the conversation. No SQLite, no JSON logs, no shadow state. What you see in the buffer is exactly what the model receives. Edit an assistant reply to fix a hallucination, delete a tangent, fork by duplicating the file - it all works because there's nothing to fall out of sync.
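To give a feel for it, here's a minimal, simplified sketch of a .chat buffer; the exact role markers are illustrative, the README has the real syntax:

```
You: Why does this regex backtrack so badly?

Assistant: The nested quantifier (a+)+ re-tries every way of splitting the
input, so a near-miss string triggers catastrophic backtracking...
```

Editing the assistant line and re-sending is all there is to it; the buffer is the state.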
What's new since October:
- Tool calling. Models can run shell commands and read/edit/write files (the same four tools as Pi, nothing more). Results go straight into the buffer. There's an approval flow (Ctrl-] cycles preview -> execute -> send), so nothing runs without your say-so. Parallel tool use works too.
- Prompt caching for Anthropic, OpenAI and Vertex AI. Flemma places cache breakpoints automatically. Long conversations are now significantly cheaper (this was a major pain point for me).
- Extended thinking / reasoning support for all three providers.
- Per-buffer overrides via frontmatter. `flemma.opt` lets you pick which tools a buffer can use, set provider parameters and switch models, all scoped to that one file (see the first sketch after this list).
- Open registration APIs for both providers and tools. Custom tools can resolve their definitions asynchronously, e.g. from CLI subprocesses or remote APIs (see the second sketch after this list). I plan to add mcporter support at some point.
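To give a rough idea of the per-buffer overrides, an illustrative frontmatter block; the key names and values here are placeholders, not the documented schema:

```yaml
---
flemma.opt:
  model: claude-sonnet-4     # placeholder model name
  tools: [shell, read_file]  # restrict this buffer to two tools
  temperature: 0.2           # provider parameter override
---
```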
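And a hand-wavy sketch of tool registration; `register_tool`, `resolve`, `run` and `done` are my shorthand for the shape of the thing, not the documented names:

```lua
-- Illustrative only: function and field names are assumptions.
local flemma = require('flemma')

flemma.register_tool('jq', {
  -- The definition itself can resolve asynchronously, e.g. by probing a CLI:
  resolve = function(done)
    vim.system({ 'jq', '--version' }, { text = true }, function(out)
      done({
        description = 'Filter JSON with jq (' .. vim.trim(out.stdout or '') .. ')',
        parameters = { filter = 'string', input = 'string' },
      })
    end)
  end,
  run = function(args, done)
    vim.system({ 'jq', args.filter }, { stdin = args.input, text = true },
      function(out) done(vim.trim(out.stdout or '')) end)
  end,
})
```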
Flemma works with Anthropic, OpenAI and Vertex AI. You get cost tracking, presets, Lua template expressions, file attachments and a lualine.nvim component.
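The lualine side is standard lualine config; the component name below is a placeholder, check the README for the real one:

```lua
require('lualine').setup({
  sections = {
    -- placeholder component name
    lualine_x = { 'flemma' },
  },
})
```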
One thing I want to be upfront about: nearly every line of code in Flemma was written by AI (Claude Code lately; Amp and Aider before that). It says so in the README. Every change was personally architected, reviewed and tested by me: I decide what gets built and I vet every diff. I think this is where a lot of software development is heading, and I'd rather be honest about it than pretend otherwise.
I'm @StanAngeloff on GitHub - long-time Neovim user and open source enthusiast. Happy to answer questions.