frontpage.

Show HN: I built a full LLM chat client as a Neovim filetype

3•StanAngeloff•1h ago

I started Flemma because I wanted to bring my AI workload into Neovim.

My workflow started with me living in developer portals - Claude Workbench, OpenAI Platform, Vertex (the horrors of GCP's Console UI!) I would spend hours in these web UIs crafting prompts, iterating on system instructions, maintaining a carefully curated library of sessions. But the browser really wasn't optimised for this kind of interaction. Editing was clunky (muscle memory <C-W> would close the tab instead and wipe my work), going back and forth between the LLM and myself felt off. So I had an idea: what if I could turn a Markdown document into an LLM chat interface?

At first I thought that would be enough - just do what I was already doing in the browser... but in Neovim. And sure enough having been accustomed to my own setup for the last decade I immediately felt a productivity boost. Writing functional requirement documents, statements of work, deep research across different systems - all of it felt better in "my" editor.

I then had a taste of Aider... and Claude Code... and all the other tools that were coming out. And Flemma felt lacking. So I started building: tool support, better conversation organisation, a proper UI to tame the noise that tool calls introduce.

Today, Flemma is a fully evolved AI workspace. It runs autonomous agent loops, interacts with multiple LLMs (Anthropic, OpenAI, Vertex, Moonshot) and lets you switch providers mid-conversation - something I do occasionally during research, asking two or three different models for their take on a problem, then combining findings into a final document.

Under the hood, .chat files are just Markdown with role markers (@You:, @Assistant:), but Flemma treats them as a proper filetype with its own parser, AST, LSP server, template engine and sandboxed tool execution. The buffer *is* the state - no hidden database, no JSON history, no server process. Your conversations are portable, greppable and version-controllable (I backup mine in Git). You can close Neovim, reopen the file a week later and pick up exactly where you left off.

It's got prompt caching, extended thinking, 7 built-in tools, a layered config system that gets out of your way and a UI that keeps getting refined to bring the noise down and make long agent sessions pleasant to work in (not quite there yet).

Flemma is for anyone who'd rather stay in Neovim. If that's you, I'd love to hear what you think.

Repo: https://github.com/Flemma-Dev/flemma.nvim

Demo: http://flemma.dev/flemma.nvim/blob/develop/README.md#-flemma

Show HN: Signet – Cryptographic action receipts for AI agents

I Built an AI Organisation

Ask HN: What are you working on? (April 2026)

JetBrains Blog RSS Support Is Now Generally Available

Show HN: Draw a Picture for My Cat

Australian government and Anthropic sign MOU for AI safety and research

Show HN: Inbox.camp, AWS WorkMail Replacement

We're Here. – VRChat

Show HN: I built a risk map that adapts to your worldview

CSS or BS?

The Open Source AI Engineering Platform for Agents, LLMs and Models

An AI/ML accuracy tale (2023)

8,400 GitHub repos share a naming convention that traces back to South Park joke

ToE: Inflationary Consistency Relation from Decoherence-Induced Occupancy

Google This: A module to retrieve organic search results and more from Google

Claude Code skills based on the book "Save Your Startup"

Category Theory Illustrated – Types

VibePad – New AI Padding Model

Reaching 100% Type Coverage by Deleting Unannotated Code

What One Month of Intense Red-Light Therapy Did to My Mind

Itsid – LLM with perfect input reproduction for e.g. license removal

Show HN: I made a Mario Galaxy game with Claude Code and Three.js in 53 days

Clock that shows what percentage of your life has passed

Garryslist Code Audit

Building an Arcade Cabinet (Part 1, Design & Materials)

"Why does this code look like this?" Nobody knows. That's the problem

Iran threatens Nvidia, Apple and other 18 tech companies

Show HN: Sycamore – next gen Rust UI library powered by fine-grained reactivity

Show HN: Apindex – self-hosted API catalog to map and understand internal APIs

Show HN: Agent Arnold – Gym tracker 100% vibe-coded from my phone between sets