frontpage.

Show HN: CodeRLM – Tree-sitter-backed code indexing for LLM agents

https://github.com/JaredStewart/coderlm/blob/main/server/REPL_to_API.md

1•jared_stewart•1h ago

I've been building a tool that changes how LLM coding agents explore codebases, and I wanted to share it along with some early observations.

Typically claude code globs directories, greps for patterns, and reads files with minimal guidance. It works in kind of the same way you'd learn to navigate a city by walking every street. You'll eventually build a mental map, but claude never does - at least not any that persists across different contexts.

The Recursive Language Models paper from Zhang, Kraska, and Khattab at MIT CSAIL introduced a cleaner framing. Instead of cramming everything into context, the model gets a searchable environment. The model can then query just for what it needs and can drill deeper where needed.

coderlm is my implementation of that idea for codebases. A Rust server indexes a project with tree-sitter, builds a symbol table with cross-references, and exposes an API. The agent queries for structure, symbols, implementations, callers, and grep results — getting back exactly the code it needs instead of scanning for it.

The agent workflow looks like:

1. `init` — register the project, get the top-level structure

2. `structure` — drill into specific directories

3. `search` — find symbols by name across the codebase

4. `impl` — retrieve the exact source of a function or class

5. `callers` — find everything that calls a given symbol

6. `grep` — fall back to text search when you need it

This replaces the glob/grep/read cycle with index-backed lookups. The server currently supports Rust, Python, TypeScript, JavaScript, and Go for symbol parsing, though all file types show up in the tree and are searchable via grep.

It ships as a Claude Code plugin with hooks that guide the agent to use indexed lookups instead of native file tools, plus a Python CLI wrapper with zero dependencies.

For anecdotal results, I ran the same prompt against a codebase to "explore and identify opportunities to clarify the existing structure".

Using coderlm, claude was able to generate a plan in about 3 minutes. The coderlm enabled instance found a genuine bug (duplicated code with identical names), orphaned code for cleanup, mismatched naming conventions crossing module boundaries, and overlapping vocabulary. These are all semantic issues which clearly benefit from the tree-sitter centric approach.

Using the native tools, claude was able to identify various file clutter in the root of the project, out of date references, and a migration timestamp collision. These findings are more consistent with methodical walks of the filesystem and took about 8 minutes to produce.

The indexed approach did better at catching semantic issues than native tools and had a key benefit in being faster to resolve.

I've spent some effort to streamline the installation process, but it isn't turnkey yet. You'll need the rust toolchain to build the server which runs as a separate process. Installing the plugin from a claude marketplace is possible, but the skill isn't being added to your .claude yet so there are some manual steps to just getting to a point where claude could use it.

Claude continues to demonstrate significant resistance to using CodeRLM in exploration tasks. Typically to use you will need to explicitly direct claude to use it.

---

Repo: github.com/JaredStewart/coderlm

Paper: Recursive Language Models https://arxiv.org/abs/2512.24601 — Zhang, Kraska, Khattab (MIT CSAIL, 2025)

Inspired by: https://github.com/brainqub3/claude_code_RLM

Don't Use Moving Averages (2024)

When Every "AI Headshot" Looked Fake, I Spent 2 Weeks Hacking Together My Own

Canonicalize Your Web Identity and Achieve Data Sovereignty with Pesos

The Barbican Basin

One in Four Smartphones Are Now iPhones

Google follows Anthropic: Antigravity sub can't be used in OpenCode/etc.

The Simulation Argument Revisited in a Functional Universe

Building a production-grade SaaS product just with AI

Show HN: Godot MCP – Give AI assistants full access to the Godot editor

Show HN: Tahc – AI chat widget that hands off to a human when it can't answer

"Energy Based" model vs. frontier AI models for Sudoku

FDA says companies can claim "no artificial colors" if they use natural dyes

So, if Rust is in Linux can it be in Emacs, too?

Show HN: Clap.Net – Source generated CLI Parsing for .NET (Inspired by Clap-Rs)

Apparently, there is a website that ships 52.5 MB of CSS

What I learned when I started assigning the hard reading again

Show HN: TapTap AI – Use Your OpenClaw Agent from Apple Watch/AirPods/CarPlay

Two co-founders of Elon Musk's xAI resign, joining exodus

Show HN: Renovate – The Kubernetes-Native Way

Show HN: Superjson – Simple, beautiful JSON explorer

Show HN: Minimal Pomodoro timer for macOS (1.7MB, now with keyboard shortcuts)

FAA Lifts Closure at El Paso Airport

An AI-generated pull request that makes sense

Deploying Rust to Production Checklist

Show HN: Triclock – A Triangular Clock

Show HN: Deeploy v0.3.0 – terminal-first VPS app deployment tool

Show HN: Eryx, a fast WASM-based Python sandbox with native extension support

Don't Go Monolithic; the Enterprise Agent Stack Is Stratifying

Ejabberd 26.02 / ProcessOne – Erlang Jabber/XMPP/Matrix Server – Communication

Sony to exit Blu-ray disc recorder market amid rise of streaming services