
Show HN: Symdex-100 – Intent-based code search using 20-byte "Cypher" metadata

https://github.com/symdex-100/symdex

1•cpachmann•1h ago

Hi HN!

I'm Camillo, the maker of Symdex-100: semantic fingerprints for fast, token-efficient codebase search.

Symdex-100 indexes every function in your repo into a small SQLite sidecar (`.symdex/index.db`). Each function gets a structured ~20-byte “Cypher” (e.g. `SEC:VAL_TOKEN--ASY` = security, validates token, async) instead of opaque embeddings. You search by intent—“where do we validate user tokens”—and get sub-second, ranked results from the index. Source files are never modified.
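To make the Cypher format concrete, here is a minimal sketch of splitting a string like `SEC:VAL_TOKEN--ASY` into its four fields. The `Cypher` type and `parse_cypher` helper are hypothetical names for illustration, not Symdex's actual API, and the sketch assumes the `:`, `_`, and `--` separators each appear exactly as in the example above.

```python
from typing import NamedTuple


class Cypher(NamedTuple):
    """Hypothetical container for the four Cypher fields."""
    domain: str   # e.g. "SEC" (security)
    action: str   # e.g. "VAL" (validates)
    obj: str      # e.g. "TOKEN"
    pattern: str  # e.g. "ASY" (async)


def parse_cypher(text: str) -> Cypher:
    """Split 'DOMAIN:ACTION_OBJECT--PATTERN' into its fields.

    Illustrative only: assumes the first ':' ends the domain, the first
    '_' after it ends the action, and the first '--' starts the pattern.
    """
    head, sep, pattern = text.partition("--")
    if not sep:
        raise ValueError(f"missing '--' in {text!r}")
    domain, sep, rest = head.partition(":")
    if not sep:
        raise ValueError(f"missing ':' in {text!r}")
    action, sep, obj = rest.partition("_")
    if not sep:
        raise ValueError(f"missing '_' in {text!r}")
    return Cypher(domain, action, obj, pattern)


if __name__ == "__main__":
    print(parse_cypher("SEC:VAL_TOKEN--ASY"))
```

Because the fields are fixed and short, the whole fingerprint stays around 20 bytes and can be compared or wildcarded field by field.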

Why: Grep and full-text search scale poorly: keyword noise, no notion of “what this function does.” AI agents burn 5k+ tokens reading 10 files to find one function. Symdex compresses function semantics into a queryable index so both humans and agents can go straight to the right place. We see up to ~50x fewer tokens for agent code exploration and ~100x faster index lookup than grepping the same codebase.

Tech (short): Python AST → per-function metadata; LLM (or rule fallback) assigns a Cypher from a fixed taxonomy (domain : action _ object -- pattern). Tiered Cypher patterns (tight/medium/broad) + multi-lane retrieval (exact, domain wildcard, action, tags, name) over SQLite with a candidate cap. Call graph is indexed too (callers/callees/trace). MCP server so Cursor/Claude can `search_codebase("validate token")` and get one precise hit instead of reading half the repo.
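As a rough illustration of tiered Cypher patterns over SQLite with a candidate cap, the sketch below widens a query from tight to broad and stops once enough hits are collected. Everything here is an assumption for illustration: the `functions` table, the `tiered_patterns` and `search` helpers, and the exact meaning of each tier are hypothetical, and only the Cypher-pattern lane is shown (no tags, names, or call graph).

```python
import sqlite3


def build_demo_index() -> sqlite3.Connection:
    """Create an in-memory stand-in for the index (hypothetical schema)."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE functions (name TEXT PRIMARY KEY, cypher TEXT)")
    conn.executemany(
        "INSERT INTO functions VALUES (?, ?)",
        [
            ("validate_token", "SEC:VAL_TOKEN--ASY"),
            ("validate_session", "SEC:VAL_SESSION--SYN"),
            ("validate_schema", "DAT:VAL_SCHEMA--SYN"),
            ("send_email", "NET:SND_EMAIL--ASY"),
        ],
    )
    return conn


def tiered_patterns(domain, action, obj, pattern):
    """Return (tier, GLOB pattern) pairs, tightest first (assumed tiers)."""
    return [
        ("tight", f"{domain}:{action}_{obj}--{pattern}"),  # all fields fixed
        ("medium", f"{domain}:{action}_*--*"),             # same domain + action
        ("broad", f"*:{action}_*--*"),                     # same action, any domain
    ]


def search(conn, domain, action, obj, pattern, cap=5):
    """Collect up to `cap` unique hits, trying tighter tiers first."""
    seen, results = set(), []
    for tier, glob in tiered_patterns(domain, action, obj, pattern):
        # GLOB treats '_' literally, unlike LIKE where '_' is a wildcard.
        rows = conn.execute(
            "SELECT name, cypher FROM functions "
            "WHERE cypher GLOB ? ORDER BY name LIMIT ?",
            (glob, cap),
        ).fetchall()
        for name, cypher in rows:
            if name not in seen:
                seen.add(name)
                results.append((tier, name, cypher))
            if len(results) >= cap:
                return results
    return results


if __name__ == "__main__":
    for hit in search(build_demo_index(), "SEC", "VAL", "TOKEN", "ASY"):
        print(hit)
```

Trying the tight pattern first means the most specific match ranks at the top, while the cap bounds how many rows any single query can pull in.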

Try it: Currently local install only. Clone the repo and run `pip install -e ".[all]"` (a PyPI release via `pip install symdex` is coming soon).

Next, set `ANTHROPIC_API_KEY`, then `symdex index .` and `symdex search "validate user tokens"`. Works with OpenAI/Gemini too; or `SYMDEX_CYPHER_FALLBACK_ONLY=1` for no API key (rule-based Cyphers only). CLI, Python API, and MCP (stdio/Streamable HTTP). Docker image for remote MCP.

Repo: https://github.com/symdex-100/symdex/ Docs: the README covers architecture, benchmarks, and FAQ.
