I got curious about a simple question: regular expressions are purely syntactic, but what happens if you add just a little bit of semantics?

To answer, I ended up building ngrep: a grep-like tool that extends regular expressions with a new operator ~(token) that matches a word by meaning using word2vec-style embeddings (FastText, GloVe, Wikipedia2Vec).

A simple demo: "~(big)+ \b~(animal;0.35)+\b" run over Moby-Dick finds many of the ways the text refers to a large animal, surfacing "great whale", "enormous creature", "huge elephant" and so on. Pipe the matches through sort | uniq -c and the winner is, unsurprisingly, "great whale" :)
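To make the idea behind ~(animal;0.35) concrete, here is a minimal sketch of matching a word by meaning, assuming the second field is a cosine-similarity threshold (the post does not spell this out). The function names and the 3-dimensional toy vectors are invented for illustration; this is not ngrep's actual code, and real FastText or GloVe vectors have hundreds of dimensions.

```rust
use std::collections::HashMap;

/// Cosine similarity between two equal-length vectors.
fn cosine(a: &[f32], b: &[f32]) -> f32 {
    let dot: f32 = a.iter().zip(b).map(|(x, y)| x * y).sum();
    let na = a.iter().map(|x| x * x).sum::<f32>().sqrt();
    let nb = b.iter().map(|x| x * x).sum::<f32>().sqrt();
    if na == 0.0 || nb == 0.0 { 0.0 } else { dot / (na * nb) }
}

/// Hypothetical check behind `~(token;threshold)`: is `word` close enough
/// to `token` in embedding space? Words without a vector never match.
fn semantic_match(
    emb: &HashMap<&str, Vec<f32>>,
    token: &str,
    word: &str,
    threshold: f32,
) -> bool {
    match (emb.get(token), emb.get(word)) {
        (Some(t), Some(w)) => cosine(t, w) >= threshold,
        _ => false,
    }
}

/// Toy vectors, made up for this example only.
fn toy_embeddings() -> HashMap<&'static str, Vec<f32>> {
    HashMap::from([
        ("animal", vec![0.9, 0.1, 0.0]),
        ("whale", vec![0.8, 0.3, 0.1]),
        ("creature", vec![0.7, 0.2, 0.2]),
        ("harpoon", vec![0.0, 0.2, 0.9]),
    ])
}

fn main() {
    let emb = toy_embeddings();
    for word in ["whale", "creature", "harpoon", "ship"] {
        println!("{word}: {}", semantic_match(&emb, "animal", word, 0.35));
    }
}
```

With these toy vectors, "whale" and "creature" clear the 0.35 threshold, while "harpoon" (far away) and "ship" (no vector) do not.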

It's built in Rust on top of the awesome fancy-regex, and ~() composes with all the standard operators (negative lookahead, quantifiers, etc.). It's currently a PoC with many missing optimizations (e.g., no caching, no compilation to standard regex), obviously without the guarantees of plain regex and subject to the limits of w2v-style embeddings... but I thought it was worth sharing!
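For the "compilation to standard regex" optimization mentioned above, one possible approach is to expand each ~(token) into a plain alternation of the token and its nearest words before handing the pattern to the regex engine. The sketch below assumes a hypothetical expand_semantic helper and a caller-supplied neighbour lookup; ngrep does not do this today. It ignores the optional ;threshold field and does not regex-escape the words, which a real implementation would need to handle.

```rust
/// Hypothetical helper: rewrite each `~(token)` in `pattern` into an
/// alternation such as `(?:big|great|huge)`, using a caller-supplied
/// lookup that returns the neighbours of a token.
fn expand_semantic(pattern: &str, neighbours: impl Fn(&str) -> Vec<String>) -> String {
    let mut out = String::new();
    let mut rest = pattern;
    while let Some(start) = rest.find("~(") {
        // Copy everything before the operator unchanged.
        out.push_str(&rest[..start]);
        let after = &rest[start + 2..];
        match after.find(')') {
            Some(end) => {
                // Drop an optional `;threshold` suffix in this sketch.
                let token = after[..end].split(';').next().unwrap_or("");
                let mut words = vec![token.to_string()];
                words.extend(neighbours(token));
                out.push_str(&format!("(?:{})", words.join("|")));
                rest = &after[end + 1..];
            }
            None => {
                // Unterminated operator: keep the remaining text as-is.
                out.push_str(&rest[start..]);
                rest = "";
            }
        }
    }
    out.push_str(rest);
    out
}

fn main() {
    // Stand-in for an embedding lookup; the neighbours are made up.
    let lookup = |t: &str| match t {
        "big" => vec!["great".to_string(), "huge".to_string()],
        _ => Vec::new(),
    };
    println!("{}", expand_semantic(r"~(big)+ \bwhale\b", lookup));
}
```

The trade-off is that the expanded pattern can be cached and run by an ordinary regex engine, at the cost of a potentially very large alternation for loose thresholds.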

Show HN: Han – A Korean programming language written in Rust

Show HN: Ichinichi – One note per day, E2E encrypted, local-first

Show HN: GitAgent – An open standard that turns any Git repo into an AI agent

Show HN: Learn Arabic with spaced repetition and comprehensible input

Show HN: Costly – Open-source SDK that audits your LLM API costs

Show HN: I built an open-source agent-run trading fund with real capital

Show HN: Replacing $50k manual forensic audits with a deterministic .py engine

Show HN: AI coding agent for VS Code with pay-as-you-go pricing- no subscription

Show HN: ZaneOps, A beautiful and fast self hosted alternative to Vercel

Show HN: ngrep – grep plus word embeddings (Rust)

Show HN: Cloak – send and receive secrets from OpenClaw

Show HN: Json.express – Query and explore JSON in the browser, zero dependencies

Show HN: Pidrive – File storage for AI agents (mount S3, use ls/cat/grep)

Show HN: Data-anim – Animate HTML with just data attributes

Show HN: Ink – Deploy full-stack apps from AI agents via MCP or Skills

Show HN: Paperctl- An Arxiv CLI designed for agents

Show HN: Language Life – Learn a language by living a simulated life

Show HN: KeyID – Free email and phone infrastructure for AI agents (MCP)

Show HN: Channel Surfer – Watch YouTube like it’s cable TV

Show HN: Context Gateway – Compress agent context before it hits the LLM

Show HN: I built Wool, a lightweight distributed Python runtime

Show HN: Zap Code – AI code generator that teaches kids real HTML/CSS/JS

Show HN: Auto-Save Claude Code Sessions to GitHub Projects

Show HN: What was the world listening to? Music charts, 20 countries (1940–2025)

Show HN: Axe – A 12MB binary that replaces your AI framework

Show HN: Hedra – an open-world 3D game I wrote from scratch before LLMs

Show HN: SupplementDEX – The Evidence-Based Supplement Database

Show HN: OneCLI – Vault for AI Agents in Rust

Show HN: BirdDex – Pokémon Go, but with real life birds

Show HN: QKD eavesdropper detector using Krylov complexity-open source Python
