I've been building a small in-memory vector search library as a way to explore ANN (approximate nearest neighbor) systems from first principles, inspired by Spotify's annoy and Meta's FAISS.
Currently, it's a CPU-first C++ library with Python bindings that supports Flat/IVF indexes and Cosine/L2 distance metrics. There's a Colab notebook linked in the README if you want to try it quickly without installing anything.
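For anyone unfamiliar with IVF: the index buckets vectors under their nearest coarse centroid (typically from k-means), and a query only scans the few closest buckets instead of the whole dataset. Here's a simplified illustration of that idea; it's not my library's actual API, and the k-means training step is left out:

```cpp
#include <algorithm>
#include <cstddef>
#include <utility>
#include <vector>

// Simplified IVF sketch: vectors are bucketed under their nearest coarse
// centroid, and a query scans only the nprobe closest buckets.
// Centroids would come from k-means training, which is omitted here.
struct IVFSketch {
    std::vector<std::vector<float>> centroids;    // nlist coarse centroids
    std::vector<std::vector<std::size_t>> lists;  // vector ids per centroid
    std::vector<std::vector<float>> data;         // original vectors

    explicit IVFSketch(std::vector<std::vector<float>> cents)
        : centroids(std::move(cents)), lists(centroids.size()) {}

    static float l2(const std::vector<float>& a, const std::vector<float>& b) {
        float d = 0.f;
        for (std::size_t i = 0; i < a.size(); ++i) {
            float t = a[i] - b[i];
            d += t * t;
        }
        return d;
    }

    // Assign each new vector to the inverted list of its nearest centroid.
    void add(const std::vector<float>& v) {
        std::size_t best = 0;
        for (std::size_t c = 1; c < centroids.size(); ++c)
            if (l2(v, centroids[c]) < l2(v, centroids[best])) best = c;
        lists[best].push_back(data.size());
        data.push_back(v);
    }

    // Scan only the nprobe closest lists; return ids of the k nearest vectors.
    std::vector<std::size_t> search(const std::vector<float>& q,
                                    std::size_t k, std::size_t nprobe) const {
        std::vector<std::pair<float, std::size_t>> cd;  // (dist to centroid, list id)
        for (std::size_t c = 0; c < centroids.size(); ++c)
            cd.emplace_back(l2(q, centroids[c]), c);
        std::size_t probes = std::min(nprobe, cd.size());
        std::partial_sort(cd.begin(), cd.begin() + probes, cd.end());

        std::vector<std::pair<float, std::size_t>> cand; // (dist to vector, vector id)
        for (std::size_t p = 0; p < probes; ++p)
            for (std::size_t id : lists[cd[p].second])
                cand.emplace_back(l2(q, data[id]), id);

        std::size_t topk = std::min(k, cand.size());
        std::partial_sort(cand.begin(), cand.begin() + topk, cand.end());
        std::vector<std::size_t> out;
        for (std::size_t i = 0; i < topk; ++i) out.push_back(cand[i].second);
        return out;
    }
};
```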
I went from a naive brute-force scan (millisecond-level latency) to under half a millisecond per query with IVF, benchmarked on a SIFT1M subset. After that, I increased throughput by ~2.4x by making the search multi-threaded on my 4-core (U-series) CPU. With scalar quantization, I cut memory usage by ~73% with negligible loss in accuracy.
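To give a sense of where the memory number comes from: with 8-bit codes, each float32 component becomes a single byte, so a 128-dim SIFT vector drops from 512 bytes to 128 plus a few bytes of per-vector metadata, which lines up with the ~73% reduction. A simplified sketch of the encode/decode (not the exact code in the repo):

```cpp
#include <algorithm>
#include <cstdint>
#include <vector>

// Sketch of 8-bit scalar quantization with per-vector min/max scaling:
// each float32 component becomes one byte, plus two floats of per-vector
// metadata used for dequantization.
struct SQ8Vector {
    float vmin;                      // smallest component of the original vector
    float scale;                     // dequantize as vmin + code * scale
    std::vector<std::uint8_t> codes; // one byte per dimension
};

SQ8Vector sq8_encode(const std::vector<float>& v) {
    float vmin = *std::min_element(v.begin(), v.end());
    float vmax = *std::max_element(v.begin(), v.end());
    float scale = (vmax - vmin) / 255.f;
    if (scale == 0.f) scale = 1.f;   // constant-vector edge case
    SQ8Vector out{vmin, scale, {}};
    out.codes.reserve(v.size());
    for (float x : v)
        out.codes.push_back(static_cast<std::uint8_t>((x - vmin) / scale + 0.5f));
    return out;
}

std::vector<float> sq8_decode(const SQ8Vector& q) {
    std::vector<float> v;
    v.reserve(q.codes.size());
    for (std::uint8_t c : q.codes) v.push_back(q.vmin + c * q.scale);
    return v;
}
```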
I've documented the performance progression and the overall architecture in the repository.
Right now I'm focused on tightening memory alignment, improving cache locality, and making top-k selection faster; longer term, I plan to implement IVF-PQ and HNSW.
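On the top-k point, the standard trick is to avoid fully sorting all candidate distances and instead keep a bounded max-heap of size k. A rough sketch of the heap version (not code from the repo):

```cpp
#include <cstddef>
#include <queue>
#include <utility>
#include <vector>

// Bounded top-k selection: keep a max-heap of the k best (distance, id) pairs
// seen so far, so selecting from N candidates costs O(N log k) instead of the
// O(N log N) of a full sort. Returns results in ascending distance order.
std::vector<std::pair<float, std::size_t>>
top_k(const std::vector<std::pair<float, std::size_t>>& candidates, std::size_t k) {
    if (k == 0) return {};
    // Max-heap ordered by distance: the worst of the current best k sits on top.
    std::priority_queue<std::pair<float, std::size_t>> heap;
    for (const auto& c : candidates) {
        if (heap.size() < k) {
            heap.push(c);
        } else if (c.first < heap.top().first) {
            heap.pop();            // evict the current worst
            heap.push(c);
        }
    }
    std::vector<std::pair<float, std::size_t>> out(heap.size());
    for (std::size_t i = out.size(); i-- > 0; ) {  // unload from worst to best
        out[i] = heap.top();
        heap.pop();
    }
    return out;
}
```

An alternative is std::nth_element (partition-based selection) followed by sorting just the first k elements, which can be faster for large candidate sets.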
I'd appreciate any feedback on where to take this next and how to think through the process.
I used AI tools to implement serialization and the bindings and for early architecture brainstorming; the details and prompts are documented in the README's 'Disclosure' section.