I've been working on a vector search engine called QSS (Quantized Similarity Search). It's written in C and explores the idea of aggressively quantizing embedding vectors to 1 bit per dimension. It uses XOR + popcount for fast approximate search, followed by re-ranking with cosine similarity on the original vectors.
The main goal is to see how far quantization can be pushed without sacrificing too much search quality, while gaining significantly in memory usage and speed.
How it works

Embeddings are quantized to 1 bit per dimension (e.g. 300D → 300 bits → ~40 bytes).
Search is done using bitwise XOR and popcount (Hamming distance).
A shortlist is re-ranked using cosine similarity on the original (float) embeddings.
Supports GloVe, Word2Vec, and fastText formats.
Goals

Analyze the trade-offs between quantization and search accuracy.
Measure potential speed and memory gains.
Explore how this approach scales with larger datasets.
Preliminary tests

I’ve only run a few small-scale tests so far, but the early signs are encouraging:
For some queries (e.g. "hello", "italy"), the top 30 results matched the full-precision cosine search.
On Word2Vec embeddings, the quantized pipeline was up to 18× faster than the standard cosine similarity loop.
These results are anecdotal for now—I’m sharing the project early to get feedback before going deeper into benchmarks.
Other notes

Word lookup is currently linear and unoptimized; the focus so far has been on the similarity search logic.
Testing has been done single-threaded on a 2018 iMac (3.6 GHz Intel i3).
If you're interested in vector search, quantization, or just low-level performance tricks, I'd love your thoughts:
Do you think this kind of aggressive quantization could work at scale?
Are there other fast approximate search techniques you'd recommend exploring?
Repo is here: https://github.com/buddyspencer/QSS
Thanks for reading!