frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Pg_sorted_heap–Physically sorted PostgreSQL with builtin vector search

https://github.com/skuznetsov/pg_sorted_heap
4•skuznetsov37•6h ago

Comments

skuznetsov37•6h ago

  Author here. pg_sorted_heap is a PostgreSQL table AM extension that does two things:

  1. Keeps data physically sorted by primary key with per-page zone maps. At 100M rows a point query reads 1 buffer page (0.045ms) vs 8 for btree (0.5ms) vs 520K for seq scan (1.2s).

  2. Built-in IVF-PQ vector search — no pgvector dependency.

  The vector search part is where it gets interesting. The physical clustering by PK prefix IS the inverted file index. You set partition_id (IVF cluster assignment) as the leading PK column, sorted_heap
  clusters rows by it, and the zone map skips irrelevant partitions at the I/O level. No separate index structure, no 800 MB HNSW graph.

  Numbers (103K × 2880-dim vectors, 1 Gi k8s pod):

    pgvector HNSW:  97% R@1, 14ms, 806 MB index, max 2,000 dims
    IVF-PQ:         97% R@1, 22ms,  27 MB index, max 32,000 dims

  Two vector types: svec (float32, 16K dims) and hsvec (float16, 32K dims). We tested float16 vs float32 on the same dataset — no measurable recall difference. PQ quantization is the bottleneck, not storage
  precision.

  The 2,000-dim limit matters: models like Nomic embed v2 output 2880 dims, and pgvector simply can't build HNSW or IVFFlat indexes for them.

  PostgreSQL 17 + 18, Apache 2.0. Full crash recovery, online compaction, TOAST, pg_upgrade all tested.

  Vector search docs: https://skuznetsov.github.io/pg_sorted_heap/vector-search

  Happy to answer questions about the architecture, benchmarks, or trade-offs vs pgvector.

Show HN: The Roman Industrial Revolution that could have been (Vol 2)

https://thelydianstone.com/volume-2
11•miki_tyler•51m ago•2 comments

Show HN: Moongate – Ultima Online server emulator in .NET 10 with Lua scripting

https://github.com/moongate-community/moongatev2
216•squidleon•9h ago•123 comments

Show HN: Kula – Lightweight, self-contained Linux server monitoring tool

https://github.com/c0m4r/kula
2•c0m4r•1m ago•0 comments

Show HN: WebBridge turns any website into MCP tools by recording browser traffic

https://github.com/jalabulajunx/WebBridge
2•nonstopnonsense•39m ago•1 comments

Show HN: Reconstruct any image using primitive shapes, runs in-browser via WASM

https://github.com/taiseiue/primitive-playground
18•taiseiue•3d ago•4 comments

Show HN: Claude-replay – A video-like player for Claude Code sessions

https://github.com/es617/claude-replay
54•es617•8h ago•24 comments

Show HN: I open-sourced my Steam game, 100% written in Lua, engine is also open

https://github.com/willtobyte/reprobate
2•delduca•1h ago•0 comments

Show HN: A trainable, modular electronic nose for industrial use

https://sniphi.com/
27•kwitczak•3d ago•16 comments

Show HN: Sqry – semantic code search using AST and call graphs

https://sqry.dev
2•verivusai•1h ago•1 comments

Show HN: Swarm – Program a colony of 200 ants using a custom assembly language

https://dev.moment.com/
178•armandhammer10•19h ago•60 comments

Show HN: mTile – native macOS window tiler inspired by gTile

https://github.com/protortyp/mTile
2•protortyp•1h ago•0 comments

Show HN: Interactive 3D globe of EU shipping emissions

https://seafloor.pages.dev
17•marcohaber•9h ago•7 comments

Show HN: PageAgent, A GUI agent that lives inside your web app

https://alibaba.github.io/page-agent/
137•simon_luv_pho•1d ago•70 comments

Show HN: Modembin – A pastebin that encodes your text into real FSK modem audio

https://www.modembin.com
20•a13x57•9h ago•4 comments

Show HN: Graph-Oriented Generation – Beating RAG for Codebases by 89%

https://github.com/dchisholm125/graph-oriented-generation
2•dchisholm125•3h ago•0 comments

Show HN: Mantle – Remap your Mac keyboard without editing Kanata config files

https://getmantle.app/
2•gsteezy•3h ago•0 comments

Show HN: VaultNote – Local-first encrypted note-taking in the browser

https://vaultnote.saposs.com/
3•powerwild•4h ago•1 comments

Show HN: Mog, a programming language for AI agents

https://gist.github.com/belisarius222/203ac5edbc3306c34bf0481f451d4003
3•belisarius222•5h ago•1 comments

Show HN: Best ways to organize research links

https://clipnotebook.com/blog/best-ways-to-organize-research-links-2026
6•diddddy•6h ago•0 comments

Show HN: Feedster, an RSS/feed reader focused on discovery and agent integration

https://feedster.co
3•pmarsceill•6h ago•0 comments

Show HN: Argus – VSCode debugger for Claude Code sessions

https://github.com/yessGlory17/argus
3•lydionfinance•6h ago•0 comments

Show HN: Jido 2.0, Elixir Agent Framework

https://jido.run/blog/jido-2-0-is-here
316•mikehostetler•1d ago•63 comments

Show HN: Tensor Spy: inspect NumPy and PyTorch tensors in the browser, no upload

https://tensorspy.com/
22•jacobn•4d ago•2 comments

Show HN: Solace – Mac menu bar app that adapts to the world around you. Finally

https://theodorehq.com/solace/
2•theodorehq•6h ago•0 comments

Show HN: Pg_sorted_heap–Physically sorted PostgreSQL with builtin vector search

https://github.com/skuznetsov/pg_sorted_heap
4•skuznetsov37•6h ago•1 comments

Show HN: LoRA gradients on Apple's Neural Engine at 2.8W

https://github.com/jmanhype/ane-lora-training
3•jmanhype•7h ago•1 comments

Show HN: Claude skill to do your taxes

https://github.com/robbalian/claude-tax-filing
7•rob313•7h ago•1 comments

Show HN: Anchor Engine – Deterministic Semantic Memory for LLMs Local (<3GB RAM)

https://github.com/RSBalchII/anchor-engine-node
3•BERTmackl1n•7h ago•2 comments

Show HN: Poppy – A simple app to stay intentional with relationships

https://poppy-connection-keeper.netlify.app/
175•mahirhiro•1d ago•113 comments

Show HN: Moltty – Organized, Persistent AI Coding Sessions

https://moltty.com/
8•ronreiter•12h ago•3 comments