Show HN: Pg_sorted_heap – Physically sorted PostgreSQL with built-in vector search

https://github.com/skuznetsov/pg_sorted_heap
4•skuznetsov37•6h ago

Comments

skuznetsov37•6h ago

  Author here. pg_sorted_heap is a PostgreSQL table AM extension that does two things:

  1. Keeps data physically sorted by primary key with per-page zone maps. At 100M rows a point query reads 1 buffer page (0.045ms) vs 8 for btree (0.5ms) vs 520K for seq scan (1.2s).

  2. Built-in IVF-PQ vector search — no pgvector dependency.
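  The zone-map idea in item 1 can be illustrated with a small sketch. This is not pg_sorted_heap's code: the page size, function names, and in-memory lists are assumptions chosen for illustration. It only shows why sorted data plus per-page min/max lets a point lookup touch at most one page.

```python
from bisect import bisect_right

PAGE_SIZE = 4  # rows per page; tiny on purpose (hypothetical value)


def build_pages(keys, page_size=PAGE_SIZE):
    """Split sorted keys into fixed-size pages and record each page's (min, max)."""
    keys = sorted(keys)
    pages = [keys[i:i + page_size] for i in range(0, len(keys), page_size)]
    zone_map = [(p[0], p[-1]) for p in pages]
    return pages, zone_map


def point_lookup(pages, zone_map, key):
    """Return (found, pages_read), using the zone map to pick at most one page."""
    mins = [lo for lo, _ in zone_map]
    idx = bisect_right(mins, key) - 1      # last page whose min <= key
    if idx < 0 or key > zone_map[idx][1]:
        return False, 0                    # no page range can contain the key
    return key in pages[idx], 1            # exactly one page is inspected


pages, zone_map = build_pages(range(0, 100, 2))
print(point_lookup(pages, zone_map, 42))   # (True, 1)
print(point_lookup(pages, zone_map, 47))   # (False, 0): falls between page ranges
```

  Because the pages are sorted and non-overlapping, the zone map itself can be binary-searched, so the lookup never needs to walk a separate tree structure.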

  The vector search part is where it gets interesting. The physical clustering by PK prefix IS the inverted file index. You set partition_id (the IVF cluster assignment) as the leading PK column; sorted_heap
  clusters rows by it, and the zone map skips irrelevant partitions at the I/O level. There is no separate index structure and no 800 MB HNSW graph.
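  A toy model of "clustering by PK prefix is the inverted file" might look like the following. Everything here is an illustrative assumption (2-D vectors, hand-picked centroids, a two-row page size, the function names), and it ranks candidates by exact distance rather than PQ codes, so it demonstrates only the partition-skipping part.

```python
import math


def nearest_centroids(query, centroids, nprobe):
    """Return the ids of the nprobe centroids closest to the query."""
    order = sorted(range(len(centroids)),
                   key=lambda c: math.dist(query, centroids[c]))
    return order[:nprobe]


def build_table(vectors, centroids, page_size):
    """Sort rows by (partition_id, row_id) and keep a per-page partition range."""
    rows = sorted(
        (nearest_centroids(v, centroids, 1)[0], row_id, v)
        for row_id, v in enumerate(vectors)
    )
    pages = [rows[i:i + page_size] for i in range(0, len(rows), page_size)]
    zone_map = [(p[0][0], p[-1][0]) for p in pages]  # (min, max) partition_id
    return pages, zone_map


def search(query, pages, zone_map, centroids, nprobe, k):
    """Read only pages whose partition range overlaps a probed partition."""
    probes = set(nearest_centroids(query, centroids, nprobe))
    candidates, pages_read = [], 0
    for page, (lo, hi) in zip(pages, zone_map):
        if not any(lo <= p <= hi for p in probes):
            continue                                   # skipped without reading
        pages_read += 1
        candidates.extend(r for r in page if r[0] in probes)
    candidates.sort(key=lambda r: math.dist(query, r[2]))
    return [r[1] for r in candidates[:k]], pages_read


centroids = [(0, 0), (10, 10), (20, 0)]
vectors = [(0, 1), (1, 0), (10, 11), (11, 10),
           (20, 1), (21, 0), (0.5, 0.5), (10.5, 10.5)]
pages, zone_map = build_table(vectors, centroids, page_size=2)
ids, pages_read = search((10, 10.4), pages, zone_map, centroids, nprobe=1, k=2)
print(ids, pages_read, len(pages))   # [7, 2] 2 4
```

  With one probed partition, two of the four pages are read. The sort order does the work that a separate posting-list structure would otherwise do.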

  Numbers (103K × 2880-dim vectors, 1 Gi k8s pod):

    pgvector HNSW:  97% R@1, 14ms, 806 MB index, max 2,000 dims
    IVF-PQ:         97% R@1, 22ms,  27 MB index, max 32,000 dims

  Two vector types: svec (float32, up to 16K dims) and hsvec (float16, up to 32K dims). We tested float16 vs float32 on the same dataset and saw no measurable recall difference. Product quantization is the bottleneck, not storage
  precision.
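  To get a feel for why storage precision is unlikely to dominate, the sketch below compares float16 rounding error with the error of a much coarser codec. It does not reproduce the benchmark above: the data is uniform random noise, and the 8-bit uniform scalar quantizer is a stand-in for a lossy codec, not product quantization itself. Only the 2880-dim figure comes from the post.

```python
import random
import struct

DIMS = 2880                      # dimensionality quoted in the post
bytes_f32 = DIMS * 4             # 11520 bytes per vector as float32
bytes_f16 = DIMS * 2             # 5760 bytes per vector as float16


def to_float16(x):
    """Round-trip a Python float through IEEE 754 half precision."""
    return struct.unpack("<e", struct.pack("<e", x))[0]


def to_8bit(x, lo=-1.0, hi=1.0):
    """Uniform 256-level scalar quantizer over [lo, hi]; a stand-in, not PQ."""
    step = (hi - lo) / 255
    return lo + round((x - lo) / step) * step


def rms_error(values, quantize):
    """Root-mean-square difference between values and their quantized form."""
    return (sum((v - quantize(v)) ** 2 for v in values) / len(values)) ** 0.5


rng = random.Random(0)
values = [rng.uniform(-1.0, 1.0) for _ in range(10_000)]
err16 = rms_error(values, to_float16)
err8 = rms_error(values, to_8bit)
print(bytes_f32, bytes_f16)
print(f"float16 RMS error: {err16:.2e}, 8-bit RMS error: {err8:.2e}")
```

  On this synthetic data the float16 error is more than an order of magnitude below the 8-bit quantizer's, which is consistent with the claim that the coarser quantization step sets the recall ceiling.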

  The 2,000-dim limit matters: models like Nomic embed v2 output 2880 dims, and pgvector simply can't build HNSW or IVFFlat indexes for them.

  PostgreSQL 17 + 18, Apache 2.0. Full crash recovery, online compaction, TOAST, pg_upgrade all tested.

  Vector search docs: https://skuznetsov.github.io/pg_sorted_heap/vector-search

  Happy to answer questions about the architecture, benchmarks, or trade-offs vs pgvector.
