frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: QingMing – Exact vector search on consumer GPUs (no index)

https://github.com/uulong950/qingming-flat/blob/main/README.md
1•uulong•1h ago

Comments

uulong•1h ago
The Problem: Everyone is using HNSW (graph indexes) for vector search. It works great for servers, but it introduces build-time latency, memory overhead (edges), and random access patterns that kill performance on consumer hardware.

The Project: QingMing is a header-only C++ engine that implements exact brute-force search. Instead of pruning the search space, I optimized the memory access pattern to saturate the HBM/GDDR6 bandwidth of consumer GPUs.

Benchmarks (Consumer Hardware):

  Desktop (NVIDIA RTX 5090D - 24GB)
  ---------------------------------
  Dataset:     SIFT-1M (128-dim)
  Recall:      99.2% @ 1 (FP32 variance), 100% @ 10
  Throughput:  9,354 QPS (Batch=10k)
  Latency:     ~5.5ms (P99)
  Build Time:  0 seconds

  Desktop (AMD Radeon 7900 XTX - 24GB)
  ------------------------------------
  Dataset:     SIFT-1M (128-dim)
  Recall:      99.2% @ 1, 100% @ 10
  Throughput:  6,275 QPS (Batch=10k)
  Latency:     ~11.2ms (P99)
  Note:        Running via HIP/ROCm 6.2 on Ubuntu

  Mobile (Snapdragon 8 Gen 5)
  ---------------------------
  Scenario:    100k Vectors (128d) for personal knowledge base
  Latency:     ~8ms per query
  Endurance:   Ran 10k consecutive queries with ZERO thermal throttling
               (due to L3/System Cache residency optimization)
Why use this? 1. Local RAG: Run high-quality retrieval on your gaming PC or phone. 2. Simplicity: No hyperparameters to tune (ef_search, M, nprobe). 3. Deterministic: No approximation errors for critical data.

Happy to answer questions about the NEON/CUDA/HIP memory coalescing details!

Rise in Sophisticated Dark Patterns Designed to Trick and Trap Consumers (2022)

https://www.ftc.gov/news-events/news/press-releases/2022/09/ftc-report-shows-rise-sophisticated-d...
1•wslh•2m ago•0 comments

Change Blindness in UX (2018)

https://www.nngroup.com/articles/change-blindness-definition/
1•wslh•5m ago•0 comments

Rust's Standard Library on the GPU

https://www.vectorware.com/blog/rust-std-on-gpu/
1•sbt567•6m ago•0 comments

Community Pulse 2025 End of Year Wrap-Up [audio]

https://www.communitypulse.io/102-2025-wrap-up
1•mooreds•6m ago•0 comments

Every Enemy from Super Mario 64, 3D Printed [video]

https://www.youtube.com/watch?v=U6yxtHJcxAs
1•us-merul•7m ago•0 comments

StatechartX – performant state machine runtime written in Go

https://github.com/comalice/statechartx
1•all2•10m ago•1 comments

Show HN: Open-source multi-agent subtitle translator (self-hosted)

https://github.com/subtitlesdog/Subtitles.Translate.Agent
1•mrqjr•12m ago•0 comments

MIT's new 'recursive' framework lets LLMs process 10M tokens

https://venturebeat.com/orchestration/mits-new-recursive-framework-lets-llms-process-10-million-t...
1•prng2021•14m ago•0 comments

I don't like skiing in the shade, so I built a ski trail shade map

https://skishade.com
1•marcushyett•15m ago•0 comments

Tour website's AI sends visitors to Tasmanian sites that do not exist

https://www.abc.net.au/news/2026-01-22/ai-images-of-tasmania-on-tour-website/106253448
1•beatthatflight•16m ago•1 comments

198-Bit Constraint Framework: New Physics from First Principles

https://zenodo.org/records/18170177
1•More_Fee_Us•16m ago•1 comments

Trump FCC threatens to enforce equal-time rule on late-night talk shows

https://arstechnica.com/tech-policy/2026/01/trump-fcc-tries-to-get-more-republicans-on-late-night...
4•voxadam•22m ago•1 comments

NexDock is building a new Windows phone that you can buy in 2026

https://www.windowscentral.com/microsoft/windows-11/nexdock-is-building-a-new-windows-phone-that-...
3•LorenDB•22m ago•0 comments

Elizabeth Holmes asks President Donald Trump to let her out of prison early

https://www.cnn.com/2026/01/21/tech/elizabeth-holmes-theranos-trump-commute-sentence
6•g-b-r•23m ago•2 comments

Tsfresh

https://tsfresh.readthedocs.io/en/latest/
1•jonbaer•26m ago•0 comments

The Art of Craftsmanship (Monozukuri) in the Age of AI

https://rapha.land/the-art-of-craftsmanship-monozukuri-in-the-age-of-ai/
1•vinhnx•26m ago•0 comments

Myth of the Monolithic ERP: Why They Keep Failing [video]

https://www.youtube.com/watch?v=o6d94HNGV1s
1•rossdavidh•32m ago•0 comments

An A.I. Startup Says It Wants to Empower Workers, Not Replace Them

https://www.nytimes.com/2026/01/20/technology/humans-ai-anthropic-xai.html
3•bookofjoe•35m ago•2 comments

Testosterone went from prostate cancer villain to potential ally

https://theconversation.com/how-testosterone-went-from-prostate-cancer-villain-to-potential-ally-...
2•PaulHoule•37m ago•0 comments

Flashlabs releases the world’s first open-source voice cloning model

https://twitter.com/flashlabsdotai/status/2013993446047158550
3•sangwen•39m ago•2 comments

Show HN: iMessage-data-foundry – Synthetic iMessage Data Generator

https://github.com/johnlarkin1/imessage-data-foundry
2•jlarks32•41m ago•0 comments

Open4D – Open-Source 4D Geometry Processing, Compression and Streaming Library

https://github.com/SINRG-Lab/Open4D
1•hex823•43m ago•1 comments

Palantir CEO: With AI, economies won't need immigration

https://www.theregister.com/2026/01/21/palantir_ceo_karp_claims_ai/
3•abdelhousni•43m ago•1 comments

GPTZero finds 100 new hallucinations in NeurIPS 2025 accepted papers

https://gptzero.me/news/neurips/
3•dnw•43m ago•0 comments

MsgBored, Screaming into the Abyss

https://johntrager.net/projects/msg-bored/
2•jtrager•44m ago•0 comments

AI recruiters: faster, cheaper, and still clueless

https://pksunkara.com/thoughts/ai-recruiters-faster-cheaper-and-still-clueless/
2•pksunkara•45m ago•0 comments

Explore the Mandelbrot Set

https://math.hws.edu/eck/js/mandelbrot/MB.html
1•mooreds•45m ago•0 comments

Summary paper on the STAR-Vote system [pdf]

https://www.cs.rice.edu/~dwallach/pub/star-summative-2018.pdf
1•thechao•54m ago•0 comments

FCC: Late-night and daytime talk shows must offer equal time for candidates

https://www.nbcnews.com/politics/elections/fcc-late-night-daytime-talk-shows-equal-time-candidate...
3•ceejayoz•54m ago•0 comments

The divergence of centralized systems and individual agency

4•Kiplomat-SouCmp•56m ago•3 comments