frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Show HN: Continue – Source-controlled AI checks, enforceable in CI

https://docs.continue.dev
1•sestinj•35s ago•0 comments

Chess Engines Do Weird Stuff

https://girl.surgery/chess
2•admiringly•50s ago•0 comments

Show HN: Agent Breadcrumbs – Unified Work Log Across Claude, Codex, OpenClaw

https://github.com/ejcho623/agent-breadcrumbs
1•ejcho623•1m ago•0 comments

Show HN: Listen to sounds around the world and guess the location

https://placethesound.vikborges.com
1•bit_nomad•1m ago•0 comments

Micron Is Spending $200B to Break the AI Memory Bottleneck

https://www.wsj.com/tech/micron-is-spending-200-billion-to-break-the-ai-memory-bottleneck-a4cc74a1
1•gmays•2m ago•0 comments

Thank HN: You helped save 33,241 lives

2•chaseadam17•2m ago•0 comments

Tech Startup Culture Not as Innovative as Founders May Think (2025)

https://www.hec.edu/en/dare/innovation-entrepreneurship/tech-startup-culture-not-innovative-found...
1•wslh•3m ago•0 comments

Astronomers track bubbles on a star's surface in the most detailed video yet

https://www.almaobservatory.org/en/press-releases/astronomers-track-bubbles-on-a-stars-surface-in...
1•nobody9999•3m ago•0 comments

Massively Parallel Programming

https://dcosson.substack.com/p/massively-parallel-programming
2•dcosson•4m ago•0 comments

Launch HN: Sonarly (YC W26) – AI agent to triage and fix your production alerts

https://sonarly.com/
2•Dimittri•5m ago•0 comments

AI-authored code contains worse bugs than software crafted by humans

https://www.theregister.com/2025/12/17/ai_code_bugs/
1•thefilmore•5m ago•0 comments

About the Indianapolis Hiking Club

https://www.indyhike.org/about.shtml
1•mooreds•5m ago•0 comments

Multi-player agents are the future

https://charlielabs.ai/blog/why-the-next-agent-interface-is-shared/
1•mrbbk•6m ago•0 comments

Show HN: 6cy – Experimental streaming archive format with per-block codecs

https://github.com/byte271/6cy
1•yihac1•7m ago•0 comments

Grok 4.20 Beta

https://grok.com/
1•tosh•7m ago•0 comments

How an AI baby-tracking app grew to ~$300K/month

https://www.starterstory.com/stories/sprouty
1•igor_ryabenkiy•8m ago•1 comments

So You Want to Build a Tunnel

https://practical.engineering/blog/2026/2/17/so-you-want-to-build-a-tunnel
2•crescit_eundo•9m ago•0 comments

Show HN: Trained YOLOX from scratch to avoid Ultralytics (aircraft detection)

https://austinsnerdythings.com/2026/02/13/training-yolox-aircraft-detection-mit-license/
1•auspiv•10m ago•0 comments

Openclaw 2.0. Openrappter.

https://github.com/kody-w/openrappter
1•kody_w•10m ago•0 comments

My thoughts on Open Source – after a decade and in AI era

https://blog.inoki.cc/2026/02/17/My-thoughts-on-Open-Source-after-a-decade-2026/index.html
1•inoki•10m ago•1 comments

Most people are individually optimistic, but think the world is falling apart

https://hannahritchie.substack.com/p/many-people-are-individually-optimistic
2•speckx•11m ago•0 comments

Host range and antibiotic resistance are shaped by distinct survival strategies

https://academic.oup.com/nar/article/54/2/gkaf1479/8427120?login=false
1•PaulHoule•13m ago•0 comments

The Best Programming Language for the End of the World

https://web.archive.org/web/20250326100613/https://www.wired.com/story/forth-collapse-os-apocalyp...
1•tosh•13m ago•0 comments

Async/Await on the GPU

https://www.vectorware.com/blog/async-await-on-gpu/
2•Philpax•15m ago•0 comments

Show HN: OpenBoot – 2 commands to replace a 3-hour Mac setup ritual

https://github.com/openbootdotdev/openboot
3•superjam2026•16m ago•1 comments

SynthBench: 81% Zero-Shot Accuracy from AI-Generated Training Data

https://www.laksh.us/blog/synthbench
1•LakshyaC•16m ago•0 comments

Show HN: Browser-based EEG neurofeedback detecting golden ratio brain coherence

https://resonate.neurokinetikz.com
1•neurokinetikz•17m ago•0 comments

Texas sues TP Link alleging Chinese government access to its devices

https://www.reuters.com/legal/litigation/texas-sues-tp-link-alleging-chinese-government-access-it...
2•giuliomagnifico•17m ago•0 comments

Only 40 lines of code [video]

https://www.youtube.com/watch?v=R3ydGMRtnqU
1•EPendragon•18m ago•0 comments

A new DB for Iceberg/Hudi because low-latency serving on a lake is impractical

https://www.onehouse.ai/blog/announcing-onehouse-lakebase-database-speeds-finally-on-the-lakehouse
4•dunwaldo•18m ago•0 comments
Open in hackernews

Sub-Millisecond RAG on Apple Silicon. No Server. No API. One File

https://github.com/christopherkarani/Wax
3•ckarani•1h ago

Comments

ckarani•1h ago
I built Wax because every RAG solution required either Pinecone/Weaviate in the cloud or ChromaDB/Qdrant running locally. I wanted the SQLite of RAG -- import a library, open a file, query. Except for multimodal content at GPU speed.

The architecture that makes this work: Metal-accelerated vector search -- Embeddings live directly in unified memory (MTLBuffer). Zero CPU-GPU copy overhead. Adaptive SIMD4/SIMD8 kernels + GPU-side bitonic sort = sub-millisecond search on 10K+ vectors (vs ~100ms CPU). This isn't just "faster" -- it enables interactive search UX that wasn't possible before.

Atomic single-file storage (.mv2s) -- Everything in one crash-safe binary: embeddings, BM25 index, metadata, compressed payloads. Dual-header writes with generation counters = kill -9 safe. Sync via iCloud, email it, commit to git. The file format is deterministic -- identical input produces byte-identical output.

Query-adaptive hybrid fusion -- Four parallel search lanes (BM25, vector, timeline, structured memory). Lightweight classifier detects intent ("when did I..." → boost timeline, "find documentation about..." → boost BM25). Reciprocal Rank Fusion with deterministic tie-breaking = identical queries always return identical results.

Photo/Video RAG -- Index your photo library with OCR, captions, GPS binning, per-region embeddings. Query "find that receipt from the restaurant" searches text, visual similarity, and location simultaneously. Videos get segmented with keyframe embeddings + transcript mapping. Results include timecodes for jump-to-moment navigation. All offline -- iCloud-only photos get metadata-only indexing. Swift 6.2 strict concurrency -- Every orchestrator is an actor. Thread safety proven at compile time, not runtime. Zero data races, zero @unchecked Sendable, zero escape hatches.

Deterministic context assembly -- Same query + same data = byte-identical context every time. Three-tier surrogate compression (full/gist/micro) adapts based on memory age. Bundled cl100k_base tokenizer = no network, no nondeterminism.

import Wax

let brain = try await MemoryOrchestrator(at: URL(fileURLWithPath: "brain.mv2s"))

// Index try await brain.remember("User prefers dark mode, gets headaches from bright screens")

// Retrieve let context = try await brain.recall(query: "user display preferences") // Returns relevant memories with source attribution, ready for LLM context

What makes this different:

Zero dependencies on cloud infrastructure -- No API keys, no vendor lock-in, no telemetry Production-grade concurrency -- Not "it works in my tests," but compile-time proven thread safety Multimodal from the ground up -- Text, photos, videos indexed with shared semantics Performance that unlocks new UX -- Sub-millisecond latency enables real-time RAG workflows

## Wax Performance (Apple Silicon, as of Feb 17, 2026)

  - 0.84ms vector search at 10K docs (Metal, warm cache)
  - 9.2ms first-query after cold-open for vector search
  - ~125x faster than CPU (105ms) and ~178x faster than SQLite FTS5 (150ms) in
    the same 10K benchmark
  - 17ms cold-open → first query overall
  - 10K ingest in 7.756s (~1289 docs/s) with hybrid batched ingest
  - 0.103s hybrid search on 10K docs
  - Recall path: 0.101–0.103s (smoke/standard workloads)
Built for: Developers shipping AI-native apps who want RAG without the infrastructure overhead. Your data stays local, your users stay private, your app stays fast.

The storage format and search pipeline are stable. The API surface is early but functional. If you're building RAG into Swift apps, I'd love your feedback.

GitHub: https://github.com/christopherkarani/Wax

Star it if you're tired of spinning up vector databases for what should be a library call.