frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Kill Chain

https://artificialbureaucracy.substack.com/p/kill-chain
1•_delirium•3m ago•0 comments

What Happens with Open Source in the Age of AI?

https://turso.tech/blog/what-happens-with-oss-in-the-age-of-ai
1•cyndunlop•7m ago•0 comments

Show HN: AgentVerse – Open social network for AI agents (Mar 2026)

https://nickakre.github.io/agentverse-social/
1•nickakre•8m ago•0 comments

Hungary's Foreign Minister Briefed Russia on EU Meetings in Real Time

https://euromaidanpress.com/2026/03/21/hungarys-foreign-minister-briefed-lavrov-on-eu-meetings-in...
3•vrganj•8m ago•0 comments

Sam Altman Sister's Abuse Claims Against Him Dismissed for Now

https://www.bloomberg.com/news/articles/2026-03-21/sam-altman-sister-s-abuse-claims-against-him-d...
1•doener•8m ago•0 comments

Deploy model whose predictions most resemble the ensemble mean

https://github.com/finite-sample/stable_selection
1•neehao•10m ago•1 comments

Show HN: How I built a resume editor using AI with zero web dev experience

https://www.tailortojob.app/
1•KasparSoukup•13m ago•0 comments

When AI Writes the Software, Who Verifies It?

https://leodemoura.github.io/blog/2026-2-28-when-ai-writes-the-worlds-software-who-verifies-it/
1•vinhnx•15m ago•0 comments

Show HN: CI-debugger – Debug GitHub Actions locally with breakpoints

https://github.com/murataslan1/ci-debugger
1•murataslan•16m ago•1 comments

Refraktd – crowdsourced news bias ratings by article or outlet

https://refraktd.com
1•misterinfo•24m ago•0 comments

Screen Recorder – Free browser-based screen recording with zoom, blur, and cuts

https://screenrecorder.one
1•realfdi•27m ago•0 comments

Package Manager Mirroring – Andrew Nesbitt

https://nesbitt.io/2026/03/20/package-manager-mirroring.html
1•abdelhousni•31m ago•0 comments

The HTML Review 05

https://thehtml.review/05/
2•duck•32m ago•0 comments

Flow Matching and Diffusion Models – 2026 Version

https://diffusion.csail.mit.edu/2026/index.html
2•duck•32m ago•0 comments

Dark Reader – Browser extension that generates dark mode for web pages

https://darkreader.org/
1•Brysonbw•37m ago•0 comments

I built an AI teammate that takes Jira tickets and turns them into PRs

https://github.com/ignify-rd/claude-teammate
1•tuananhdao•43m ago•0 comments

Exponential lower bound for fan-in-2 circuits computing Hamiltonian Cycle

https://zenodo.org/records/19103649
1•jbgallagher0002•44m ago•0 comments

Show HN: Command_line – a fast, terminal-style Hacker News app

https://apps.apple.com/us/app/command-line/id6468368351
1•danimalcrackerz•46m ago•0 comments

A tool to “swallow” outputs from Claude/Codex/Cursor and reuse them

https://github.com/vule022/swallow
1•vukasinrancic•46m ago•1 comments

My Willing Complicity in "Human Rights Abuse"

https://ussri.substack.com/p/my-willing-complicity-in-human-rights
2•barry-cotter•47m ago•0 comments

German Mathematician Gerd Faltings Wins Abel Prize for Number Theory Work

https://www.nytimes.com/2026/03/19/science/abel-prize-mathematics.html
1•bookofjoe•48m ago•1 comments

Chest Fridge

https://mtbest.net/chest-fridge/
5•wolfi1•53m ago•3 comments

Argentina was one of the richest countries at the beginning of the 20th century

https://ourworldindata.org/data-insights/argentina-was-one-of-the-richest-countries-in-the-world-...
2•kamaraju•1h ago•0 comments

JavaScript Is Enough

https://geajs.com/
14•arbayi•1h ago•4 comments

Lightweight Compression in DuckDB

https://duckdb.org/2022/10/28/lightweight-compression
1•Rendello•1h ago•0 comments

JPMorgan deploys tech to monitor junior bankers' working hours

https://www.ft.com/content/5ee9d5f1-9b9b-4aee-8712-111e873e241a
1•petethomas•1h ago•0 comments

Why craft-lovers are losing their craft

https://writings.hongminhee.org/2026/03/craft-alienation-llm/
13•vinhnx•1h ago•2 comments

OS X Stats Nano – Ultra-light macOS menu bar monitor (180 KB)

1•xmao•1h ago•1 comments

Musk found liable to Twitter shareholders in fraud lawsuit over $44B takeover

https://www.reuters.com/sustainability/boards-policy-regulation/musk-found-liable-twitter-shareho...
7•karp773•1h ago•1 comments

Big HVAC is shaking: Beat the heat with a wet towel and spite for $240B [video]

https://www.youtube.com/watch?v=siBBCap_Z-s
3•SilentM68•1h ago•0 comments
Open in hackernews

Show HN: ClawMem – Open-source agent memory with SOTA local GPU retrieval

https://github.com/yoloshii/ClawMem
3•yoloshii•1h ago
So I've been building ClawMem, an open-source context engine that gives AI coding agents persistent memory across sessions. It works with Claude Code (hooks + MCP) and OpenClaw (ContextEngine plugin + REST API), and both can share the same SQLite vault, so your CLI agent and your voice/chat agent build on the same memory without syncing anything.

The retrieval architecture is a Frankenstein, which is pretty much always my process. I pulled the best parts from recent projects and research and stitched them together: [QMD](https://github.com/tobi/qmd) for the multi-signal retrieval pipeline (BM25 + vector + RRF + query expansion + cross-encoder reranking), [SAME](https://github.com/sgx-labs/statelessagent) for composite scoring with content-type half-lives and co-activation reinforcement, [MAGMA](https://arxiv.org/abs/2501.13956) for intent classification with multi-graph traversal (semantic, temporal, and causal beam search), [A-MEM](https://arxiv.org/abs/2510.02178) for self-evolving memory notes, and [Engram](https://github.com/Gentleman-Programming/engram) for deduplication patterns and temporal navigation. None of these were designed to work together. Making them coherent was most of the work.

On the inference side, QMD's original stack uses a 300MB embedding model, a 1.1GB query expansion LLM, and a 600MB reranker. These run via llama-server on a GPU or in-process through node-llama-cpp (Metal, Vulkan, or CPU). But the more interesting path is the SOTA upgrade: ZeroEntropy's distillation-paired zembed-1 + zerank-2. These are currently the top-ranked embedding and reranking models on MTEB, and they're designed to work together. The reranker was distilled from the same teacher as the embedder, so they share a semantic space. You need ~12GB VRAM to run both, but retrieval quality is noticeably better than the default stack. There's also a cloud embedding option if you're tight on vram or prefer to offload embedding to a cloud model.

For Claude Code specifically, it hooks into lifecycle events. Context-surfacing fires on every prompt to inject relevant memory, decision-extractor and handoff-generator capture session state, and a feedback loop reinforces notes that actually get referenced. That handles about 90% of retrieval automatically. The other 10% is 28 MCP tools for explicit queries. For OpenClaw, it registers as a ContextEngine plugin with the same hook-to-lifecycle mapping, plus 5 REST API tools for the agent to call directly.

It runs on Bun with a single SQLite vault (WAL mode, FTS5 + vec0). Everything is on-device; no cloud dependency unless you opt into cloud embedding. The whole system is self-contained.

This is a polished WIP, not a finished product. I'm a solo dev. The codebase is around 19K lines and the main store module is a 4K-line god object that probably needs splitting. And of course, the system is only as good as what you index. A vault with three memory files gives deservedly thin results. One with your project docs, research notes, and decision records gives something actually useful.

Two questions I'd genuinely like input on: (1) Has anyone else tried running SOTA embedding + reranking models locally for agent memory, and is the quality difference worth the VRAM? (2) For those running multiple agent interfaces (CLI + voice/chat), how are you handling shared memory today?