Show HN: Change the model. Same output. The pipeline decides. VAC Memory System

1•ViktorKuz•1h ago
I’ve been experimenting with long-term memory architectures for agent systems and wanted to share some technical results that might be useful to others working on retrieval pipelines.

Benchmark: LoCoMo (10 runs × 10 conversation sets)

Average accuracy: 80.1%

Setup: full isolation across all 10 conv groups (no cross-contamination, no shared memory between runs)

Architecture (all open weights except answer generation)

1. Dense retrieval

BGE-large-en-v1.5 (1024d)

FAISS IndexFlatIP

Standard BGE instruction prompt: “Represent this sentence for searching relevant passages.”
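
As a rough illustration of the dense stage, the sketch below prepends the instruction to queries only and runs an exhaustive inner-product search over normalized vectors, which is the idea behind FAISS IndexFlatIP. The `make_vocab_embedder` helper and the `FlatIPIndex` class are hypothetical stand-ins for BGE-large-en-v1.5 and FAISS, and the instruction string is copied from the post; check the model card for the exact punctuation.

```python
import math

# Instruction as quoted in the post; applied to queries, not to passages.
QUERY_INSTRUCTION = "Represent this sentence for searching relevant passages."


def normalize(vec):
    """L2-normalize so that inner product equals cosine similarity."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec] if norm else list(vec)


def make_vocab_embedder(vocab):
    """Hypothetical toy encoder: one dimension per vocabulary word."""
    index = {word: i for i, word in enumerate(vocab)}

    def embed(text):
        vec = [0.0] * len(index)
        for tok in text.lower().split():
            if tok in index:
                vec[index[tok]] += 1.0
        return normalize(vec)

    return embed


class FlatIPIndex:
    """Exhaustive inner-product search (stand-in for FAISS IndexFlatIP)."""

    def __init__(self):
        self.vectors = []

    def add(self, vec):
        self.vectors.append(vec)

    def search(self, query_vec, k):
        scored = [
            (sum(q * d for q, d in zip(query_vec, vec)), doc_id)
            for doc_id, vec in enumerate(self.vectors)
        ]
        scored.sort(key=lambda s: (-s[0], s[1]))
        return scored[:k]


def dense_search(query, passages, embed, k=3):
    index = FlatIPIndex()
    for passage in passages:
        index.add(embed(passage))  # passages are embedded without the instruction
    query_vec = embed(QUERY_INSTRUCTION + " " + query)  # queries get the instruction
    return [(doc_id, score) for score, doc_id in index.search(query_vec, k)]
```

With a real encoder the instruction changes the query embedding itself; the toy encoder here simply ignores words outside its vocabulary.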

2. Sparse retrieval

BM25 via classic inverted index

Helps with low-embedding-recall queries and keyword-heavy prompts
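
A minimal BM25 scorer over an inverted index might look like the following. This is a sketch, not the repository's code: the whitespace tokenizer and the `k1` and `b` values are assumptions (commonly used defaults), and the class name `BM25Index` is made up for illustration.

```python
import math
from collections import Counter, defaultdict


class BM25Index:
    def __init__(self, docs, k1=1.5, b=0.75):
        self.k1, self.b = k1, b
        self.doc_len = []
        self.postings = defaultdict(dict)  # term -> {doc_id: term frequency}
        for doc_id, text in enumerate(docs):
            tokens = text.lower().split()
            self.doc_len.append(len(tokens))
            for term, tf in Counter(tokens).items():
                self.postings[term][doc_id] = tf
        self.n_docs = len(docs)
        self.avg_len = sum(self.doc_len) / self.n_docs if self.n_docs else 0.0

    def search(self, query, k=10):
        scores = defaultdict(float)
        for term in set(query.lower().split()):
            posting = self.postings.get(term)
            if not posting:
                continue  # term never seen in the collection
            df = len(posting)
            idf = math.log(1 + (self.n_docs - df + 0.5) / (df + 0.5))
            for doc_id, tf in posting.items():
                # Length normalization: long documents are penalized.
                norm = 1 - self.b + self.b * self.doc_len[doc_id] / self.avg_len
                scores[doc_id] += idf * tf * (self.k1 + 1) / (tf + self.k1 * norm)
        return sorted(scores.items(), key=lambda s: (-s[1], s[0]))[:k]
```

Only documents that share at least one term with the query receive a score, which is why this stage is good at keyword-heavy questions that dense embeddings can miss.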

3. MCA (Multi-Component Aggregation) ranking

A simple gravitational-style score combining:

keyword coverage

token importance

local frequency signal

MCA acts as a first-pass filter to catch exact-match questions.

Threshold: coverage ≥ 0.1 → keep top-30
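
The post does not give the MCA formula, so the sketch below is only a guess at its general shape: it computes the three named signals and applies the stated threshold (coverage ≥ 0.1, keep top-30). The way the signals are combined (a plain product) and the IDF-like importance weight are assumptions, not the author's gravitational-style score, and `mca_filter` is a hypothetical name.

```python
import math
from collections import Counter


def mca_filter(query, docs, min_coverage=0.1, top_n=30):
    q_terms = set(query.lower().split())
    doc_tokens = [doc.lower().split() for doc in docs]

    # Token importance (assumed): rarer terms in the collection weigh more.
    df = Counter(tok for toks in doc_tokens for tok in set(toks))
    importance = {t: math.log(1 + len(docs) / df[t]) for t in q_terms if df[t]}

    kept = []
    for doc_id, toks in enumerate(doc_tokens):
        tf = Counter(toks)
        matched = [t for t in q_terms if tf[t]]
        # Keyword coverage: share of query terms present in the document.
        coverage = len(matched) / len(q_terms) if q_terms else 0.0
        if coverage < min_coverage:
            continue
        weight = sum(importance[t] for t in matched)
        # Local frequency signal: damped in-document term counts.
        local = sum(math.log(1 + tf[t]) for t in matched)
        kept.append((doc_id, coverage * weight * local))

    kept.sort(key=lambda s: (-s[1], s[0]))
    return kept[:top_n]
```

Because the filter depends on literal term overlap, it favors exact-match questions, which matches the role the post describes for this stage.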

4. Union strategy

Instead of aggressively reducing the union, the system feeds 112–135 documents directly to a re-ranker. In practice this improved stability and prevented loss of rare but crucial documents.
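
One way to picture the union step is an order-preserving, de-duplicated merge of the candidate lists from each retriever, with no truncation before re-ranking. The function name `union_candidates` and the use of integer document ids are assumptions made for this sketch.

```python
def union_candidates(*ranked_lists):
    """Merge candidate id lists, dropping duplicates but keeping every unique id."""
    seen = set()
    merged = []
    for ranked in ranked_lists:
        for doc_id in ranked:
            if doc_id not in seen:
                seen.add(doc_id)
                merged.append(doc_id)
    return merged


# Example: ids from the MCA, dense, and sparse stages (made-up values).
candidates = union_candidates([3, 1, 2], [2, 5], [1, 7])
```

The point of not trimming here is that a document ranked low by every first-stage retriever still reaches the re-ranker.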

5. Cross-Encoder reranking

bge-reranker-v2-m3

Processes the full union (rare for RAG pipelines, but worked best here)

Produces a final top-k used for answer generation
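
The re-ranking step can be sketched as scoring every (query, passage) pair in the union and keeping the best `top_k`. The scorer is passed in as a function so the example runs without model weights; in a real setup it could be something like `CrossEncoder("BAAI/bge-reranker-v2-m3").predict` from sentence-transformers, which is an assumption about tooling rather than something the post states. The `overlap_scorer` toy and the `top_k` default are hypothetical.

```python
def rerank(query, candidates, score_pairs, top_k=10):
    """candidates: list of (doc_id, text). Returns the top_k by pair score."""
    pairs = [(query, text) for _, text in candidates]
    scores = score_pairs(pairs)  # one relevance score per (query, passage) pair
    ranked = sorted(zip(candidates, scores), key=lambda item: -item[1])
    return [(doc_id, text, score) for (doc_id, text), score in ranked[:top_k]]


def overlap_scorer(pairs):
    """Toy stand-in for a cross-encoder: fraction of query words in the passage."""
    scores = []
    for query, passage in pairs:
        q_words = set(query.lower().split())
        p_words = set(passage.lower().split())
        scores.append(len(q_words & p_words) / len(q_words) if q_words else 0.0)
    return scores
```

Unlike the first-stage retrievers, a cross-encoder reads the query and passage together, so its cost grows with the size of the union it is given.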

6. Answer generation

GPT-4o-mini, used only for the final synthesis step

No agent chain, no tool calls, no memory-dependent LLM logic

Performance

<3 seconds per query on a single RTX 4090

Deterministic output between runs

Reproducible test harness (10×10 protocol)
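
A harness in the spirit of the 10×10 protocol could look like the sketch below: memory is rebuilt from scratch for each conversation so nothing leaks between groups, and accuracy is averaged over runs. The data layout (`turns`, `qa`, `question`, `answer`), the callback names, and the choice to average per-question accuracy are all assumptions, since the post does not describe the harness internals.

```python
from statistics import mean


def evaluate(conversations, build_memory, answer, is_correct, runs=10):
    per_run = []
    for _ in range(runs):
        correct = total = 0
        for conv in conversations:
            # Fresh memory per conversation: no shared state across groups or runs.
            memory = build_memory(conv["turns"])
            for qa in conv["qa"]:
                prediction = answer(memory, qa["question"])
                correct += int(is_correct(prediction, qa["answer"]))
                total += 1
        per_run.append(correct / total if total else 0.0)
    return mean(per_run), per_run
```

If the pipeline is deterministic, every entry in `per_run` should be identical, which gives a cheap check on the reproducibility claim.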

Why this worked

Three things seemed to matter most:

MCA-first filter to stabilize early recall

Not discarding the union before re-ranking

Proper dense embedding instruction, which massively affects BGE performance

Notes

LoCoMo remains one of the hardest public memory benchmarks: 5,880 multi-hop, temporal, negation-rich QA pairs derived from human–agent conversations.

I would be interested in comparing notes with others working on long-term retrieval, especially multi-stage ranking or cross-encoder-heavy pipelines.

GitHub: https://github.com/vac-architector/VAC-Memory-System
