frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

O(1) Context Retrieval for Agents Using Weightless Neural Networks

https://tryrice.com
7•aperi•1d ago

Comments

aperi•1d ago
Hi HN, I am Anil and I am building Rice (https://tryrice.com), a low latency context orchestration layer for AI agents.

Rice replaces the standard HNSW vector search with Weightless Neural Networks (WNNs) to enable O(1) retrieval speeds, specifically designed for realtime voice agents and high-frequency multi agent workflows.

The problem we ran into while building voice agents was simple: Latency kills immersion.

Between STT (Speech-to-Text), the LLM inference, and TTS (Text-to-Speech), we had a strict latency budget. Spending 200ms+ on a Vector DB lookup (plus reranking) was eating up too much of that budget. On top of that, we found that stateless RAG meant our agents were constantly hallucinating permissions and accessing data they shouldn't, or failing to remember a constraint set by another agent 10 seconds ago.

The industry standard is to throw everything into Pinecone or pgvector and handle the logic in the application layer. That works for chatbots, but for autonomous agents that need mutable memory (read/write state 50 times a minute), standard vector indexes are too heavy and slow to update.

Rice is our attempt to fix the Working Memory problem.

Under the hood:

Rice is an indexing and state management engine that sits between your LLM and your data. Instead of using HNSW graphs (which are O(log N)), we rely on Weightless Neural Networks (similar to WiSARD architectures).

- Deep Semantic Hashing: We train a lightweight model to compress dense embeddings into sparse binary codes while preserving semantic relationships. - O(1) Lookup: These binary codes are mapped directly to memory addresses. This effectively turns "Search" into a hash table lookup.

The Result: Retrieval latency stays flat (<50ms) even as your context grows to millions of items, and updates to the memory state are instant (no reindexing penalty).

We wrap this WNN core in a State Machine that handles Access Control (ACLs). When an Agent requests context, Rice checks the identity and state before the retrieval, ensuring you don't leak data between users or agents. Think of it as "Supabase for Agent Context", a managed backend that handles the memory graph and security policies so you don't have to write raw SQL RLS queries for every RAG call.

Where we are now

Rice is currently in closed beta/alpha. We are working with a few design partners in the voice and support automation space who need that sub 100ms retrieval speed.

We know using WNNs for semantic search is a contrarian bet compared to the massive investment in Vector DBs. We are specifically optimizing for "Hot State" (short term, high velocity memory) rather than "Cold Storage" (archival knowledge), though the lines are blurring.

Use Cases we are seeing: - Voice Agents: Shaving 200ms off RAG latency to make conversation feel natural. - Multi-Agent Hand-offs: Agent A (Sales) updates a "Customer Mood" state, and Agent B (Support) sees it instantly without hallucinating. - Internal Tools: Enforcing strict ACLs (e.g., "Junior Devs can't query the Salary Table") at the infrastructure layer.

We are looking for engineers who are pushing the limits of agent latency or struggling with state management to try it out and tell us where it breaks.

I’m especially interested in hearing your skepticism on the WNN approach - we know it’s weird, but for our specific constraints, the speed tradeoff has been worth it.

ob_mobly•1d ago
Interesting take on the matter. Joined the waitlist, would like to see it in action.

I Like My F# Code Type Annotation-Free

https://www.planetgeek.ch/2025/12/10/i-like-my-f-code-type-annotation-free/
1•Kerrick•20s ago•0 comments

Anytime Algorithm

https://en.wikipedia.org/wiki/Anytime_algorithm
1•raw_anon_1111•3m ago•0 comments

TypeSlayer – a TypeScript types performance tool [video]

https://www.youtube.com/watch?v=IP6EZXzXBzY
2•wildpeaks•8m ago•0 comments

I build a live crypto-sentiment analyzer

https://risingwave.com/blog/risingwave-python-udf-tutorial/
1•WavyPeng•8m ago•0 comments

Pebble Index

https://repebble.com/index
1•mcyc•12m ago•0 comments

Neuroscientist Doris Tsao joins Astera to lead its new neuroscience program

https://astera.org/neuroscientist-doris-tsao-joins-astera-to-lead-its-new-neuroscience-program/
1•memming•23m ago•0 comments

Parachutists told to check software after jumper dangled from a plane

https://www.theregister.com/2025/12/11/atsb_parachute_snagged_software/
2•defrost•26m ago•0 comments

Tool for analyzing GitLab SOS bundles without Elasticsearch

https://gitlab.com/gitlab-com/support/toolbox/soslab
1•s_shaik•26m ago•1 comments

A Letter from My Grandfather

https://lorn.us/posts/a-letter-from-my-grandfather/
2•atropoles•31m ago•0 comments

A Friendly Guide to Exorcising Maxwell's Demon (Paper)

https://journals.aps.org/prxquantum/abstract/10.1103/phkv-wrsd
1•mrcgnc•36m ago•0 comments

The Component Gallery

https://component.gallery/
1•handfuloflight•40m ago•0 comments

Fish Alpinism

https://triapul.cz/_/1765291397
1•todsacerdoti•43m ago•0 comments

Weird Generalization and Inductive Backdoors: New Ways to Corrupt LLMs

https://arxiv.org/abs/2512.09742
1•bearseascape•49m ago•0 comments

Slovenia gives cash constitutional protection

https://sloveniatimes.com/45857/slovenia-gives-cash-constitutional-protection
2•walterbell•49m ago•0 comments

China's AI Power Play: Cheap Electricity from Biggest Grid

https://www.wsj.com/tech/china-ai-electricity-data-centers-d2a86935
2•perihelions•50m ago•0 comments

Portals must bend gravity [video]

https://www.youtube.com/watch?v=DydIhwLrbMk
1•chii•51m ago•0 comments

GLM-4.6V: Open-Source Multimodal Models with Native Tool Use

https://z.ai/blog/glm-4.6v
2•gmays•56m ago•0 comments

Ask HN: Why are people using Claude or ChatGPT when Gemini is free?

3•muunbo•1h ago•1 comments

Trump launches $1M 'gold card' immigration visas

https://www.bbc.com/news/articles/cj4q1lddj8go
5•e2e4•1h ago•0 comments

Is it possible to fix the "Power Law" problem in user-generated content?

https://ideavo.tripivo.co.in
2•ideavo•1h ago•1 comments

Are there Proton Drive alternatives with true client-only key handling?

2•hasanur_m•1h ago•1 comments

OpenAI (2015)

https://openai.com/index/introducing-openai/
2•vinhnx•1h ago•0 comments

Shapes Inc founders committed the cardinal sin of mass emailing by CCing

https://twitter.com/Zencep_NA/status/1998965773126218184
1•matthewsh•1h ago•1 comments

The Wild West tale of the first cow-buffalo hybrid

https://www.popsci.com/science/cow-buffalo-hybrid-history/
1•gmays•1h ago•0 comments

A list of parks around the world that are perfect to sit down and enjoy a book

https://www.placestoread.xyz/
2•animal_spirits•1h ago•0 comments

Instagram gives users control of their algorithms in new feature

https://abcnews.go.com/GMA/Living/instagram-users-control-algorithms-new-feature/story?id=128252102
2•SilverElfin•1h ago•0 comments

Oil Tanker U.S. Seized Has Faked Its Location Before, Data Shows

https://www.nytimes.com/2025/12/10/us/politics/oil-tanker-venezuela-tracking-data.html
8•jbegley•1h ago•2 comments

High Performance SSH/SCP

https://www.psc.edu/hpn-ssh-home/
2•gslin•1h ago•0 comments

Show HN: DocLet – End-to-end encrypted storage with user-owned key branches

https://doclet.app/
1•hasanur_m•1h ago•0 comments

Incomplete list of mistakes in the design of CSS

https://wiki.csswg.org/ideas/mistakes
35•OuterVale•1h ago•10 comments