So everyone ends up “engineering context”: manually deciding what to stuff into prompts with RAG pipelines, agentic search, or trees of thought. These tricks work for small demos but break down at scale. That’s a big part of why MIT found that 95% of AI pilots fail, and why threads about vector search falling over keep showing up.
We built a different approach: a retrieval model that predicts the right context for every turn in a conversation. On Stanford’s STaRK benchmark it ranks #1. It’s also fast enough for voice chat, where even 100ms of lag kills the experience.
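To give a sense of the shape of this, here’s a minimal sketch of calling a retrieval endpoint on every conversation turn. Everything below (the endpoint URL, field names, and the retrieve_context helper) is a hypothetical illustration, not our actual API:

    import requests  # any HTTP client works; this is just for illustration

    API_URL = "https://api.papr.example/v1/retrieve"  # hypothetical endpoint
    API_KEY = "YOUR_API_KEY"

    def retrieve_context(messages, top_k=5):
        """Fetch the memories predicted to matter for the latest turn."""
        resp = requests.post(
            API_URL,
            headers={"Authorization": f"Bearer {API_KEY}"},
            json={"messages": messages, "top_k": top_k},
            timeout=0.1,  # voice leaves roughly a 100 ms budget for retrieval
        )
        resp.raise_for_status()
        return resp.json()["memories"]

    # On every turn: retrieve first, then prepend the memories to the LLM prompt.
    history = [{"role": "user", "content": "What did we decide on pricing?"}]
    memories = retrieve_context(history)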
We also introduced a new metric: retrieval loss. It’s like language-model loss, but for retrieval quality. In traditional systems, retrieval gets worse as the dataset grows. With Papr, retrieval loss drops as your dataset grows: more knowledge makes your system smarter, not dumber.
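For intuition, one simple way to instantiate a retrieval loss is the cross-entropy of the retriever’s softmax scores at the gold context. This is a sketch under that assumption; our exact definition is in the post linked below:

    import math

    def retrieval_loss(scores, gold_index):
        """Cross-entropy of the retriever's softmax over candidate
        contexts, evaluated at the gold (correct) one. Lower is better."""
        m = max(scores)                          # for numerical stability
        exps = [math.exp(s - m) for s in scores]
        p_gold = exps[gold_index] / sum(exps)
        return -math.log(p_gold)

    # One query, three candidate contexts; the gold one scores highest,
    # so the loss is small.
    print(retrieval_loss([4.1, 1.3, 0.2], gold_index=0))  # ~0.08

Averaged over a held-out query set as the corpus grows, a curve of this quantity is what makes the “smarter, not dumber” claim measurable.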
Our memory APIs are available to try out with a generous free tier. We’d love feedback, questions, and brutal critique. Full details here: https://substack.com/home/post/p-172573217