Chasing AI Memory SOTA: Beating the Benchmark, Missing the Point

https://xmemory.ai/chasing-sota-in-ai-memory/

2•alex_petrov•1h ago

Comments

alex_petrov•1h ago

66.88%. 80.1%. 85%. 90.79%. 93%. 100%.

These are all SOTA scores on agentic memory benchmarks. None of them tell you whether the system will work in production.

The deeper problem isn't the data — it's that we often misunderstand what these numbers actually measure. In our recent white paper we open-sourced datasets that target specific memory functions. Today we published a follow-up that explains why we think the well-known agentic memory benchmarks (LoCoMo, LongMemEval) miss the mark for production systems, and what we measure instead.

We're in a field that is measuring itself against itself. The real question isn't 'are we beating last week's leaderboard?' — it's 'are we building something that makes people's work meaningfully better?' That's harder to measure. It's also the only thing that matters.

Slay the Spire 2 Review-Bombing

https://www.ign.com/articles/slay-the-spire-2-review-bombing

1•croes•42s ago•0 comments

Agent-Native Extensibility

https://www.nibzard.com/agent-native

1•nkko•1m ago•0 comments

Boris Johnson sees a "blessing" in falling British births

https://www.governance.fyi/p/boris-johnson-sees-a-blessing-in

1•bigbobbeeper•2m ago•0 comments

Show HN: Jobs – A transparent job stack where companies pay to rank

https://www.100jobs.page

1•typekev•2m ago•0 comments

Show HN: Orbit – open-source server panel in Go with security plugins

https://kenyanredwoods01.github.io/Orbit/

1•RedwoodsKenyan•3m ago•0 comments

QVAC MedPsy: Medical and Healthcare Models for Edge Device

https://huggingface.co/blog/qvac/medpsy

1•qvac•5m ago•0 comments

A job board for remote AI training jobs

https://aitrainerjobs.cv

1•calebro•6m ago•0 comments

For Every Patient Their Own Drug

https://nautil.us/for-every-patient-their-own-drug-1280433

1•lschueller•6m ago•0 comments

Apple to Pay $250M to Settle Claims It Overpromised on iPhone AI Features

https://www.law.com/corpcounsel/2026/05/07/apple-to-pay-250m-to-settle-claims-it-overpromised-on-...

1•1vuio0pswjnm7•7m ago•0 comments

I built a tool to let you export and organize your X bookmarks

https://x-archive.netlify.app/

1•xarchive•7m ago•2 comments

Deletion of government data is hurting Americans – from infant deaths to hunger

https://www.theguardian.com/us-news/ng-interactive/2026/may/07/trump-administration-deleting-data

1•robaato•10m ago•0 comments

TOML: A config file format for humans

https://toml.io/en/

2•thunderbong•11m ago•0 comments

Wraplet: TypeScript, OOP, and the DOM – in one working model

https://wraplet.dev/

1•enador•12m ago•0 comments

A Screen Addict on the Couch

https://manualdousuario.net/en/screen-addiction/

1•rpgbr•13m ago•0 comments

Show HN: CLI for Testing Raw Data Against Google Data Studio Dashboards

https://github.com/spookyuser/datastudio-cli

1•spookyuser•14m ago•0 comments

SpaceX to rent data centre capacity to Anthropic

https://www.ft.com/content/aa0239b8-0d57-4dc8-8c1a-ed7ac4d689fb

1•1vuio0pswjnm7•15m ago•0 comments

I built a WP plugin to solve the "AI Search" problem (YouTube-to-Blog and RAG)

https://www.indiehackers.com/post/i-built-a-wp-plugin-to-solve-the-ai-search-problem-youtube-to-b...

1•shahisoft•15m ago•0 comments

Big Ass Data Broker Opt-Out List

https://github.com/yaelwrites/Big-Ass-Data-Broker-Opt-Out-List

1•l1am0•15m ago•0 comments

For a Certain Kind of Guy, Even a Diaper Bag Needs to Be 'Tactical'

https://www.nytimes.com/2026/05/07/magazine/tactical-gear-marketing-boys-trend.html

1•bookofjoe•17m ago•1 comments

Show HN: Monocurl – Interactive STEM animation language and IDE

https://www.monocurl.com

1•enigmurl•18m ago•0 comments

Show HN: I made a vertical-pedalling bike with a novel drivetrain [video]

https://www.youtube.com/watch?v=4HLOsi2gWXQ

1•tonyonodi•18m ago•0 comments

Anxiety

https://www.alessiocavallo.it/articles/anxiety

2•jxad•20m ago•0 comments

Tensor Network Attention

https://mainlymatmul.com/blog/tensor-network-attention/

1•mezark•20m ago•0 comments

Span Announces XFRA, a Mini Household Data Center

https://www.span.io/blog/span-announces-xfra-a-distributed-data-center-solution-to-close-the-spee...

1•fastest963•21m ago•0 comments

An Agent Run Is Not Done When the Model Stops Talking

https://jeremyblankenship.dev/writing/agent-run-not-done

1•aliasocracy•22m ago•0 comments

Casio S100X Japanese Lacquer Edition (JP Page Only)

https://www.casio.com/jp/basic-calculators/premium/en-s100x-jc1-u/

2•dr_kiszonka•25m ago•0 comments

What an AI productivity surge would mean for the fiscal outlook

https://www.axios.com/2026/05/06/ai-productivity-yale-fiscal-outlook

1•simonpure•25m ago•0 comments

I took a 250k-mile minivan through Germany's rigorous car inspection

https://www.jalopnik.com/i-took-a-250-000-mile-minivan-through-germanys-rigorous-1845197100/

1•probably_wrong•25m ago•0 comments

EV-QA-Framework–Open-source battery QA anomaly detection and SOH prediction

https://github.com/remontsuri/EV-QA-Framework

1•remontsuri•26m ago•0 comments

An HTTP header caused time.gov to skew from UTC

https://alexsci.com/blog/how-time-gov-works/

2•birdculture•26m ago•0 comments