frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

PostGIS-compatible spatial functions for ClickHouse

https://github.com/bacek/chgeos/
1•zX41ZdbW•35s ago•0 comments

Pipe to Bash

https://gaganpreet.in/posts/pipe-to-bash/
1•speckx•1m ago•0 comments

Prediction Market VC Fund Backed by Kalshi and Polymarket Founders

https://www.5cc.capital
1•salkahfi•1m ago•0 comments

Assignable Research bot for Linear and Slack tasks

https://github.com/serpapi/serpapi-research-bot
1•jamescollinssp•1m ago•0 comments

US govt pays TotalEnergies nearly $1B to stop US offshore wind projects

https://www.lemonde.fr/en/international/article/2026/03/23/us-and-totalenergies-reach-nearly-1-bi...
1•lode•2m ago•0 comments

Make OpenClaw your company's training program for agentic AI

https://www.robert-glaser.de/you-banned-openclaw-now-make-it-your-training-program/
1•youngbrioche•2m ago•0 comments

LLM-Native Advertising (What Ads in GenAI Will Look Like)

https://adgpt.rlafuente.com/
1•andes314•2m ago•0 comments

Selling Stuff

https://ftrain.com/selling-stuff
1•FigurativeVoid•4m ago•0 comments

Microsoft Copilot Is Confronting Its Identity Crisis

https://www.bloomberg.com/news/newsletters/2026-03-23/microsoft-msft-ai-copilot-confronts-its-ide...
2•Brajeshwar•4m ago•0 comments

What Does a Hologram Trademark Signify When the Hologram Isn't There?

https://blog.ericgoldman.org/archives/2026/03/what-does-a-hologram-trademark-signify-when-the-hol...
1•hn_acker•7m ago•1 comments

Microsoft's "Fix" for Windows 11: Flowers After the Beating

https://www.sambent.com/microsofts-plan-to-fix-windows-11-is-gaslighting/
2•speckx•8m ago•0 comments

Migrating Snapchat's AB Pipelines to GPU-Accelerated Spark

https://eng.snap.com/snap-nvidia-gcp
1•Kaedon•8m ago•0 comments

We Built an AI Memory System That Learns

https://getcoherence.io/blog/how-we-built-an-ai-memory-system-that-actually-learns-55bcdf82
2•keithfawcett•8m ago•0 comments

Gen 13: how we built our most powerful server yet

https://blog.cloudflare.com/gen13-config/
1•NicoJuicy•9m ago•0 comments

ChatGPT and the Meaning of Life: Guest Post by Harvey Lederman

https://scottaaronson.blog/?p=9030
1•gwintrob•11m ago•0 comments

Free Multilingual Dictionaries

https://yap.town/d/
1•ChadNauseam•12m ago•1 comments

Ancient machine gun was used by Romans to attack Pompeii

https://www.telegraph.co.uk/world-news/2026/03/22/ancient-machine-gun-was-used-by-romans-to-attac...
2•Stratoscope•13m ago•0 comments

A Ramsey-Style Problem on Hypergraphs

https://epoch.ai/frontiermath/open-problems/ramsey-hypergraphs
1•yusufozkan•13m ago•0 comments

Canvas Unrolls AI Teaching Agent

https://www.insidehighered.com/news/tech-innovation/artificial-intelligence/2026/03/23/canvas-unr...
1•speckx•14m ago•0 comments

The Magnet Suspension Skateboard

https://www.youtube.com/watch?v=yzXZ7cZXifo
1•mhb•14m ago•0 comments

macOS app to copy LaTeX renders/text/QR codes from screenshot automatically

https://github.com/Blobosle/screen-copy/
1•blobosle•16m ago•0 comments

Show HN: Zoom Auto-Joiner

https://github.com/PiotrMackowski/auto-joiner
1•ptrmc•18m ago•1 comments

Neo Store: The modern and feature-rich F-Droid client for everyone

https://github.com/NeoApplications/Neo-Store
1•pretext•20m ago•0 comments

Simply looking up inspires scientific exploration

https://bigthink.com/starts-with-a-bang/why-we-look-up/
1•Brajeshwar•22m ago•0 comments

XMind MCP Server – Incremental mind map editing for LLMs

https://github.com/sc0tfree/xmind-mcp
1•sc0tfree•24m ago•1 comments

Intel Core Ultra 200S Plus Content Creation Review

https://www.pugetsystems.com/labs/articles/intel-core-ultra-200s-plus-content-creation-review/
1•zdw•24m ago•0 comments

Philips to drop Google TV for European-based Titan OS

https://9to5google.com/2026/03/23/google-tv-just-lost-a-big-tv-brand-to-web-app-based-titan-os/
4•pentagrama•24m ago•3 comments

Unix philosophy is dead Long live something else?

https://sdomi.pl/weblog/27-manifesto-of-a-burnt-out-hacker/
1•caminanteblanco•25m ago•0 comments

A single Keycloak commit broke our p99 latency

https://old.reddit.com/r/KeyCloak/comments/1rto63s/how_a_single_keycloak_commit_broke_our_p99/
1•mooreds•28m ago•0 comments

10MB Go Alternative to OpenClaw (Full Clawhub Skills)

https://github.com/General-Specialist/capabot
2•gen_specialist•29m ago•1 comments
Open in hackernews

LoCoMo AI Benchmark: 6.4% of answer key wrong, judge accepts 63% of fake answers

https://github.com/dial481/locomo-audit
2•dial481•1h ago

Comments

dial481•1h ago
We audited the LoCoMo benchmark (one of the most cited eval for LLM agent memory) and found 99 score-corrupting errors in 1,540 questions (6.4%). Separately, we tested the LLM judge with adversarially generated wrong answers, it accepted 62.81% of vague-but-topical wrong answers. Some published system scores barely clear that bar. Full audit with methodology, all 99 errors documented, and reproducible scripts.
PaulHoule•1h ago
I've worked in IR and this has been true about TREC data sets from the beginning and it has also been true about visual data sets. The first step to build a world beating commercial system has been to clean up the garbage in open evals to raise the possible accuracy ceiling.