This post argues that production RAG should be evaluated as set consumption, not as a user scrolling a ranked list. Classic IR metrics (nDCG / MAP / MRR) assume a human eyeball stepping through ranked positions with a monotone position discount, which doesn't match how an LLM ingests a fixed top-K context all at once.
I propose a small family of set-based metrics:
• RA-nWG@K – “How good is the actual top-K set we fed the LLM vs the global oracle on the labeled corpus?”
• PROC@K – pool-restricted oracle ceiling: “How good could we have done with this retrieval pool if selection were perfect?”
• %PROC@K – reranker/selection efficiency: “Given that ceiling, how much did our actual top-K realize?”
The goal is to cleanly separate retrieval quality from reranking headroom instead of squinting at one nDCG number.
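Roughly, in code (a minimal sketch of how I compute these, not a polished implementation): gain here is just the graded relevance label of a chunk, the global oracle is the best K labels over the whole labeled corpus, and names like rel, pool_ids, and topk_ids are illustrative placeholders for a single query's labels, the first-stage retrieval pool, and the set actually fed to the LLM.

    # Sketch of RA-nWG@K, PROC@K, %PROC@K for one query.
    # rel: dict of chunk_id -> graded relevance label (the labeled corpus for this query).
    # pool_ids: candidates surfaced by first-stage retrieval.
    # topk_ids: the K chunks actually placed in the LLM context.

    def set_gain(ids, rel):
        # Total gain of a set of chunk ids (order-free, no position discount).
        return sum(rel.get(i, 0.0) for i in ids)

    def oracle_gain(candidate_ids, rel, k):
        # Best achievable gain when picking k items from candidate_ids.
        top = sorted((rel.get(i, 0.0) for i in candidate_ids), reverse=True)[:k]
        return sum(top)

    def ra_nwg_at_k(topk_ids, rel, k):
        # Actual top-K set vs. the global oracle over the labeled corpus.
        ideal = oracle_gain(rel.keys(), rel, k)
        return set_gain(list(topk_ids)[:k], rel) / ideal if ideal > 0 else 0.0

    def proc_at_k(pool_ids, rel, k):
        # Pool-restricted oracle ceiling: perfect selection from this pool,
        # normalized by the same global oracle.
        ideal = oracle_gain(rel.keys(), rel, k)
        return oracle_gain(pool_ids, rel, k) / ideal if ideal > 0 else 0.0

    def pct_proc_at_k(topk_ids, pool_ids, rel, k):
        # Selection/reranker efficiency: fraction of the pool ceiling realized.
        ceiling = proc_at_k(pool_ids, rel, k)
        return ra_nwg_at_k(topk_ids, rel, k) / ceiling if ceiling > 0 else 0.0

    # Toy example: retrieval missed the best chunk (d1), and the reranker
    # then picked the weaker half of the pool.
    rel = {"d1": 3, "d2": 2, "d3": 1, "d4": 0}
    pool = ["d2", "d3", "d4"]
    topk = ["d3", "d4"]
    print(ra_nwg_at_k(topk, rel, 2))              # 0.2  -> overall set quality
    print(proc_at_k(pool, rel, 2))                # 0.6  -> retrieval ceiling
    print(pct_proc_at_k(topk, pool, rel, 2))      # 0.33 -> selection efficiency

Because both RA-nWG@K and PROC@K share the same global normalizer, %PROC@K cancels it and reduces to actual set gain over the pool-oracle gain, which is why it isolates reranking/selection loss from retrieval loss.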
I’m actively refining this; if you see flaws, better decompositions, or edge cases where this breaks, I’d really like to hear them.