frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Pyversity – Fast Result Diversification for Retrieval and RAG

https://github.com/Pringled/pyversity
3•Tananon•2h ago
Hey HN! I’ve just open-sourced Pyversity, a lightweight library for diversifying retrieval results. Most retrieval systems optimize only for relevance. This often leads to top-k results that look almost identical. Pyversity efficiently re-ranks results to balance relevance and diversity, surfacing items that remain relevant but are less redundant.

Main features:

- Unified API: one function (diversify) supporting several well-known strategies: MMR, MSD, DPP, and COVER (with more to come)

- Lightweight: the only dependency is NumPy, keeping the package small and easy to install

- Fast: efficient implementations for all supported strategies; diversify results in milliseconds

Re-ranking with cross-encoders is very popular right now, but also very expensive. From my experience, you can usually improve retrieval results with simpler and faster methods, such as the ones implemented in this package. This helps retrieval, recommendation, and RAG systems present richer, more informative results by ensuring each new item adds new information.

Code and docs: github.com/pringled/pyversity

Let me know if you have any feedback, or suggestions for other diversification strategies to support!

Welcome to the 'Papers, Please' Internet

https://www.theverge.com/column/798159/age-gating-internet
1•HotGarbage•22s ago•0 comments

20 bird species can understand each other's anti-cuckoo call

https://www.newscientist.com/article/2498809-20-bird-species-can-understand-each-others-anti-cuck...
1•bookofjoe•4m ago•1 comments

ShadowMail Secure Temporary Emails

https://shadowmail.win
1•andrexdev•4m ago•1 comments

Show HN: Syncweb: Towards an 'offline-first' distributed web

https://github.com/chapmanjacobd/syncweb-py
1•xk3•4m ago•0 comments

Enigma archive is now complete

https://enigmaticcode.wordpress.com/2025/10/08/enigma-archive-is-now-complete/
1•gnabgib•5m ago•0 comments

The Morality of Modeling

https://isaacbound.substack.com/p/the-morality-of-modeling
1•thegrimmest•5m ago•0 comments

A sudo shim that uses run0 internally

https://github.com/LordGrimmauld/run0-sudo-shim
1•RGBCube•8m ago•1 comments

Functional guarantees for semantic awareness on graphs

https://www.researchgate.net/publication/396442019_Semantic_Awareness_and_Depthness_A_Synthetic_G...
1•HenryAI•11m ago•1 comments

GitHub Copilot: Remote Code Execution via Prompt Injection (CVE-2025-53773)

https://embracethered.com/blog/posts/2025/github-copilot-remote-code-execution-via-prompt-injection/
6•kerng•13m ago•0 comments

Hiring in the Age of AI

https://morningcoffee.io/hiring-in-the-age-of-ai.html
2•shiroyasha•13m ago•0 comments

Show HN: Poddown, a simple, granular podcast episode downloader

https://github.com/bittere/poddown
1•_bittere•14m ago•0 comments

What's Behind the Mysterious Ancient Wall in the Gobi Desert?

https://news.artnet.com/art-world/the-hunt-gobi-wall-mongolia-2674588
1•derbOac•16m ago•0 comments

Roles and Intelligence for Individual Contributors

https://raees.me/blog/role-and-intelligence/
1•diptanu•16m ago•0 comments

In 1776, Thomas Paine made the best case for fighting kings −and being skeptical

https://theconversation.com/in-1776-thomas-paine-made-the-best-case-for-fighting-kings-and-for-be...
5•rntn•17m ago•0 comments

Perseverance, thy name is Bette

https://www.uspto.gov/learning-and-resources/journeys-innovation/historical-stories/perseverance-...
2•yzydserd•17m ago•0 comments

Decoding Without Pictures

https://hollisrobbinsanecdotal.substack.com/p/decoding-without-pictures
1•paulpauper•22m ago•0 comments

#1. Hello from Berlin. and 43 Years of German Failure

https://gersemann.substack.com/p/1-hello-from-berlin-and-43-years
3•paulpauper•23m ago•1 comments

Will more lending be freed up by relaxing bank capital rules?

https://www.ft.com/content/8c69189c-7594-4612-bf29-0847ed995d98
1•paulpauper•23m ago•0 comments

Machine Learning Attack Series: Image Scaling Attacks (2020)

https://embracethered.com/blog/posts/2020/husky-ai-image-rescaling-attacks/
3•kerng•24m ago•0 comments

Sam 3 name claimed by 2026 ICLR submission: Segment Anything with Concepts

https://openreview.net/forum?id=r35clVtGzw
2•sch-sara•24m ago•0 comments

Month of AI Bugs (August 2025)

https://monthofaibugs.com/
3•kerng•26m ago•0 comments

Gemini Enterprise

https://cloud.google.com/blog/products/ai-machine-learning/introducing-gemini-enterprise
1•smoser•27m ago•0 comments

I'm making a Chrome extension a day for a month

1•jasonlernerman•30m ago•0 comments

A data-rich look at New York's battle against rats

https://economist.com/united-states/2025/10/09/a-data-rich-look-at-new-yorks-battle-against-rats
1•hheikinh•32m ago•0 comments

Earning the Right to Be Illegible

https://www.joshbeckman.org/blog/practicing/earning-the-right-to-be-illegible
2•bckmn•33m ago•0 comments

Show HN: A Dead Simple Parser

3•pankaj9296•37m ago•2 comments

Faster Target Quality Image Compression

https://giannirosato.com/blog/post/oavif/
2•computerbuster•37m ago•0 comments

Wow Mind Blowing Technology

https://wealth-ai.in/
2•WoWSaaS•38m ago•1 comments

ChatGPT prompts and a lighter led investigators to a suspectedarsonist

https://www.cnn.com/2025/10/11/us/palisades-fire-suspect-jonathan-rinderknecht
2•andy99•38m ago•0 comments

Intel gives first look at new chips: Panther Lake, Clearwater Forest

https://www.cnbc.com/2025/10/09/intel-chips-panther-lake-clearwater-forest.html
1•gmays•39m ago•1 comments