frontpage.

Beyond Semantic Similarity

https://arxiv.org/abs/2605.05242
16•44za12•1h ago

Comments

kgeist•2m ago
When I implemented retrieval in our production system a few months ago, one of the most important benchmarks was cross-language retrieval (query in one language, documents in another), a common situation in large enterprises (headquarters + branches). I suspect their idea will perform poorly when the source and target languages are too different from one another, like English and Hindi (grep will often return nothing).

Another requirement was keeping latency as low as possible (I managed to get under 5 seconds with 85%+ accuracy). Their approach seems to have very unpredictable latencies, sometimes up to thousands of seconds (which may be fine for background tasks), and it scales poorly with corpus size.

Interesting research anyway, but I'd still stick with embedding/reranker-based retrieval: you do not waste time wandering around blindly on every query, trying to find the minimal context to start from, when an index could have found it immediately. Another issue is that research papers often implement subpar baselines for the approaches they compare against. When I was implementing retrieval, the straightforward implementation gave me 40% accuracy, and various tricks and parameter tuning pushed it to 85%+ without changing the overall architecture (it took about a month of non-stop experimentation).
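
To illustrate the cross-language point above, here is a minimal sketch of grep-style lexical matching. It is not the paper's method or the commenter's system; the function name `lexical_hits` and both sample sentences are made up for illustration (the Hindi sentence is an approximate translation of the English one). It only shows why plain term matching finds nothing when the query and the document are written in different scripts.

```python
import re


def lexical_hits(query, documents):
    """Return indices of documents that contain at least one query term.

    Hypothetical helper: a grep-like substring match, case-insensitive.
    """
    terms = re.findall(r"\w+", query.lower())
    hits = []
    for index, document in enumerate(documents):
        text = document.lower()
        if any(term in text for term in terms):
            hits.append(index)
    return hits


# Illustrative sample data (assumed, not taken from the paper or the comment).
docs_en = ["The annual report is due on Friday."]
docs_hi = ["वार्षिक रिपोर्ट शुक्रवार को देनी है।"]

print(lexical_hits("annual report", docs_en))  # [0]
print(lexical_hits("annual report", docs_hi))  # []
```

An embedding-based retriever can, in principle, still match the second case if its model is multilingual, which is the trade-off the comment describes.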

Googlebook

https://googlebook.google/
420•tambourine_man•3h ago•670 comments

How to make your text look futuristic (2016)

https://typesetinthefuture.com/2016/02/18/futuristic/
70•_vaporwave_•1h ago•8 comments

CERT is releasing six CVEs for serious security vulnerabilities in dnsmasq

https://lists.thekelleys.org.uk/pipermail/dnsmasq-discuss/2026q2/018471.html
155•chizhik-pyzhik•3h ago•54 comments

Show HN: Needle: We Distilled Gemini Tool Calling into a 26M Model

https://github.com/cactus-compute/needle
156•HenryNdubuaku•3h ago•49 comments

Why senior developers fail to communicate their expertise

https://www.nair.sh/guides-and-opinions/communicating-your-expertise/why-senior-developers-fail-t...
249•nilirl•6h ago•129 comments

The Future of Obsidian Plugins

https://obsidian.md/blog/future-of-plugins/
232•xz18r•5h ago•91 comments

Rendering the Sky, Sunsets, and Planets

https://blog.maximeheckel.com/posts/on-rendering-the-sky-sunsets-and-planets/
356•ibobev•8h ago•34 comments

Quack: The DuckDB Client-Server Protocol

https://duckdb.org/2026/05/12/quack-remote-protocol
92•aduffy•3h ago•21 comments

Reimagining the mouse pointer for the AI era

https://deepmind.google/blog/ai-pointer/
78•devhouse•3h ago•64 comments

Learning Software Architecture

https://matklad.github.io/2026/05/12/software-architecture.html
485•surprisetalk•12h ago•97 comments

Dead.Letter (CVE-2026-45185) – How XBOW found an unauthenticated RCE on Exim

https://xbow.com/blog/dead-letter-cve-2026-45185-xbow-found-rce-exim
46•fedek_•3h ago•16 comments

Launch HN: Voker (YC S24) – Analytics for AI Agents

https://voker.ai
32•ttpost•5h ago•15 comments

Bambu Lab is abusing the open source social contract

https://www.jeffgeerling.com/blog/2026/bambu-lab-abusing-open-source-social-contract/
933•rubenbe•6h ago•325 comments

Is this why science advances one funeral at a time?

https://nautil.us/is-this-why-science-advances-one-funeral-at-a-time-1280650
7•Brajeshwar•4h ago•6 comments

Show HN: Statewright – Visual state machines that make AI agents reliable

https://github.com/statewright/statewright
39•azurewraith•7h ago•11 comments

Show HN: Agentic interface for mainframes and COBOL

https://www.hypercubic.ai/hopper
40•sai18•4h ago•16 comments

When life gives you lemons, write better error messages

https://wix-ux.com/when-life-gives-you-lemons-write-better-error-messages-46c5223e1a2f
82•luispa•4d ago•24 comments

Screenshots of Old Desktop OSes

http://www.typewritten.org/Media/
612•adunk•16h ago•322 comments

A Preview of the Future

https://unsung.aresluna.org/a-preview-of-the-future/
12•zdw•1d ago•1 comment

Canada’s Bill C-22 Is a Repackaged Version of Last Year’s Surveillance Nightmare

https://www.eff.org/deeplinks/2026/05/canadas-bill-c-22-repackaged-version-last-years-surveillanc...
158•Brajeshwar•3h ago•50 comments

We accidentally recreated old Facebook

https://amrshawky.com/posts/we-accidentally-recreated-fb/
28•amr_shawky•2d ago•21 comments

Testing UPS Output Waveforms

https://www.lttlabs.com/articles/2026/05/12/ups-exploration
43•LabsLucas•4h ago•39 comments

Show HN: Gigacatalyst – Extend your SaaS with an embedded AI builder

30•namanyayg•5h ago•8 comments

Text Blaze (YC W21) Is Hiring for a No-AI Summer Internship

https://www.ycombinator.com/companies/text-blaze/jobs/P4CCN62-the-blaze-no-ai-summer-internship
1•scottfr•9h ago

Instructure pays ransom to Canvas hackers

https://www.insidehighered.com/news/tech-innovation/administrative-tech/2026/05/11/instructure-pa...
188•Cider9986•18h ago•183 comments

The Real Story of Troy

https://storica.club/blog/troy-was-real/
42•cemsakarya•2d ago•18 comments

They Live (1988) inspired Adblocker

https://github.com/davmlaw/they_live_adblocker
525•tokenburner•20h ago•176 comments

SQL: Incorrect by Construction

https://chreke.com/posts/sql-incorrect-by-construction
20•ingve•3h ago•17 comments

The Surprisingly Long Life of the Vacuum Tube

https://www.construction-physics.com/p/the-surprisingly-long-life-of-the
53•surprisetalk•1d ago•35 comments