frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Lessons from building search for vague, human queries

1•jeffmanu•1h ago
I’ve been building a search system for long form content where the goal isn’t “find the right document,” but more precision.

On paper, it looked straightforward: embeddings, a vector DB, some metadata filters. In reality, the hardest problems weren’t model quality or infrastructure, but how the system behaves when users are vague, data is messy, and most constraints are inferred rather than explicitly stated.

Early versions tried to deeply “understand” the query up front, infer topics and constraints, then apply a tight SQL filter before doing any semantic retrieval. It performed well in demos and failed with real users. One incorrect assumption about topic, intent, or domain didn’t make results worse—it made them disappear. Users do not debug search pipelines; they just leave.

The main unlock was separating retrieval from interpretation. Instead of deciding what exists before searching, the system always retrieves a broad candidate set and uses the interpretation layer to rank, cluster, and explain.

At a high level, the current behavior is:

Candidate retrieval always runs, even when confidence in the interpretation is low.

Inferred constraints (tags, speakers, domains) influence ranking and UI hints, not whether results are allowed to exist.

Hard filters are applied only when users explicitly ask for them (or through clear UI actions).

Ambiguous queries produce multiple ranked options or a clarification step, not an empty state.

The system is now less “certain” about its own understanding but dramatically more reliable, which paradoxically makes it feel more intelligent to people using it.

I’m sharing this because most semantic search discussions focus on models and benchmarks, but the sharpest failure modes I ran into were architectural and product level.

If you’ve shipped retrieval systems that had to survive real users especially hybrid SQL + vector stacks I’d love to hear what broke first for you and how you addressed it.

Apple can't secure enough chips as iPhone demand surges, memory prices rise

https://www.cnbc.com/2026/01/29/apple-iphone-soc-memory-tsmc.html
2•1659447091•5m ago•0 comments

Apple Reports Record-Setting 1Q 2026 Results: $42.1B Profit on $143.8B Revenue

https://www.macrumors.com/2026/01/29/apple-1q-2026-earnings/
1•tosh•5m ago•0 comments

How linguistic framing in pitch decks influence investors' judgment – St. Gallen

https://www.pitchwise.se/blog/the-science-of-cold-outreach-a-research-on-why-your-pitch-deck-slid...
1•dabojula•10m ago•0 comments

AI creates asymmetric pressure on Open Source

https://dri.es/ai-creates-asymmetric-pressure-on-open-source
2•7777777phil•10m ago•0 comments

Show HN: Configlock, App Lock for Dotfiles

https://github.com/baggiiiie/configlock
1•baggiiiie•11m ago•0 comments

Cutting down 90% of database spending at Capacities by migrating to Postgres

https://capacities.io/blog/migration-to-postgres
1•steffenbleher•14m ago•1 comments

Show HN: Codeusse – mobile SSH with GUI file browser and LLM config gen

1•wrbl•14m ago•0 comments

Billionaires trying to prolong their life end up wasting it

https://www.thetimes.com/business/companies-markets/article/biohacking-longevity-anti-ageing-0rzf...
1•petethomas•15m ago•1 comments

Microsoft lost $357B in market cap as stock plunged most since 2020

https://www.cnbc.com/2026/01/29/microsoft-market-cap-earnings.html
4•1vuio0pswjnm7•16m ago•0 comments

Show HN: GetSheetAPI – Turn Any Google Sheet into a REST API in 60 Seconds

https://getsheetapi.com
1•sara_builds•18m ago•0 comments

Show HN: Subverted Academy – Rebelling against how cybersecurity is taught

https://academy.subverted.io
4•x_ulla•19m ago•2 comments

Ordercli – CLI for food delivery order history and tracking

https://github.com/steipete/ordercli
1•anupamchugh•21m ago•0 comments

Ask HN: Favourite Moltbot Extensions

1•janpmz•22m ago•0 comments

Benchmarking with Vulkan: the curse of variable GPU clock rates

https://mropert.github.io/2026/01/29/benchmarking_vulkan/
1•ingve•24m ago•0 comments

Google to pay $135M to settle Android data transfer lawsuit

https://www.reuters.com/sustainability/boards-policy-regulation/google-pay-135-million-settle-and...
3•1vuio0pswjnm7•24m ago•0 comments

Show HN: A simple, privacy-focused time ledger (no login required)

https://timekeeping.click/
1•dirkchou•26m ago•1 comments

Ashcan Comic

https://en.wikipedia.org/wiki/Ashcan_comic
1•benbreen•28m ago•0 comments

unfuck-microwave.sh

https://cofe.rocks/notice/B2gUBiWWuW3IyBpk6y
1•robin_reala•28m ago•0 comments

Treasures found on HS2 route stored in secret warehouse

https://www.bbc.co.uk/news/articles/c93v21q5xdvo
3•mellosouls•29m ago•0 comments

The Crypto CEO Who's Become Enemy No. 1 on Wall Street

https://www.wsj.com/finance/currencies/coinbase-ceo-brian-armstrong-wall-street-a7895786
1•1vuio0pswjnm7•32m ago•0 comments

Portugal builds Europe's first dedicated drone carrier, D João II

https://www.euronews.com/2026/01/29/portugal-builds-europes-first-dedicated-drone-carrier-d-joao-ii
1•saubeidl•34m ago•0 comments

AI Interview Coach

https://heroikk.info/voice-chat/
1•tikyda•37m ago•1 comments

Abusers using AI and digital tech to attack and control women, charity warns

https://www.theguardian.com/society/2026/jan/30/abusers-using-ai-and-digital-tech-to-attack-and-c...
1•zeristor•37m ago•0 comments

Show HN: I built a marketing operating system with long-term memory

https://theaicmo.com/
1•rogaai•39m ago•0 comments

UK's first rapid-charging battery train ready for boarding this weekend

https://www.theguardian.com/business/2026/jan/30/uk-first-rapid-charging-battery-train
1•zeristor•40m ago•0 comments

Trump threatens to tariff, decertify Canadian aircraft in latest trade war move

https://www.cbc.ca/news/politics/trump-tariffs-decertify-canadian-planes-9.7067498
1•N19PEDL2•41m ago•1 comments

Anthropic: AI Coding shows no productivity gains; impairs skill development

https://arxiv.org/abs/2601.20245
6•northfield27•48m ago•0 comments

The fat you can't see could be shrinking your brain

https://www.sciencedaily.com/releases/2026/01/260127112127.htm
2•1659447091•50m ago•0 comments

Landing Page Academy by Erik Kennedy

https://www.learnui.design/courses/landing-page-academy.html
1•nyratarg•56m ago•0 comments

Diversification Is Overrated

https://nox.sh/posts/diversification-is-overrated/
1•mattredact•56m ago•0 comments