frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Ask HN: MCP/API search vs. vector search – what's winning for you?

1•ngkw•1h ago
TL;DR: I have a hunch that demand for classic RAG (embeddings + vector DB) will shrink. Reasons:

1. Embedding ops cost (re-indexing, freshness) is high.

2. LLMs are getting good at iterative query expansion over plain search APIs (BM25-style).

3. Embedding quality is still uneven across domains/languages. Curious what you are actually seeing in production.

Context: We’re a \~10-person team inside a large company. People use different UIs (ChatGPT, Claude, Dify, etc.). Cost/security aren’t our main issues; we just want higher throughput. We can wire MCP-style connectors (Notion/Slack/Drive) or run our own vector index—trying to pick battles that really move the needle.

Hypotheses I’m testing:

* For fast-changing corp knowledge, BM25 + LLM query expansion + light re-ranking beats maintaining a vector store (lower ops, decent recall).

* MCP/API search gives “good enough” docs if you union a few expanded queries and re-rank.

* Vectors still win for long-tail semantic matches and noisy phrasing—but only when content is relatively stable or you can afford frequent re-embeds.

What I want from HN (war stories, not vendor pitches):

1. Have you sunset or avoided vector DBs because ops/freshness pain outweighed gains? What were the data size, update rate, and latency targets?

2. If you kept vectors, what made them clearly superior (metrics, error classes, language/domain)? Any concrete thresholds (docs/day churn, avg doc length, query mix) where vectors start paying off?

3. Anyone running pure API search + LLM query expansion (multi-query, aggregation, re-rank) at scale? How many queries per task? Latency/cost vs. vector search?

4. Hybrid setups that worked: e.g., API search to narrow → vector re-rank; or vector recall → LLM judge → final set. What cut false positives/negatives the most?

5. Multilingual/Japanese/domain jargon: where do embeddings still fail you? Did re-ranking (LLM or classic) fix it?

6. Freshness strategies without vectors: caching, recency boosts, metadata filters? What actually reduced “stale answer” complaints?

7. For MCP-style connectors (Notion/Slack/Drive): do you rely on vendor search, or do you replicate content and index yourself? Why?

8. If you’d start from scratch today for a 10-person team, what baseline would you ship first?

Why I’m asking: Our goal is throughput (less time hunting, more time shipping). I’m leaning to:

* Phase 1: MCP/API search + LLM query expansion (3–5 queries), union top-N, local re-rank; no vectors. * Phase 2 (only if needed): add a vector index for the failure cases we can’t fix with expansion/re-rank.

Happy to share a summary of takeaways after the thread. Thanks!

Show HN: Truth Wave – Community Driven Truth or Myth Game

https://truth-wave.lovable.app/
1•liltofu•51s ago•0 comments

Show HN: Web Components SSR and hydration in 1KB– just a decorator, no framework

https://github.com/alexandregiordanelli/web-server-components
1•agiordanelli•1m ago•0 comments

Eye movement patterns reveal subtle signs of cognitive and memory decline

https://medicalxpress.com/news/2025-08-eye-movement-patterns-reveal-subtle.html
1•pseudolus•4m ago•0 comments

Show HN: Hedge UI – React starter kit for trading applications

https://www.hedgeui.com
1•oliverbenns•7m ago•0 comments

We’re Not So Special: A new book challenges human exceptionalism

https://democracyjournal.org/magazine/78/were-not-so-special/
2•nobet•8m ago•0 comments

Help Seed the Smithsonian Archive

https://neuromatch.social/@jonny/115057753332773769
1•keyboardJones•12m ago•1 comments

Plus, Minus: A Gentle Introduction to the Physics of Orthogonal

https://www.gregegan.net/ORTHOGONAL/00/PM.html
1•swyx•13m ago•0 comments

Online M3U8 video downloader – no more manual fragment merging

https://m3u8downloader.org/
1•AdamRichic•13m ago•1 comments

LLMs Are Letter-Blind and Here's Why Enterprises Should Care

https://viveksgag.substack.com/p/llms-are-letter-blind
1•vivganes•13m ago•0 comments

Microsoft workers occupy HQ in protest against company ties to Israeli military

https://www.theguardian.com/technology/2025/aug/19/microsoft-workers-protest-washington-israel
1•t0lo•15m ago•1 comments

Is radicalization reinforced by social media censorship? (2021)

https://arxiv.org/abs/2103.12842
1•Jimmc414•16m ago•0 comments

Show HN: Textideo – Generate short AI videos from text prompts

https://textideo.com/
2•Lily12138•18m ago•0 comments

Canva Valuation Increases to 42B

https://www.afr.com/technology/overnight-millionaires-as-65b-canva-staff-share-sale-starts-20250820-p5moaf
1•hamish-b•18m ago•1 comments

Show HN: PineBill – make invoices in the browser (free, no ads, no account)

https://pinebill.com/
2•glockxx•18m ago•0 comments

The Long Season of Langdev

https://blog.fogus.me/langdev/long-season.html
1•todsacerdoti•19m ago•0 comments

Doom on the Anker Prime Charging Station

https://mastodon.social/@Atc1441/115051397563188886
1•luu•23m ago•0 comments

Author Rie Qudan: Why I used ChatGPT to write my prize-winning novel

https://www.theguardian.com/books/2025/aug/18/author-rie-qudan-why-i-used-chatgpt-to-write-my-prize-winning-novel
1•pseudolus•26m ago•0 comments

Show HN: Qwen Image Edit– Intelligent Image Editing with Qwen-Image-Edit Vision

https://qwenimageedit.cc
1•dahuangf•28m ago•0 comments

Multimodal Sensing-Enabled LLMs for Automated Emotional Regulation

https://www.mdpi.com/1424-8220/25/15/4763
1•PaulHoule•31m ago•0 comments

AlexNet

https://en.wikipedia.org/wiki/AlexNet
1•RyanShook•32m ago•0 comments

The Moving of Kiruna Church Livestream

https://lkab.com/en/events/the-moving-of-kiruna-church/
1•mooreds•32m ago•0 comments

Landrecords – cheap nationwide parcel dataset standardized using gemma3

https://landrecords.us
1•mapsperson•32m ago•1 comments

The Fallacies of Management – The Network Is Reliable

https://xangelo.medium.com/the-fallacies-of-management-the-network-is-reliable-89cd2c958f6c
2•mooreds•33m ago•0 comments

The Value of Hitting the HN Front Page

https://www.mooreds.com/wordpress/archives/3530
3•mooreds•33m ago•1 comments

Image Prompt

https://imageprompt.site
2•MintNow•34m ago•0 comments

The Biden Administration's Gamble to Freeze China's AI Future

https://www.wired.com/story/chips-china-artificial-intelligence-controls/
1•mhga•35m ago•0 comments

Self-Driving Postgres

https://postgres.fm/episodes/self-driving-postgres
1•funerr•36m ago•0 comments

Skill issues – Dialectical Behavior Therapy and its discontents (2024)

https://www.thedriftmag.com/skill-issues/
2•zt•36m ago•0 comments

'American'

https://kieranhealy.org/blog/archives/2025/06/28/american/
2•Bogdanp•38m ago•0 comments

Trump is demanding a Panama-China break-up

https://www.politico.com/news/magazine/2025/08/19/donald-trump-panama-canal-mulino-00513603
4•mhga•39m ago•0 comments