Python library for translating between embedding model vector spaces

https://github.com/PotentiallyARobot/EmbeddingAdapters/
4•Taranovski•1h ago

Comments

Taranovski•1h ago
EmbeddingAdapters is a Python library for translating between embedding model vector spaces.

It provides plug-and-play adapters that map embeddings produced by one model into the vector space of another, either locally or via provider APIs. This enables cross-model retrieval, routing, interoperability, and migration without re-embedding an existing corpus.

If a vector index is already built using one embedding model, embedding-adapters allows it to be queried using another, without rebuilding the index.

GitHub: https://github.com/PotentiallyARobot/EmbeddingAdapters/

PyPI: https://pypi.org/project/embedding-adapters/

Example

Generate an OpenAI embedding locally from MiniLM + adapter:

    pip install embedding-adapters

    embedding-adapters embed \
      --source sentence-transformers/all-MiniLM-L6-v2 \
      --target openai/text-embedding-3-small \
      --flavor large \
      --text "where are restaurants with a hamburger near me"

The command returns:

an embedding in the target (OpenAI) space

a confidence / quality score estimating adapter reliability

Model input

At inference time, the adapter’s only input is an embedding vector from a source model. No text, tokens, prompts, or provider embeddings are used.

A pure vector → vector mapping is sufficient to recover most of the retrieval behavior of larger proprietary embedding models for in-domain queries.
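To make the vector → vector idea concrete, here is a minimal sketch that fits a plain linear map between two embedding spaces by least squares. This is an illustration only, not the library's method: the function names, the linear form, and the synthetic data are all assumptions, and the real adapters may use a different architecture.

```python
import numpy as np

def fit_linear_adapter(source_vecs, target_vecs):
    """Fit weights W and bias b minimizing ||source @ W + b - target||^2.

    Hypothetical helper for illustration; not part of embedding-adapters.
    """
    ones = np.ones((source_vecs.shape[0], 1))
    design = np.hstack([source_vecs, ones])  # extra column carries the bias
    solution, *_ = np.linalg.lstsq(design, target_vecs, rcond=None)
    return solution[:-1], solution[-1]

def apply_adapter(vec, weights, bias):
    """Map a source-space vector into the target space and L2-normalize it."""
    mapped = vec @ weights + bias
    return mapped / np.linalg.norm(mapped, axis=-1, keepdims=True)

# Synthetic stand-ins for paired embeddings of the same texts (assumption):
# 8-dim "source" vectors and 16-dim "target" vectors related by a linear map.
rng = np.random.default_rng(0)
source = rng.normal(size=(500, 8))
true_map = rng.normal(size=(8, 16))
target = source @ true_map

weights, bias = fit_linear_adapter(source, target)

# At inference time the only input is a source-space vector.
query = rng.normal(size=(8,))
adapted = apply_adapter(query, weights, bias)
```

Note that `apply_adapter` never sees text or tokens, which mirrors the input contract described above.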

Benchmark results

Dataset: SQuAD (8,000 Q/A pairs)

Latency (answer embeddings):

MiniLM embed: 1.08 s

Adapter transform: 0.97 s

OpenAI API embed: 40.29 s

≈ 20× faster for local MiniLM + adapter (1.08 s + 0.97 s = 2.05 s) than OpenAI API calls (40.29 s).

Retrieval quality (Recall@10):

MiniLM → MiniLM: 10.32%

Adapter → Adapter: 15.59%

Adapter → OpenAI: 16.93%

OpenAI → OpenAI: 18.26%

Bootstrap difference (OpenAI − Adapter → OpenAI): ~1.34 percentage points

For in-domain queries, the MiniLM → OpenAI adapter recovers ~93% of OpenAI retrieval performance and substantially outperforms MiniLM-only baselines.
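For readers who want to reproduce this kind of comparison, the sketch below shows one common way to compute Recall@k with cosine similarity. It assumes exactly one relevant document per query (as with a question and its answer); the function name and the toy data are hypothetical, and the benchmark above may have been computed differently.

```python
import numpy as np

def recall_at_k(query_vecs, corpus_vecs, relevant_idx, k=10):
    """Fraction of queries whose single relevant document is in the top k."""
    q = query_vecs / np.linalg.norm(query_vecs, axis=1, keepdims=True)
    c = corpus_vecs / np.linalg.norm(corpus_vecs, axis=1, keepdims=True)
    scores = q @ c.T                            # cosine similarities
    top_k = np.argsort(-scores, axis=1)[:, :k]  # best k documents per query
    hits = (top_k == np.asarray(relevant_idx)[:, None]).any(axis=1)
    return float(hits.mean())

# Toy data (assumption): four documents, two queries.
corpus = np.eye(4)
queries = np.array([[1.0, 0.1, 0.0, 0.0],
                    [0.0, 0.0, 1.0, 0.2]])
relevant = [0, 3]

recall_1 = recall_at_k(queries, corpus, relevant, k=1)
recall_2 = recall_at_k(queries, corpus, relevant, k=2)

# Share of OpenAI → OpenAI recall recovered by Adapter → OpenAI,
# using the figures reported above.
recovered = 16.93 / 18.26
```

In the toy data the second query only finds its relevant document once k reaches 2, so `recall_1` is lower than `recall_2`. The `recovered` ratio is where the ~93% figure comes from.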

How it works (high level)

Each adapter is trained on a restricted domain, allowing it to specialize in interpreting the semantic signals of smaller models and projecting them into higher-dimensional provider spaces while preserving retrieval-relevant structure.

A quality score is provided to determine whether an input is well-covered by the adapter’s training distribution.
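The post does not say how the quality score is computed. As one plausible illustration (an assumption, not the library's scoring), coverage can be approximated by how close an input vector is to its nearest neighbor among a sample of the adapter's training embeddings:

```python
import numpy as np

def coverage_score(vec, training_vecs):
    """Highest cosine similarity between vec and any training vector.

    Hypothetical proxy for "is this input well covered by the training data";
    values near 1 suggest in-domain input, values near 0 suggest out-of-domain.
    """
    v = vec / np.linalg.norm(vec)
    t = training_vecs / np.linalg.norm(training_vecs, axis=1, keepdims=True)
    return float(np.max(t @ v))

# Toy training sample clustered around one direction (assumption).
training = np.array([[1.0, 0.0, 0.0],
                     [0.9, 0.1, 0.0]])

in_domain = coverage_score(np.array([1.0, 0.05, 0.0]), training)
out_of_domain = coverage_score(np.array([0.0, 0.0, 1.0]), training)
```

Here `in_domain` comes out close to 1 and `out_of_domain` is 0, which is the kind of separation a threshold could act on.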

Practical uses in Python applications

Query an existing vector index built with one embedding model using another

Operate mixed vector indexes and route queries to the most effective embedding space

Reduce cost and latency by embedding locally for in-domain queries

Evaluate embedding providers before committing to a full re-embed

Gradually migrate between embedding models

Handle provider outages or rate limits gracefully

Run RAG pipelines in air-gapped or restricted environments

Maintain a stable “canonical” embedding space while changing edge models
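Several of the uses above come down to the same routing decision: embed locally when the adapter is confident, and fall back to the provider otherwise. A minimal sketch follows. The function names, the stub embedders, and the 0.8 threshold are all hypothetical and do not reflect the library's API.

```python
from typing import Callable, List, Tuple

Vector = List[float]

def embed_with_fallback(
    text: str,
    local_embed: Callable[[str], Tuple[Vector, float]],
    remote_embed: Callable[[str], Vector],
    threshold: float = 0.8,
) -> Tuple[Vector, str]:
    """Use the local adapter when its quality score clears the threshold."""
    vector, score = local_embed(text)
    if score >= threshold:
        return vector, "adapter"
    # Low confidence: pay the cost of a provider call instead.
    return remote_embed(text), "provider"

# Stub embedders standing in for a real local model + adapter and a real
# provider API (assumptions for the sake of a runnable example).
def fake_local_embed(text: str) -> Tuple[Vector, float]:
    score = 0.9 if "restaurant" in text else 0.3
    return [0.1, 0.2], score

def fake_remote_embed(text: str) -> Vector:
    return [0.3, 0.4]

in_domain_result = embed_with_fallback(
    "restaurants near me", fake_local_embed, fake_remote_embed
)
out_of_domain_result = embed_with_fallback(
    "quantum chromodynamics", fake_local_embed, fake_remote_embed
)
```

Swapping the order of the two branches gives the outage case: call the provider first and use the adapter only when the call fails.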

Supported adapters

MiniLM ↔ OpenAI

OpenAI ↔ Gemini

E5 ↔ MiniLM

E5 ↔ OpenAI

E5 ↔ Gemini

MiniLM ↔ Gemini

The project is under active development, with ongoing work on additional adapter pairs, domain specialization, evaluation tooling, and training efficiency.

Please Like/Upvote if you found this interesting
