frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Vector Inspector – A forensic tool for vector databases

https://vector-inspector.divinedevops.com
1•spitefowl•1h ago
I’ve been working with vector databases a lot recently and I kept running into the same problem: it’s really hard to see what’s going on.

Every provider has its own UI (or none), debugging embeddings is guesswork, and migrating data between systems is painful. I wanted a single tool where I could browse, search, visualize, and compare vector data across providers.

So I built Vector Inspector.

It currently supports:

- Chroma

- Qdrant (local + server)

- Postgres/pgvector

- Pinecone (partial support)

You can browse collections, inspect metadata, run searches, compare distances, visualize embeddings, and debug cases where a vector “should” match but doesn’t. The goal is to make it feel like a forensic tool for vector data — something that helps you understand what your embeddings are actually doing.

There’s an OSS tier (Vector Inspector) and a more advanced version (Vector Studio) with upcoming features like clustering overlays, model-to-model comparison, and provenance coloring.

One of the biggest problems I kept hitting was missing provenance. You load a collection and you have no idea:

- what model produced these vectors

- whether they were normalized

- whether some vectors came from a different model entirely

- whether the source text was cleaned or chunked differently

Without that context, debugging is almost impossible. Vector Inspector tries to make provenance a first-class concept: if the metadata exists, it shows it; if it’s missing, it makes that visible too, so you can actually debug your embeddings instead of guessing.

I’d love feedback from the HN crowd — especially around:

- workflows you’d want for multi-provider setups

- what’s missing for real debugging

- how you’d expect migrations to work

- any pain points you’ve hit with embeddings or vector DBs

- how you would like to work with creation workflows

Repo: https://github.com/anthonypdawson/vector-inspector

Landing page: https://vector-inspector.divinedevops.com

Comments

spitefowl•1h ago
Happy to answer questions or go deeper on anything. A few notes that might help set expectations:

- Provider support is solid for Chroma, Qdrant, and Postgres/pgvector. Pinecone works for most read workflows but isn’t full parity yet.

- The tool is designed to be “forensic first”: surfacing metadata, provenance, and mismatches rather than hiding them behind abstractions.

- Visualization is intentionally minimal right now; clustering overlays and model-to-model comparison are in progress.

- I’m especially interested in how people think about creation workflows (re-embedding, mixed-model collections, reproducibility, etc.) since teams handle this very differently.

Just to set expectations: it’s basically been me running it so far. PyPI has been getting a lot of traffic, but real-world usage is still very small. I’m really curious how it behaves with other people’s data and workflows — that feedback is incredibly helpful at this stage.

If you hit anything confusing, missing, or surprising, I’d love to hear it. Real-world debugging stories are gold for shaping the next set of features.

Building multiple small AI tools instead of one big SaaS

1•mohiuddinansari•1m ago•0 comments

Quantitative developmental biology in vitro using micropatterning(2021)

https://pmc.ncbi.nlm.nih.gov/articles/PMC8353268/
1•rolph•2m ago•0 comments

Show HN: TypeSync – Generate TypeScript type guards from your database schema

https://typesync-db-to-ts-type-guard.vercel.app
1•notmayorpete•3m ago•0 comments

Coding Agent VMs on NixOS with Microvm.nix

https://michael.stapelberg.ch/posts/2026-02-01-coding-agent-microvm-nix/
1•denysvitali•4m ago•0 comments

Systems of pattern formation within developmental biology(2021)

https://www.sciencedirect.com/science/article/abs/pii/S0079610721001140
1•rolph•4m ago•0 comments

Agency – Open-source multi-agent platform for autonomous software development

https://github.com/jarredkenny/agency-ai
1•jarredkenny•4m ago•1 comments

The AI Grand Prix

https://www.dcl-project.com/
2•quadzilla•8m ago•1 comments

SkyNet Project

https://zenodo.org/records/18452356
1•KaoruAK•10m ago•0 comments

Data Structure Alignment

https://en.wikipedia.org/wiki/Data_structure_alignment
1•Brysonbw•10m ago•0 comments

Ask HN: What serious task have you accomplished with Moltbot / OpenClaw?

1•lukol•11m ago•0 comments

I put AoE II sounds in my Claude Code Worktree/Sandbox Manager and it's glorious

https://www.agent-of-empires.com/docs/sounds.html
2•river_otter•14m ago•3 comments

Scaling markets with non-human operators

https://selectfromwhereand.com/musings/scaling_operators/
2•iamsam123•21m ago•1 comments

Show HN: Wikipedia as a doomscrollable social media feed

https://xikipedia.org
4•rebane2001•24m ago•0 comments

Artemis II: A Step Towards Permanent Human Activity Beyond Low Earth Orbit

https://www.realcleardefense.com/articles/2026/01/31/artemis_ii_a_step_towards_permanent_human_ac...
1•Gaishan•26m ago•0 comments

Oracle to Raise Up to $50B This Year for Cloud Investment

https://www.bloomberg.com/news/articles/2026-02-01/oracle-to-raise-up-to-50-billion-this-year-for...
2•zerosizedweasle•27m ago•2 comments

The Physics of Glitches: Analyzing 'The Backrooms' as a Systems Failure

https://misssandwich.substack.com/p/the-yellow-perversion-of-the-real-eed
1•misssandwich•28m ago•1 comments

We built an AI sysadmin that works (and won't delete /usr)

https://github.com/goshops-com/opsagent
2•sjcotto•34m ago•1 comments

Time Machine-style Backups with rsync (2018)

https://samuelhewitt.com/blog/2018-06-05-time-machine-style-backups-with-rsync
2•accrual•36m ago•0 comments

VoidLink: The Cloud-Native Malware Framework Weaponizing Linux Infrastructure

https://blog.checkpoint.com/research/voidlink-the-cloud-native-malware-framework-weaponizing-linu...
1•PaulHoule•38m ago•0 comments

Testing your fit for policy careers (2024)

https://emergingtechpolicy.org/essentials/policy-fit-testing/
2•jstrieb•39m ago•0 comments

It's All About the Pixel Economy

https://cvalenzuelab.com/pixel-economy
1•nsm•39m ago•0 comments

Before ChatGPT-HW debate there were other "If students use X to do HW" debates

https://blog.computationalcomplexity.org/2026/02/before-chatgpt-hw-debate-there-were.html
1•zdw•39m ago•0 comments

Selfhosted Bible PWA

https://mobilebible.net/
2•PaxSubChristo•40m ago•3 comments

Otava: Change Detection for Continuous Performance Engineering

https://github.com/apache/otava
1•tanelpoder•41m ago•0 comments

History and Timeline of the Proco Rat Pedal (2021)

https://web.archive.org/web/20211030011207/https://thejhsshow.com/articles/history-and-timeline-o...
2•brudgers•42m ago•1 comments

Show HN: I made a voice cloning Discord bot

https://copykitten.gg/
1•TheSaltySeaCow•45m ago•0 comments

Two kinds of AI users are emerging. The gap between them is astonishing

https://martinalderson.com/posts/two-kinds-of-ai-users-are-emerging/
2•martinald•51m ago•0 comments

How One Line of Python Triggers 12,000 Lines of Code [video]

https://www.youtube.com/watch?v=5B6W2OGfxq0
2•thunderbong•1h ago•0 comments

Show HN: Cut Your Pinecone Bill by 50% (Open Source Cost Auditor)

https://github.com/billycph/VectorDBCostSavingInspector
1•billycph•1h ago•0 comments

Aliasing and the Heisenberg Uncertainty Principle

http://blog.sigfpe.com/2013/01/aliasing-and-heisenberg-uncertainty.html
2•wtrm•1h ago•0 comments