frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Semantic search over the National Gallery of Art

https://nga.demo.mixedbread.com/
35•breadislove•2h ago

Comments

philipkglass•1h ago
How does this work? I thought it was probably powered by embeddings and maybe some more traditional search code, but I checked out the linked github repo and I didn't see any model/inference code. The public code is a wrapper that communicates with your commercial API?

Some searches work like magic and others seem to veer off target a lot. For example, "sculpture" and "watercolor" worked just about how I'd expect. "Lamb" showed lambs and sheep. But "otter" showed a random selection of animals.

breadislove•1h ago
It is powered by Mixedbread Search which is powered by our model Omni. Omni is multimodal (text, video, audio, images) and multi vector, which helps us to capture more information.

The search is in beta and we improving the model. Thank you for reporting the queries which are not working well.

Edit: Re the otter, I just checked and I did not found otters in the dataset. We should not return any results if the model is not sure to reduce confusion.

justincormack•1h ago
neither "blue pictures" nor "multiples" worked well.
breadislove•1h ago
thank you for reporting these. we will improve on them for the next iteration.
philipkglass•1h ago
There's at least a little bit of otter in the data. The one relevant result I saw was "Plate 40: Two Otters and a Beaver" by Joris Hoefnagel.

I also expected semantic search to return similar results for "fireworks" and "pyrotechnics," since the latter is a less common synonym for the former. But I got many results for fireworks and just one result for pyrotechnics.

This is still impressive. My impulse is to poke at it with harder cases to try to reason about how it could be implemented. Thanks for your Show HN and for replying to me!

breadislove•1h ago
If you find more such cases please feel free to send them over to aamir at domain name of the Show HN. I would love to see those cases and see how we can improve on them. Thank you so much for the feedback.
yawnxyz•1h ago
hey, your service is back up again!!! Mixedbread was my favorite tool for so long since your pivot, and I'm so glad y'all are back
breadislove•1h ago
We have a lot more things coming up soon. It just took us some time building Mixedbread Search.
nmitchko•1h ago
In case anyone wants to do this themselves, check out the pipeline here: https://github.com/isc-nmitchko/iris-document-search

Colnomic and nvidia models are great for embedding images and MUVERA can transform those to 1D vectors.

dfc•1h ago
It would be nice if took you to the NGA page about the item. I cant even copy the text easily for easy search.

"Images of german shepherds" never fails to provide some humor.

breadislove•1h ago
Thank you for pointing this out. We will add this tomorrow morning.
dfc•1h ago
The results for "Mark Rothko", "Paintings by Mark Rothko", "Paintings similar to mark rothko" etc does not bring up anything that I was expecting. NGA has a large collection of Rothko paintings but none of them come up.

This NGA link returns over a thousand pieces by Rothko: https://www.nga.gov/artists/1839-mark-rothko/artworks

breadislove•1h ago
We are right now not including the artist name. Which will be done in the next iteration of the model (next week). Right now the search is only based on what the model can "see". And it seems like that the model does not understand the art of Mark Rothko.

The next version can see the image and read the metadata.

A bit more context: We are include everything in the latent space (embeddings) without trying to maintain multiple indexes and hack around things. There is still a huge mountain to climb. But this one seems really promising.

Computer0•1h ago
This is neat, not sure how to report queries that are working poorly as you have mentioned. But when I search "Waltz" I am presented with Kitchen Utensils and only one piece of dancing folks. Presumably this is due to the Artist's name being 'Walton'.
breadislove•1h ago
We will add a feedback form tomorrow morning. For now please feel free to write to aamir at domain name of the page. thank you so much! this helps us a lot.

I Built the Perfect Workflow and attracted some friends in the process

https://www.graemefawcett.ca/blog/strange-attractor
2•graemefawcett•6m ago•1 comments

Most Investments Are Bad

https://www.lynalden.com/most-investments-are-bad/
3•jameslk•6m ago•1 comments

AMD Solarflare X4 NICs Launched for Low Latency Trading

https://www.servethehome.com/amd-solarflare-x4-nics-launched-for-low-latency-trading/
1•kcb•8m ago•0 comments

Download: The True Story of the Internet – E02: Search [video]

https://www.youtube.com/watch?v=QJyb8hhe6gA
1•alan-stark•10m ago•2 comments

Show HN: I created an AI Assistant for language teachers

https://classgenie.ca
1•gabceboli•17m ago•1 comments

Newsom signs historic housing bill to bring density to transit hubs

https://www.latimes.com/california/story/2025-10-10/newsom-signs-historic-housing-bill-bringing-d...
3•toomanyrichies•18m ago•0 comments

DC Comics declares it will never use generative AI

https://www.avclub.com/dc-comics-wont-use-generative-ai
1•geox•20m ago•0 comments

Why MNAV Rainbow Bands Match Historical Cycles Better

https://posix4e.github.io/btc-mnav-rainbow/
1•alexnewman•21m ago•1 comments

The invention of microbiology

https://www.historytoday.com/archive/months-past/invention-microbiology
2•hhs•21m ago•0 comments

What Jihad al-Shamie's three wives tell us about terror

https://www.thetimes.com/comment/columnists/article/what-jihad-al-shamies-three-wives-tell-us-abo...
2•binning•22m ago•0 comments

Pegatron Revenue Declines 12.3%

https://www.taipeitimes.com/News/biz/archives/2025/10/11/2003845273
1•mgh2•23m ago•0 comments

Bidirectional type checking step by step (in Ruby)

https://luizpvas.github.io/11_bidirectional_type_checking.html
1•luizpv9•24m ago•0 comments

Using Run-Kit to Run Python, R, and JavaScript Inside Rust – 26 Languages

https://www.esubalew.et/blog/2025/10/11/using-run-kit-multi-language-rust
1•esubaalew•25m ago•1 comments

What billionaire Peter Thiel said in his private 'Antichrist lectures'

https://www.washingtonpost.com/technology/2025/10/10/peter-thiel-antichrist-lectures-leaked/
3•reaperducer•26m ago•0 comments

Xtra: The company that lets DJI sneak its popular cameras into the US

https://www.theverge.com/report/795016/xtra-muse-dji-osmo-pocket-3-us-customs-tariffs
1•cocoflunchy•27m ago•0 comments

More screen time linked to lower test scores for elementary students

https://www.cbc.ca/news/canada/screentime-test-scores-9.6935108
1•empressplay•30m ago•0 comments

Apple nears deal to acquire talent, tech from AI startup Prompt AI

https://www.cnbc.com/2025/10/10/apple-nears-deal-to-acquire-talent-tech-from-ai-startup-prompt-ai...
1•coloneltcb•32m ago•0 comments

The Porcelain to Come

https://stackdiver.com/posts/the-porcelain-to-come/
1•stackdiver•32m ago•0 comments

Socket Integrates with Bun 1.3's Security Scanner API

https://socket.dev/blog/socket-integrates-with-bun-1-3-security-scanner-api
1•feross•34m ago•0 comments

A million-solar-mass object detected at cosmological distance

https://arxiv.org/abs/2510.07382
1•bikenaga•34m ago•0 comments

DEF CON 33 videos now available

https://defcon.social/@defcon/115352196547755197
5•8organicbits•35m ago•0 comments

Make a procedurally generated planet

https://kayleegeorge.github.io/blog/generative-worlds/
1•kayleegeorge•37m ago•0 comments

Discover Claude Code Plugins and Marketplaces

https://claudecodemarketplace.com
1•joesaunderson•40m ago•1 comments

Olive Morris: The black feminist icon and rabble rouser whose life was cut short

https://metro.co.uk/2025/10/04/olive-morris-black-feminist-icon-rabble-rouser-whose-life-tragical...
1•binning•40m ago•0 comments

Nobel peace prize officials investigate surge in bets for winner

https://www.theguardian.com/world/2025/oct/10/nobel-peace-prize-bets-polymarket
3•c420•44m ago•0 comments

President Trump creates export controls on "any and all critical software" Nov 1

https://www.cnbc.com/2025/10/10/trump-trade-tariffs-china-software.html
8•dabockster•45m ago•3 comments

Printing Petscii Faster

https://retrogamecoders.com/printing-petscii-faster/
2•ibobev•46m ago•0 comments

Full Browser-Based CP/M emulator – finally

https://retrogamecoders.com/cpm-new/
2•ibobev•46m ago•0 comments

GPT-5 for AI-assisted discovery

https://www.johndcook.com/blog/2025/10/10/gpt-5-for-ai-assisted-discovery/
1•ibobev•46m ago•0 comments

"It was just a neural net tripping balls, bro"

https://twitter.com/bendiken/status/1976769934144553019
1•arto•47m ago•0 comments