Show HN: Digger Solo – Semantic File Search and Maps

https://solo.digger.lol
1•sean_pedersen•3h ago
I built Digger Solo, a privacy-respecting file explorer from the future. Its core features are semantic file search and interactive data maps that let you explore your files with ease. All processing happens locally: your files never leave your machine.

The file search combines full-text search with semantic search, so you can search text and image files by their meaning (even if an image has no descriptive file name). You can start a search query with a tag, e.g. "#jpg cat", to search all your jpg files for cats.
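To illustrate the tag syntax, here is a minimal sketch of how a query such as "#jpg cat" could be split into tag filters and free text. The function name parse_query and the rule that any token starting with "#" is a tag are assumptions made for illustration, not Digger Solo's actual parser.

```python
def parse_query(query: str) -> tuple[list[str], str]:
    """Split a query into tag filters and the remaining free-text part.

    Assumption: every whitespace-separated token that starts with '#'
    (and has at least one more character) is treated as a tag.
    """
    tags: list[str] = []
    words: list[str] = []
    for token in query.split():
        if token.startswith("#") and len(token) > 1:
            tags.append(token[1:].lower())  # drop the '#', normalize case
        else:
            words.append(token)
    return tags, " ".join(words)


if __name__ == "__main__":
    print(parse_query("#jpg cat"))  # (['jpg'], 'cat')
```

The tags would then narrow the candidate files, and the free text would go to the full-text and semantic search.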

Files are sorted by content similarity in interactive maps that reveal hidden connections and patterns across your collection (text, image, video & audio files supported).

Tags are inferred from imported file paths and file types.
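As a rough sketch of what inferring tags from a path and file type might look like: the post does not describe the actual rules, so the choice below (every parent folder name plus the lowercased file extension becomes a tag) and the function name infer_tags are purely hypothetical.

```python
from pathlib import PurePosixPath


def infer_tags(path: str) -> set[str]:
    """Derive tags from a file path (hypothetical rule set).

    Assumption: each parent directory name and the file extension
    (without the dot) become lowercase tags.
    """
    p = PurePosixPath(path)
    # Skip the root marker "/" that appears in parts for absolute paths.
    tags = {part.lower() for part in p.parent.parts if part != "/"}
    ext = p.suffix.lstrip(".").lower()
    if ext:
        tags.add(ext)
    return tags


if __name__ == "__main__":
    print(sorted(infer_tags("Photos/Cats/tom.JPG")))  # ['cats', 'jpg', 'photos']
```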

Technicalities:

Digger Solo is quite a complex beast: I am using PyTauri (https://github.com/pytauri/pytauri - Python bindings for Tauri, which has a Rust backend and a JS frontend). Python is used to import files, run the dimensionality reduction algorithms (t-SNE, UMAP etc.) and handle most of the database logic (using SQLite3). The CLIP model (JINA CLIP v1) runs as an ONNX model using the Rust crate ORT (https://ort.pyke.io/). Inference currently supports only CPU.

The semantic search is powered by the SQLite3 extension sqlite-vec (https://github.com/asg017/sqlite-vec), which adds optimized brute-force nearest-neighbor vector search (no approximate vector index). So Digger Solo is not meant for millions of files, more like a few hundred thousand.
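To show why brute-force search scales linearly with the number of files, here is a small pure-Python sketch of exact nearest-neighbor search. It is not how sqlite-vec is implemented; the cosine similarity metric, the toy 2-dimensional vectors, and the names cosine_similarity and nearest are all assumptions for illustration.

```python
import math


def cosine_similarity(a: list[float], b: list[float]) -> float:
    """Cosine similarity of two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0


def nearest(query: list[float], vectors: dict[str, list[float]], k: int = 3) -> list[str]:
    """Return the k most similar file names by scoring every stored vector.

    There is no index: each query compares against all vectors, so the
    cost grows linearly with the size of the collection.
    """
    scored = [(cosine_similarity(query, vec), name) for name, vec in vectors.items()]
    scored.sort(reverse=True)  # highest similarity first
    return [name for _, name in scored[:k]]


if __name__ == "__main__":
    # Toy embeddings; real CLIP embeddings have hundreds of dimensions.
    files = {
        "cat.jpg": [1.0, 0.0],
        "dog.jpg": [0.6, 0.8],
        "car.jpg": [0.0, 1.0],
    }
    print(nearest([1.0, 0.0], files, k=2))  # ['cat.jpg', 'dog.jpg']
```

Every query touches every vector, which is fine for a few hundred thousand files but becomes slow at millions, where an approximate index would usually be used instead.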

The installer contains all model files used (text, vision and audio encoders), which is why the download size is so big. I did not want to require additional downloads after installation.

The Windows version has had problems running the Python modules reliably for some customers (I am working on a fix). So I recommend trying the free version first to see if file importing works before purchasing a key.

Reworking Memory Management in CRuby [pdf]

https://blog.peterzhu.ca/assets/ismm_2025.pdf
1•hahahacorn•59s ago•0 comments

The Fish That Climbed a Mountain

https://longreads.com/2025/05/29/fishing-national-park-wildlife-recreation/
1•gmays•1m ago•0 comments

How to Keep Up with New CSS Features

https://css-tricks.com/how-to-keep-up-with-new-css-features/
1•ulrischa•2m ago•0 comments

Threatening AI Does Not Make It More Useful. Why Sergey Brin Is Wrong

https://www.tcg.com/blog/does-being-rude-to-ai-make-it-more-useful-why-sergey-brin-is-wrong/
2•rbuccigrossi•4m ago•2 comments

Wartime codebreaker Alan Turing's scientific papers sell for £465,000 at auction

https://www.theguardian.com/science/2025/jun/17/wartime-codebreaker-alan-turings-scientific-papers-sell-for-465000-at-auction
1•robaato•5m ago•0 comments

Spain's government blames blackout on grid regulator and private firms

https://www.bbc.com/news/articles/c62d8k8edgxo
1•janv8000•5m ago•0 comments

Radio bursts reveal universe's 'missing matter'

https://www.science.org/content/article/radio-bursts-reveal-universe-s-missing-matter
1•atombender•9m ago•0 comments

Neuron–Astrocyte Associative Memory

https://www.pnas.org/doi/10.1073/pnas.2417788122
2•PaulHoule•10m ago•0 comments

Nasdaq-traded Chinese herb stock with no revenues rallies 58,000%

https://www.cnbc.com/2025/06/17/hong-kongs-regencell-bioscience-triples-in-latest-surge-for-a-speculative-stock.html
1•ilamont•11m ago•0 comments

Man arrested for selling AI-colorized pirated 1954 'Godzilla' film

https://english.kyodonews.net/news/2025/06/63c83474a164-man-arrested-for-selling-ai-colorized-pirated-1954-godzilla-film.html
1•anigbrowl•11m ago•0 comments

Free Ruby AI Training Materials

https://github.com/thedayisntgray/ruby-ai-search-training
1•thedayisntgray•12m ago•0 comments

Facebook announces that all videos on its platform will soon be shared as reels

https://techcrunch.com/2025/06/17/facebook-announces-that-all-videos-on-its-platform-will-soon-be-shared-as-reels/
1•LorenDB•13m ago•0 comments

What Google Translate Can Tell Us About Vibecoding

https://ingrids.space/posts/what-google-translate-can-tell-us-about-vibecoding/
1•todsacerdoti•15m ago•0 comments

You're Not Ready

https://www.wired.com/youre-not-ready/
1•lostin01010101•15m ago•0 comments

Field Notes went from side project to cult notebook

https://www.fastcompany.com/91352848/field-notes-cult-notebook-started-out-as-a-side-project
2•ingve•19m ago•0 comments

Build a Cannon to Kill a Mosquito

https://manidoraisamy.com/developer-forever/post/build-a-cannon-to-kill-a-mosquito.anc-0ac4dfc1-80cf-4f93-854a-47010d1268a2.html
1•QueensGambit•19m ago•0 comments

Trump suggests he'll extend deadline for China's ByteDance to sell TikTok

https://www.scmp.com/news/world/united-states-canada/article/3314833/trump-suggests-hell-extend-deadline-chinas-bytedance-sell-tiktok
1•giuliomagnifico•21m ago•0 comments

Programming Language Design in the Era of LLMs: A Return to Mediocrity?

https://kirancodes.me/posts/log-lang-design-llms.html
1•gopiandcode•21m ago•0 comments

Keycloak: Open-Source Identity and Access Management

https://www.keycloak.org/
1•EtienneK•23m ago•0 comments

Iran asks its people to delete WhatsApp from their devices

https://apnews.com/article/iran-whatsapp-meta-israel-d9e6fe43280123c9963802e6f10ac8d1
4•rdrd•26m ago•0 comments

'It opened up something in me': Why people are turning to bibliotherapy

https://www.bbc.com/future/article/20250616-how-bibliotherapy-can-both-help-and-harm-your-mental-health
1•ohjeez•26m ago•0 comments

A Texan reads his electric bill

https://old.reddit.com/r/funny/comments/1ld7m3v/texan_reads_his_electric_bill/
2•ohjeez•28m ago•0 comments

Voronoi, Hashing and OSL

https://aras-p.info/blog/2025/06/13/Voronoi-Hashing-and-OSL/
1•ibobev•29m ago•0 comments

Alleged shooter found Minnesota lawmakers' addresses online, court docs say

https://www.politico.com/news/2025/06/16/alleged-shooter-found-minnesota-lawmakers-addresses-online-court-docs-say-00409260
3•rntn•30m ago•0 comments

From SDR to 'Fake HDR': Mario Kart World on Switch 2

https://www.alexandermejia.com/from-sdr-to-fake-hdr-mario-kart-world-on-switch-2-undermines-modern-display-potential/
2•ibobev•31m ago•0 comments

Improving Bigtable single-row read throughput by 70%

https://cloud.google.com/blog/products/databases/exploring-bigtable-read-throughput-performance-gains/
1•fastest963•31m ago•0 comments

AlphaPhoenix - I weighed an airplane while it was flying [video]

https://www.youtube.com/watch?v=hnvtstq3ztI
2•seycombi•32m ago•0 comments

Goldfish Memory

https://theaiunderwriter.substack.com/p/goldfish-memory
1•participant3•33m ago•0 comments

My Newest Patient Cannot Blink: A Therapy-Loop Prompt Pattern for Trustworthy AI

https://zenodo.org/records/15556365
1•pinko•33m ago•1 comments

Hybrid-Electric Commuter Airplane

https://www.electra.aero/
2•everybodyknows•33m ago•0 comments