frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

We Scanned an AI Assistant for Security Issues: 12,465 Vulnerabilities

https://codeslick.dev/blog/openclaw-security-audit
1•vitorlourenco•50s ago•0 comments

Amazon no longer defend cloud customers against video patent infringement claims

https://ipfray.com/amazon-no-longer-defends-cloud-customers-against-video-patent-infringement-cla...
1•ffworld•1m ago•0 comments

Show HN: Medinilla – an OCPP compliant .NET back end (partially done)

https://github.com/eliodecolli/Medinilla
2•rhcm•4m ago•0 comments

How Does AI Distribute the Pie? Large Language Models and the Ultimatum Game

https://papers.ssrn.com/sol3/papers.cfm?abstract_id=6157066
1•dkga•4m ago•1 comments

Resistance Infrastructure

https://www.profgalloway.com/resistance-infrastructure/
2•samizdis•9m ago•0 comments

Fire-juggling unicyclist caught performing on crossing

https://news.sky.com/story/fire-juggling-unicyclist-caught-performing-on-crossing-13504459
1•austinallegro•9m ago•0 comments

Restoring a lost 1981 Unix roguelike (protoHack) and preserving Hack 1.0.3

https://github.com/Critlist/protoHack
2•Critlist•11m ago•0 comments

GPS and Time Dilation – Special and General Relativity

https://philosophersview.com/gps-and-time-dilation/
1•mistyvales•14m ago•0 comments

Show HN: Witnessd – Prove human authorship via hardware-bound jitter seals

https://github.com/writerslogic/witnessd
1•davidcondrey•14m ago•1 comments

Show HN: I built a clawdbot that texts like your crush

https://14.israelfirew.co
2•IsruAlpha•16m ago•2 comments

Scientists reverse Alzheimer's in mice and restore memory (2025)

https://www.sciencedaily.com/releases/2025/12/251224032354.htm
1•walterbell•19m ago•0 comments

Compiling Prolog to Forth [pdf]

https://vfxforth.com/flag/jfar/vol4/no4/article4.pdf
1•todsacerdoti•21m ago•0 comments

Show HN: Cymatica – an experimental, meditative audiovisual app

https://apps.apple.com/us/app/cymatica-sounds-visualizer/id6748863721
1•_august•22m ago•0 comments

GitBlack: Tracing America's Foundation

https://gitblack.vercel.app/
2•martialg•22m ago•0 comments

Horizon-LM: A RAM-Centric Architecture for LLM Training

https://arxiv.org/abs/2602.04816
1•chrsw•22m ago•0 comments

We just ordered shawarma and fries from Cursor [video]

https://www.youtube.com/shorts/WALQOiugbWc
1•jeffreyjin•23m ago•1 comments

Correctio

https://rhetoric.byu.edu/Figures/C/correctio.htm
1•grantpitt•23m ago•0 comments

Trying to make an Automated Ecologist: A first pass through the Biotime dataset

https://chillphysicsenjoyer.substack.com/p/trying-to-make-an-automated-ecologist
1•crescit_eundo•28m ago•0 comments

Watch Ukraine's Minigun-Firing, Drone-Hunting Turboprop in Action

https://www.twz.com/air/watch-ukraines-minigun-firing-drone-hunting-turboprop-in-action
1•breve•28m ago•0 comments

Free Trial: AI Interviewer

https://ai-interviewer.nuvoice.ai/
1•sijain2•28m ago•0 comments

FDA intends to take action against non-FDA-approved GLP-1 drugs

https://www.fda.gov/news-events/press-announcements/fda-intends-take-action-against-non-fda-appro...
21•randycupertino•30m ago•12 comments

Supernote e-ink devices for writing like paper

https://supernote.eu/choose-your-product/
3•janandonly•32m ago•0 comments

We are QA Engineers now

https://serce.me/posts/2026-02-05-we-are-qa-engineers-now
1•SerCe•33m ago•0 comments

Show HN: Measuring how AI agent teams improve issue resolution on SWE-Verified

https://arxiv.org/abs/2602.01465
2•NBenkovich•33m ago•0 comments

Adversarial Reasoning: Multiagent World Models for Closing the Simulation Gap

https://www.latent.space/p/adversarial-reasoning
1•swyx•33m ago•0 comments

Show HN: Poddley.com – Follow people, not podcasts

https://poddley.com/guests/ana-kasparian/episodes
1•onesandofgrain•41m ago•0 comments

Layoffs Surge 118% in January – The Highest Since 2009

https://www.cnbc.com/2026/02/05/layoff-and-hiring-announcements-hit-their-worst-january-levels-si...
13•karakoram•41m ago•0 comments

Papyrus 114: Homer's Iliad

https://p114.homemade.systems/
1•mwenge•41m ago•1 comments

DicePit – Real-time multiplayer Knucklebones in the browser

https://dicepit.pages.dev/
1•r1z4•41m ago•1 comments

Turn-Based Structural Triggers: Prompt-Free Backdoors in Multi-Turn LLMs

https://arxiv.org/abs/2601.14340
2•PaulHoule•43m ago•0 comments
Open in hackernews

Hachi: An Image Search Engine

https://eagledot.xyz/hachi.md.html
152•warangal•2mo ago

Comments

spacecadet•2mo ago
You can hack together an image search with a 500k VLM and a tiny embedding model that works surprisingly well. I built a tool like this 2 years ago that I can throw a hard drive at and any and all image files are processed and searchable locally, including video frames.
pbronez•2mo ago
Interesting project, very dense post. I like the idea of a genuine personal search engine. You’d think that Windows and MacOS would do this well, but they really don’t.

Project GitHub is here https://github.com/eagledot/hachi

pbronez•2mo ago
Reminds me of Danswer, actually. That’s an LLM-powered personal search engine. Looks like they’re making an enterprise play now.

https://danswer-website.vercel.app

TheTaytay•2mo ago
I have also been surprised that personal search engines are not a solved problem. “We” have actually known how to do decent search for a long time, including across images and the entire freaking internet for over two decades, but it’s not simple or commonplace to get a good semantic search interface for your own files, local or remote.

Chrome currently offers a semantic search across your browser history, but it’s buried. The major photo services allow for search across your photos. Windows and Mac have indexed keyword search across files, but the interface feels primitive.

I increasingly want a private search index across my browsing history, my photos, my notes/files, my voice recordings, GitHub projects, etc.

I thought a paid personalizable search engine like Kagi would be a good place to get/build a personalized internet search index on my browser history, but they don’t really offer the tools for that scale.

There are some enterprise search engines trying to solve this for orgs, so maybe I should be looking there?

I’m glad to see projects like Hachi, and am curious what others are doing or reaching for.

mikepurvis•2mo ago
“Windows and Mac have indexed keyword search across files, but the interface feels primitive.”

The functionality is further obscured when (at least on windows) the local files results are intermingled with results from afar, which I guess are Bing.

clearleaf•2mo ago
For me it just doesn't work at all. I don't know why but every windows instance I've used since Win7 has not been able to find files even with the exact filename supplied. I don't disable the indexer. I can see it using CPU and disk resources but it just doesn't find anything relevant when I search. When I instead use Search Everything on Windows it works perfectly.
salawat•2mo ago
No money to be made in making your life easier in that way, therefore no KPI is generated for it's implementation.
attila-lendvai•2mo ago
plus that would also mean less incentives to upload personal data to their servers...
jjice•2mo ago
I don't know about macOS, but I've found Spotlight awesome since switching to an iPhone last year. The only issue I have is that some apps that I would really like to search don't index their data with it.
underlipton•2mo ago
I've been hoping to see something like this, as finding or rediscovering images that I've archived has been a painful process for some years now.

Still, I've come to the conclusion that search alone - especially LLM-based search - isn't enough for these applications, because of its volatility. Human spatial localization relies on object permanence, so there needs to be some amount of durability baked into at least some of the functions of any application that involves us storing and retrieving desired objects and data.

I don't know precisely what that looks like, but I do know that, for example, whenever YouTube refreshes a recommended video list, I miss the days when those lists were largely fixed for days or weeks.

>My try has been to expose multiple (if not all) attributes for a resource directly to user and then letting user recursively refine query to get to desired result.

I do really like this part, though. I'd rather photos get tagged with as many (possibly erroneous) attributes as possible, and let me carve out what I'm really looking for, rather than missing the one I wanted because the system mistook a seesaw for a teeter-totter or something.

warangal•2mo ago
Hi, Author here!

I have been working on this project for quite some time now. Even though for such search engines, basic ideas remain the same i.e extracting meta-data or semantic info, and providing an interface to query it. Lots of effort have gone into making those modules performant while keeping dependencies minimal. Current version is down to only 3 dependencies i.e numpy, markupsafe, ftfy and a python installation with no hard dependence on any version. A lot of code is written from scratch including a meta-indexing engine and minimal vector database. Being able to index any personal data from multiple devices or service without duplicating has been the main them of the project so far!

We (My friend) have already tested it on around 180gb of Pexels dataset and upto 500k of flickr 10M dataset. Machine learning models are powered by a framework completely written in Nim (which is currently not open-source) and has ONEDNN as only dependency (which has to be do away to make it run on ARM machines!)

I have been mainly looking for feedback to improve upon some rough edges, but it has been worthwhile to work upon this project and includes code written in assembly to html !

thefourthchime•2mo ago
As a serial DIYer, I respect the engineering depth here, especially the custom vector index, but I disagree on the self-hosted ML approach. The innovation in embeddings is just too fast to keep up with locally without constant refactoring. You can actually see the trade-off in the "girl drinking water" example where one result is a clear hallucination.
warangal•2mo ago
Currently (Semantic) ML model is the weakest (minorly fine-tuned) ViT B/32 variant, and more like acting as a placeholder i.e very easy to swap with a desired model. (DINO models have been pretty great, being trained on much cleaner and larger Dataset, CLIP was one of first of Image-text type models !).

For point about "girl drinking water", "girl" is the person/tagged name , "drinking water" is just re-ranking all of "girl"s photos ! (Rather than finding all photos of a (generic) girl drinking water) .

I have been more focussed on making indexing pipeline more peformant by reducing copies, speeding up bottleneck portions by writing in Nim. Fusion of semantic features with meta-data is more interesting and challenging part, in comparison to choosing an embedding model !

love2read•2mo ago
Could this be used to make something like same.energy?