How it works:
- You upload a photo.
- A local vision model running entirely in your browser captions it and picks a prominent object from the image.
- You guess the word just like Wordle.
It uses a very tiny model so it is not very smart https://huggingface.co/onnx-community/Florence-2-base-ft
codingdave•1h ago
This is probably a really clever coding exercise, but an enjoyable game it is not. Maybe share more about the code?
ud0•53m ago