Segment anything however was able to segment all 5 dog legs when prompted to. Which means that meta is doing something else under the hood here, and may lend itself to a very powerful future LLM.
Right now some of the biggest complaints people have with LLMs stems from their incompetence processing visual data. Maybe meta is onto something here.
I would recommend bounding boxes.
trevorhlynn•1h ago
https://news.ycombinator.com/item?id=45982073