In other words... after all this vibe coding could you identify the model strictly off vibes?
If yes, how long would it take you to be confident? And what constraints would you need for the test to be meaningful (i.e. familiar codebase vs greenfield, real bugs vs toy tasks, time-boxed, language/framework, etc)?