Text inputs are too slow for complex prompting if you're vibe coding or generating media. I built a full-stack Voice Mode component (UI + logic + transcription) for React/Next.js. It handles the awkward browser audio stuff so you don't have to.
Also used Gemini 3 to generate that entire page in one prompt. :-)
andupotorac•2mo ago
Here are some examples of different tools, compared to Same.dev that also captures screenshots of pages - and all the others.
https://x.com/andupoto/status/1992928743925690382