GitHub: https://github.com/loffloff/gemini-live-react
What it does: - Stream mic audio to Gemini, get voice responses back - Share your screen so AI can see what you're doing - Real-time transcripts for both sides - Tool calling support
Just added: useCapturedSurfaceControl hook for Chrome 124+ that lets you programmatically scroll/zoom a captured tab without injecting scripts.
Built this for deflectionrate.com (AI support that resolves issues before they become tickets), extracted the core into this package.
npm install gemini-live-react
Happy to answer questions.
Want me to tweak anything?