
Gemini Live offers real-time bidirectional voice AI, but using it in the browser is rough:

- 16kHz in, 24kHz out, browser wants 44.1/48kHz
- PCM16 endianness issues
- buffering vs latency tradeoffs
- playback gaps when chunks arrive mid-stream
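To make the sample-rate and endianness points concrete, here is a minimal sketch of the conversions involved. It is an illustration under stated assumptions, not the code in gemini-live-react: the function names are hypothetical, the audio is assumed to be little-endian mono PCM16, and the resampler is plain linear interpolation.

```typescript
// Hypothetical helpers for illustration; not the library's actual API.

// Decode little-endian PCM16 bytes into Float32 samples in [-1, 1).
function pcm16ToFloat32(bytes: Uint8Array): Float32Array {
  const view = new DataView(bytes.buffer, bytes.byteOffset, bytes.byteLength);
  const sampleCount = Math.floor(bytes.byteLength / 2);
  const out = new Float32Array(sampleCount);
  for (let i = 0; i < sampleCount; i++) {
    // Passing `true` forces little-endian reads regardless of host byte order.
    out[i] = view.getInt16(i * 2, true) / 32768;
  }
  return out;
}

// Encode Float32 samples as little-endian PCM16 bytes, clamping to [-1, 1].
function float32ToPcm16(samples: Float32Array): Uint8Array {
  const out = new Uint8Array(samples.length * 2);
  const view = new DataView(out.buffer);
  for (let i = 0; i < samples.length; i++) {
    const clamped = Math.max(-1, Math.min(1, samples[i]));
    const scaled = clamped < 0 ? clamped * 32768 : clamped * 32767;
    view.setInt16(i * 2, Math.round(scaled), true);
  }
  return out;
}

// Resample by linear interpolation, e.g. 24000 -> 48000 for playback.
function resampleLinear(
  input: Float32Array,
  fromRate: number,
  toRate: number
): Float32Array {
  if (fromRate === toRate || input.length === 0) return input.slice();
  const outLength = Math.max(1, Math.round((input.length * toRate) / fromRate));
  const out = new Float32Array(outLength);
  const step = fromRate / toRate;
  for (let i = 0; i < outLength; i++) {
    const pos = i * step;
    const left = Math.min(Math.floor(pos), input.length - 1);
    const right = Math.min(left + 1, input.length - 1);
    const frac = pos - left;
    out[i] = input[left] * (1 - frac) + input[right] * frac;
  }
  return out;
}
```

Linear interpolation is tolerable for upsampling 24kHz output to 48kHz. Downsampling microphone input from 48kHz to 16kHz this way, without a low-pass filter first, can alias, so a real pipeline would likely filter before decimating.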

I built gemini-live-react, a React hook that fixes the audio DX and adds features I needed to build real AI agents:

Session recording – record transcripts, audio metadata, tool calls, browser actions, and DOM snapshots into a single JSON for debugging/replay

Workflow builder – define multi-step browser automations as a simple state machine (branching + error handling)
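For anyone weighing in on the abstraction, this is roughly the shape of state machine meant here. It is a generic sketch, not the workflow builder's real API: `Step`, `Workflow`, `runWorkflow`, and the demo step names are all hypothetical. Each step returns the name of the next step (branching) or `null` to finish, and may name a fallback step to jump to if it throws (error handling).

```typescript
// Hypothetical types for illustration; not the library's actual API.
interface Step<C> {
  // Returns the next step's name, or null to end the workflow.
  run: (ctx: C) => Promise<string | null> | string | null;
  // Optional step to jump to if `run` throws.
  onError?: string;
}

interface Workflow<C> {
  start: string;
  steps: Record<string, Step<C>>;
}

// Runs steps until one returns null; returns the names of the steps visited.
async function runWorkflow<C>(
  wf: Workflow<C>,
  ctx: C,
  maxSteps = 100
): Promise<string[]> {
  const visited: string[] = [];
  let current: string | null = wf.start;
  while (current !== null) {
    if (visited.length >= maxSteps) {
      throw new Error(`Workflow exceeded ${maxSteps} steps`);
    }
    const step: Step<C> | undefined = wf.steps[current];
    if (!step) throw new Error(`Unknown step: ${current}`);
    visited.push(current);
    try {
      current = await step.run(ctx);
    } catch (err) {
      // No fallback declared: surface the error to the caller.
      if (step.onError === undefined) throw err;
      current = step.onError;
    }
  }
  return visited;
}

// Demo: branch on session state, recover when a step fails.
interface DemoContext {
  loggedIn: boolean;
  log: string[];
}

const loginFlow: Workflow<DemoContext> = {
  start: "checkSession",
  steps: {
    checkSession: { run: (ctx) => (ctx.loggedIn ? "done" : "submitForm") },
    submitForm: {
      run: () => {
        throw new Error("submit button not found");
      },
      onError: "reportFailure",
    },
    reportFailure: {
      run: (ctx) => {
        ctx.log.push("login failed");
        return null;
      },
    },
    done: { run: () => null },
  },
};
```

The `maxSteps` cap is there because branching makes cycles possible, and an agent loop that never terminates is an easy failure mode to hit.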

Smart element detection – auto-detect clickable elements so agents don’t rely on brittle selectors

Used for voice-driven web agents where the loop is: AI sees UI → decides → clicks/types → repeat

Tech: React hook (~2k LOC), AudioWorklet, WS proxy (Deno/Supabase), TypeScript

GitHub: https://github.com/loffloff/gemini-live-react

npm: npm install gemini-live-react

Looking for feedback on the workflow abstraction — state machines felt right, but curious what others use.
