Built a real-time voice AI agent console for a YC W25 startup assessment (Freya Voice). Focus was on production-ready implementation with minimal latency.
GitHub: https://github.com/05sanjaykumar/Freya-Voice-YC25-Assessment
Key specs:
- 133ms average latency (voice input → AI response → audio output)
- LiveKit for WebRTC streaming
- Next.js frontend + Python FastAPI backend
- Multi-stage Docker deployment
- Full observability and session management
- In-memory storage for speed (a deliberate trade-off for the assessment scope)
Tech stack:
- Frontend: Next.js, TypeScript, LiveKit client SDK
- Backend: FastAPI, LiveKit server SDK, OpenAI
- Infrastructure: Docker multi-stage builds, production configs
Design decisions I made:
- Voice-first interface (no text fallback) to match the real-world use case
- In-memory session storage (speed over persistence for the MVP)
- LiveKit over raw WebSockets (proven real-time infrastructure)
- Concurrent audio processing to hit latency targets
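The concurrency point is the main latency lever: instead of running STT → LLM → TTS strictly in sequence, synthesis can start on the first LLM chunk while the rest of the response is still streaming. A minimal `asyncio` sketch of that overlap (the `llm_stream` and `tts_chunk` stage functions are stand-ins, not the repo's actual code):

```python
import asyncio


async def llm_stream(prompt: str):
    # Stand-in for a streaming LLM call: yields tokens as they arrive.
    for token in ["Hi", " there", "!"]:
        await asyncio.sleep(0.01)  # simulated per-token network delay
        yield token


async def tts_chunk(text: str) -> bytes:
    # Stand-in for synthesizing one chunk of audio.
    await asyncio.sleep(0.01)
    return text.encode()


async def respond(prompt: str) -> list[bytes]:
    pending: list[asyncio.Task] = []
    async for token in llm_stream(prompt):
        # Kick off synthesis for each chunk immediately, without waiting
        # for the full LLM response -- the stages overlap in time.
        pending.append(asyncio.create_task(tts_chunk(token)))
    # Await in order so audio chunks come back in the right sequence.
    return [await task for task in pending]


chunks = asyncio.run(respond("hello"))
```

With serial stages, total latency is the sum of all three; with this overlap it approaches the slowest stage plus one chunk of each neighbor, which is how sub-200ms round trips become feasible.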
Built in ~1 week for the assessment. Didn't land the role (extremely competitive) but learned a ton about real-time systems and WebRTC optimization.
Happy to discuss the latency optimization techniques or design trade-offs!