Hi HN,
We’re the Dograh team (YC alumni). While building voice bots, we found that wiring WebRTC/telephony + STT + LLM + TTS took more time than the bots themselves. Teams spend weeks on plumbing: handling call flows, extracting variables, dealing with telephony edge cases, and redeploying for small changes. Tools like Vapi/Retell are easy to start with but come with lock-in and platform fees. So we built Dograh: a 100% open-source platform that handles the full stack, with a visual workflow builder and self-hosting by default.
Dograh v1.20 introduces two major additions:

1. Gemini 3.1 Live support: run fully real-time voice agents using Gemini’s streaming APIs, without stitching together separate STT + LLM + TTS components.

2. Pre-recorded audio (hybrid voice): upload real voice clips (greetings, confirmations, etc.), and the agent plays them instantly, using TTS only for dynamic responses. This reduces latency, improves naturalness, and cuts TTS costs.
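For anyone curious how the hybrid-voice idea works, here is a minimal sketch of the routing logic: fixed phrases play from pre-recorded clips, and everything else falls back to TTS. All names here (the clip table, the function) are illustrative, not Dograh’s actual API.

```python
# Hypothetical sketch of hybrid voice routing: known phrases map to
# pre-recorded clips; dynamic responses go to the TTS engine.
PRERECORDED = {
    "greeting": "clips/greeting.wav",
    "confirmation": "clips/confirmation.wav",
}

def route_utterance(kind: str, text: str) -> tuple[str, str]:
    """Return (source, payload): a clip path for known phrase kinds,
    or the raw text to hand to the TTS engine otherwise."""
    if kind in PRERECORDED:
        # Instant playback from disk: no synthesis latency, no TTS cost.
        return ("clip", PRERECORDED[kind])
    # Dynamic content (names, amounts, answers) still needs synthesis.
    return ("tts", text)
```

The win comes from the fact that a large share of call audio (openers, confirmations, hold messages) is identical across calls, so it can be recorded once with a real voice.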
It also includes:
- Plug-and-play LLM / STT / TTS (including self-hosted models)
- Telephony integrations (Twilio, Vonage, Telnyx), plus call transfer
- Post-call QA, transcripts, and variable extraction
- Observability via Langfuse (OpenTelemetry traces + prompt playground)
Try it now: if you have Docker, the command below gets you a 2-minute setup (no API keys needed out of the box).
https://gist.github.com/a6kme/072252bf885270787bbb8376687c67... [sorry, HN won't let me post the entire command]
Looking ahead: we’re expanding self-hosted model support. You can already bring any LLM (e.g. Llama, Qwen), STT (e.g. Voxtral), or TTS (e.g. Kokoro) by pointing Dograh at their API endpoints. We’re working on updates that will let anyone run everything on a single server: your AI models alongside Dograh orchestration.
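As an illustration of what "configuring API endpoints" looks like in practice, here is a hedged sketch of wiring a self-hosted, OpenAI-compatible model server. The variable names below are hypothetical placeholders, not Dograh’s actual settings; check the repo docs for the real keys.

```shell
# Hypothetical config sketch: point the orchestrator at self-hosted model servers.
# e.g. vLLM serving Llama/Qwen behind an OpenAI-compatible endpoint:
export LLM_BASE_URL="http://localhost:8000/v1"
export LLM_MODEL="meta-llama/Llama-3.1-8B-Instruct"
# e.g. a local Kokoro TTS server:
export TTS_BASE_URL="http://localhost:8880/v1"
```

The design choice here (standardizing on OpenAI-compatible HTTP endpoints) is what makes models swappable without code changes: any server speaking that protocol can slot in.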
Looking forward to hearing the community’s thoughts.