frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Dograh – an OSS Vapi alternative to quickly build and test voice agents

https://github.com/dograh-hq/dograh
14•a6kme•1w ago
Hi HN, I have been building voice agents for sometime now. I was earlier automating parts of visa processing, and we needed real-time, multilingual voice calling.

I assumed the hard work was just wiring LiveKit/Pipecat + STT/TTS + an LLM. It wasn’t.

Even with solid OSS (Pipecat/LiveKit), we still had to do a lot of plumbing- variable extraction, tracing, testing etc and any workflow changes required constant redeploys.

We eventually realized we’d spent more time building infrastructure than building the actual agents. Everything felt custom. We hit every possible pain with Pipecat and VAPI style systems.

So we built Dograh - a fully open-source voice agent framework that includes all the boring, painful pieces by default.

What’s different:

- Pipecat-based engine, but forked - custom event model, and concurrency fixes

- One-click start template generated by an LLM Agent for a quick get start template for any use case

- Drag-and-drop visual agent builder for quick iteration (the thing we wished existed earlier)

- Variable extraction layer (name/order/date/etc.) baked into the LLM loop

- Built in Telephony integration (Twilio/ Vonage/ Vobiz/ Cloudonix)

- Multilingual support end-to-end

- Select any LLM TTS STT (add their credits, if any)

- AI-to-AI call testing: automatically stress-test an agent before shipping (still a work in progress- so patchy as of now)

- Fully Open Source

It's built and maintained by YC alumni / exit founders who got tired of rebuilding the same plumbing.

Why we open-sourced it: We kept feeling that the space was drifting toward closed SaaS abstractions (VAPI, Retell). Those are good for demos, but once you need data controls, privacy or self/offline deployment, you end up stuck. We wanted a stack where you can see every part, fork it, self-host it, and patch it as needed.

Try it:

- Repo: https://github.com/dograh-hq/dograh

This spins up a basic multilingual agent with everything pre-wired.

Who this is for:

- If you are looking for self hosting a Vapi like platform for Data Privacy etc.

- Anyone trying to build production-grade voice agents without reinventing audio plumbing.

- If you’ve tried to glue STT→LLM→TTS manually, you probably know the exact pain this is built for

Happy to answer technical questions, show the architecture, or hear how we can improve the product.

Comments

a6kme•1w ago
Earlier I was using other platforms for production voice agents. One thing that became obvious was the cost: 60–70% of our total spend was the Vapi platform fee, and only 30-40% was actual LLM/STT/TTS usage. Platform cost dominated everything. That alone pushed us toward something self-hosted.

But when we switched to OSS stacks (Pipecat, LiveKit), we realise that even with great OSS, the plumbing was still painful and necessary- no standard way to extract variables from conversations (name/date/order ID), no straightforward tracing of LLM calls, no way to run AI-to-AI test loops, and no fast workflow iteration - and every change meant another redeploy.

The infrastructure glue kept ballooning, and each time it felt like rebuilding the same system from scratch.

Dograh came out of that combination of cost pain and integration pain. Happy to dig deeper into anything.

pritesh1908•1w ago
Hey HN, sometime back someone on HN asked for an open-source alternative for Vapi or Retell and we replied there (https://news.ycombinator.com/item?id=45884165) That thread just confirmed otehrs running into the same problems we had been dealing with. Now Dograh is more mature.

We are happy to share some technical details for anyone interested. A lot of Dograh’s internal work went into extending the functionality of the pipeline by including custom Frames and Processors, creating a ReactFlow based visual agent builder and creating an Engine that can parse that Agent JSON and call conversational LLM loops with function calling. Also we enhanced the functionality by creating easier access to extracted variables, call transcripts and recordings - things that are needed in any production deployment.

One thing we are still trying to understand better: how teams handle long-running conversations while keeping context tight and cheap. Would love to hear how others have approached that.

eddywebs•19h ago
Just did a test drive, CONGRATULATIONS first of all for getting this launched. Few pointers:

1) It would be great to provide different voice personas like vapi does maybe it's there already but couldn't find the config. 2) My agent reported some lag in getting responses during the call, perhaps that's just resource issue ?

Either Way you're to a great start and I look forward for this project to grow, starred the repo on GH,I think I was the 100th one :).

Multicomp•19h ago
Thank you for sharing your hard work with the world! I get to play with these AI technologies without having to train my own model or wire up an entire composition because of precompiled systems ither have made and shared, like yours.

I hope you find product market fit and are able to do what you desire with this product. In the meantime, I am grateful that you are helping us advance towards the Star Trek Voice Computer being defictionalized!

android521•19h ago
is end to end speech model like openai real time /gemini live or open source qwen 3 omni better in terms of latency?
brihati•18h ago
Thank you so much for sharing this with the community. Starred the project and will definitely try it out within my company. More power to you!

Economics of Orbital vs. Terrestrial Data Centers

https://andrewmccalip.com/space-datacenters
41•flinner•1h ago•38 comments

Fix HDMI-CEC weirdness with a Raspberry Pi and a $7 cable

https://johnlian.net/posts/hdmi-cec/
78•jlian•1h ago•27 comments

1/4 of US-Trained Scientists Eventually Leave. Is the US Giving Away Its Edge?

https://arxiv.org/abs/2512.11146
55•bikenaga•2h ago•40 comments

“Are you the one?” is free money

https://blog.owenlacey.dev/posts/are-you-the-one-is-free-money/
83•samwho•3d ago•10 comments

Nature's many attempts to evolve a Nostr

https://newsletter.squishy.computer/p/natures-many-attempts-to-evolve-a
18•fiatjaf•4d ago•5 comments

A kernel bug froze my machine: Debugging an async-profiler deadlock

https://questdb.com/blog/async-profiler-kernel-bug/
35•bluestreak•2h ago•9 comments

Upcoming Changes to Let's Encrypt Certificates

https://community.letsencrypt.org/t/upcoming-changes-to-let-s-encrypt-certificates/243873
191•schmuckonwheels•3h ago•147 comments

Essential Semiconductor Physics [pdf]

https://nanohub.org/resources/43623/download/Essential_Semiconductor_Physics.pdf
77•akshatjiwan•2d ago•4 comments

US TikTok investors in limbo as deal set to be delayed again

https://www.bbc.com/news/articles/cp34442z25ko
145•1659447091•3d ago•83 comments

“Super secure” messaging app leaks everyone's phone number

https://ericdaigle.ca/posts/super-secure-maga-messaging-app-leaks-everyones-phone-number/
451•e_daigle•4h ago•199 comments

Umbrel – Personal Cloud

https://umbrel.com
105•oldfuture•3h ago•57 comments

Chafa: Terminal Graphics for the 21st Century

https://hpjansson.org/chafa/
79•birdculture•5h ago•14 comments

Cosmic-ray bath in a past supernova gives birth to Earth-like planets

https://www.science.org/doi/10.1126/sciadv.adx7892
73•toomuchtodo•6h ago•25 comments

Avoid UUID Version 4 Primary Keys in Postgres

https://andyatkinson.com/avoid-uuid-version-4-primary-keys
325•pil0u•13h ago•337 comments

The appropriate amount of effort is zero

https://expandingawareness.org/blog/the-appropriate-amount-of-effort-is-zero/
31•gmays•3h ago•23 comments

Carrier Landing in Top Gun for the NES

https://relaxing.run/blag/posts/top-gun-landing/
341•todsacerdoti•9h ago•139 comments

We are discontinuing the dark web report

https://support.google.com/websearch/answer/16767242?hl=en
68•satertek•8h ago•27 comments

I'm Kenyan. I don't write like ChatGPT, ChatGPT writes like me

https://marcusolang.substack.com/p/im-kenyan-i-dont-write-like-chatgpt
489•florian_s•11h ago•322 comments

It seems that OpenAI is scraping [certificate transparency] logs

https://benjojo.co.uk/u/benjojo/h/Gxy2qrCkn1Y327Y6D3
182•pavel_lishin•9h ago•94 comments

Secret Documents Show Pepsi and Walmart Colluded to Raise Food Prices

https://www.thebignewsletter.com/p/secret-documents-show-pepsi-and-walmart
49•connor11528•1h ago•9 comments

We architected an edge caching layer to eliminate cold starts

https://www.mintlify.com/blog/page-speed-improvements
26•skeptrune•7h ago•18 comments

Problems with D-Bus on the Linux desktop

https://blog.vaxry.net/articles/2025-dbusSucks
250•LorenDB•4h ago•161 comments

Ask HN: What Are You Working On? (December 2025)

384•david927•1d ago•1280 comments

Building an efficient hash table in Java

https://bluuewhale.github.io/posts/building-a-fast-and-memory-efficient-hash-table-in-java-by-bor...
71•birdculture•2d ago•7 comments

Show HN: I Ching simulator with accurate Yarrow Stalk probabilities

https://castiching.com/
50•jackzhuo•1d ago•32 comments

Show HN: A pager

https://www.udp7777.com/
75•keepamovin•1d ago•32 comments

Optery (YC W22) Hiring CISO, Release Manager, Tech Lead (Node), Full Stack Eng

https://www.optery.com/careers/
1•beyondd•11h ago

JetBlue flight averts mid-air collision with US Air Force jet

https://www.reuters.com/world/americas/jetblue-flight-averts-mid-air-collision-with-us-air-force-...
5•divbzero•35m ago•1 comments

SoundCloud has banned VPN access

https://old.reddit.com/r/SoundCloudMusic/comments/1pltd19/soundcloud_just_banned_vpn_access/
249•empressplay•20h ago•168 comments

US Tech Force

https://techforce.gov/
162•purple_ferret•6h ago•219 comments