frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Ava – AI Voice Agent for Traditional Phone Systems(Python+Asterisk/ARI)

https://github.com/hkjarral/AVA-AI-Voice-Agent-for-Asterisk
4•hkjarral•1h ago
Hi HN, I'm the creator of AVA - AI Voice Agent for Asterisk

My repo was shared here once before by someone else so I wanted to follow up with the progress since then.

https://news.ycombinator.com/item?id=46380399

I've been working with Asterisk/FreePBX systems for years. I wanted to add AI voice capabilities to legacy phone systems without paying per-minute SaaS fees or ripping out the entire telephony stack.

So I built AVA, a self-hosted AI voice agent that can integrate into any traditional phone system. While most solutions demand expensive migrations to cloud-only providers, AVA provides a self-hosted path to connect AI agents to existing phone systems while ensuring data privacy and lowering operational costs

AVA is a Dockerized Python app that sits alongside your Asterisk server. It connects via ARI (Asterisk REST Interface) and routes call audio to AI providers — OpenAI Realtime, Deepgram, Google Live API, ElevenLabs, Telnyx, or fully local models (Vosk + llama.cpp + Piper). You can mix and match STT/LLM/TTS in a modular pipeline, or use a single provider end-to-end.

Two audio transport paths: We support both AudioSocket (low-latency TCP with TLV framing) and ExternalMedia RTP (UDP, better for NAT). A transport orchestrator auto-negotiates sample rates and codecs between what Asterisk sends on the wire and what each AI provider expects — so you can run 8kHz ulaw from Asterisk into a provider that wants 24kHz linear16 without manual config.

Session lifecycle: A typed session store tracks every call from StasisStart through hangup — audio diagnostics, barge-in counts, provider state, conversation turns. Every call is fully observable and debuggable after the fact.

Barge-in and VAD were the hardest problems. We use a dual-mode VAD — WebRTC VAD combined with energy-based RMS detection, scored into a single confidence value (40% WebRTC weight, 40% energy ratio, 20% agreement bonus). Frame smoothing prevents single-frame glitches from triggering false interrupts. When barge-in fires, we kill active playback (both streaming and file-based) via ARI, flush provider audio buffers, release conversation gating tokens, and optionally suppress provider output for a configurable window to prevent pre-barge audio from re-queuing. The system supports three interrupt sources: local VAD, Asterisk's native talk detection events, and provider-side interruption signals.

The hardest latency challenge was bridging legacy SIP/RTP with modern WebSocket streams. We use a two-container architecture: a lightweight orchestrator for ARI state management and an optional heavier container for local model inference. There are 6 pre-validated golden baseline configs if you just want something working out of the box, plus an Admin UI for visual setup.

Try the live demo: (925)-736-6718 Option 5 for Google, 6 for Deepgram, 7 for Openai realtime, 8 for Local hybrid and 9 for Elevenlabs

Code is MIT. I'd love feedback on the transport layer (src/core/transport_orchestrator.py) and the VAD tuning (src/core/vad_manager.py).

Bringing Chrome to ARM64 Linux Devices

https://blog.chromium.org/2026/03/bringing-chrome-to-arm64-linux-devices.html
1•ingve•2m ago•0 comments

MCP tools for AI-native ontology engineering (Rust and Oxigraph)

https://github.com/fabio-rovai/open-ontologies
1•fabio_rovai•2m ago•1 comments

The Generational Prisoner's Dilemma: Three Certain Truths

https://twitter.com/dgt10011/status/2031757205888762205
1•washedup•2m ago•0 comments

The Official GBBS Pro Repository

https://gbbs.applearchives.com/
1•js2•2m ago•0 comments

You can turn Claude's most annoying feature off

https://jdlms.site/blog/claudes-most-annoying-feature
1•tietjens•5m ago•0 comments

Agentic Evidence

https://actis.world
1•blazingjolt•5m ago•1 comments

Ask HN: Do you struggle analyzing large log files with AI due to token limits?

1•DrTrader•5m ago•0 comments

Sitka's excellent Gravity clock Eurorack module is now open source

https://cdm.link/gravity-open-source/
1•bryanrasmussen•7m ago•0 comments

Show HN: RestaRules – A robots.txt for how AI agents interact with restaurants

https://github.com/selfradiance/restarules
1•selfradiance•7m ago•0 comments

One More Prompt: The Dopamine Trap of Agentic Coding

https://blog.quent.in/blog/2026/03/09/one-more-prompt-the-dopamine-trap-of-agentic-coding/
1•sylvainkalache•7m ago•0 comments

RAF vs. Usaaf: The Bombing Doctrine Split That Divided the Allies

https://warcommons.com/2026/03/11/raf-usaaf-bombing-doctrine-disagreement/
1•NENCMediaGroup•11m ago•0 comments

Grok 4.20 brings minimal improvements for Grok-4.1-fast

https://aibenchy.com/compare/x-ai-grok-4-20-beta-medium/x-ai-grok-4-20-multi-agent-beta-medium/x-...
1•XCSme•12m ago•1 comments

Show HN: Slack but Usage Priced

https://loftchat.com
1•valliveeti•12m ago•0 comments

Can LLMs Be Computers?

https://www.percepta.ai/blog/can-llms-be-computers
1•linolevan•12m ago•0 comments

Spacetime Quasicrystals

https://arxiv.org/abs/2601.07769
2•amai•13m ago•0 comments

Show HN: I'm building niche AI agents with OpenClaw (Clawsify)

https://clawsifyai.com
1•Vanshfin•14m ago•0 comments

Deepak Jain to Host Two Sessions at Nvidia GTC 2026

1•AiNETFactory•15m ago•0 comments

You Can Stop Marrying for Genes

https://fi-le.net/love/
2•fi-le•16m ago•0 comments

Amazon Employees Say AI Is Just Increasing Workload

https://gizmodo.com/amazon-employees-say-ai-is-just-increasing-workload-a-new-study-confirms-thei...
2•jamesgill•18m ago•0 comments

Ask HN: Developing .NET on Windows but deploying to Linux – common issues?

1•ghostlyInc•19m ago•2 comments

Boeing's Bizarre Planes That Were Never Built

https://www.airlineratings.com/articles/boeings-bizarre-planes-that-were-never-built
1•voxadam•20m ago•0 comments

On Making

https://beej.us/blog/data/ai-making/
1•ingve•21m ago•0 comments

Zeno's Paradox resolved by physics, not by math alone

https://bigthink.com/starts-with-a-bang/zenos-paradox/
2•bryanrasmussen•22m ago•1 comments

What's That? – Photo to personalized audio narrative in under 10 seconds

https://apps.apple.com/us/app/whats-that-ai-audio-guide/id6756409506
1•whatsthatapp•23m ago•1 comments

The Dopamine Trap of Vibe Coding

https://codn.dev/blog/the-dopamine-trap-of-vibe-coding/
3•romanhoffmann•26m ago•5 comments

MCPs, CLIs, and skills: when to use what?

https://jngiam.bearblog.dev/mcps-clis-and-skills-when-to-use-what/
6•jngiam1•26m ago•0 comments

A Snapshotable WASM Interpreter

https://github.com/friendlymatthew/gabagool
2•birdculture•27m ago•0 comments

Looking for Partner to Build Agent Memory (Zig/Erlang)

2•kendallgclark•28m ago•0 comments

Show HN: Codex Symphony – bootstrap OpenAI Symphony and Linear in any repo

https://github.com/Citedy/codex-symphony
1•ntty•29m ago•0 comments

How to use Claude Cowork – Complete guide

https://overtoncollective.com/blog/claude-cowork-complete-guide
2•Ali_Jiwani•30m ago•0 comments