frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Voicetest – open-source test harness for voice AI agents

3•pldpld•1h ago
We've been building voice agents across Retell, VAPI, LiveKit, and Bland, and the testing story is... rough. Every platform has its own config format, there's no shared way to define what "correct" looks like, and most teams end up doing manual QA by literally calling their agent and listening. So we built voicetest.

voicetest is an open source (Apache 2.0) test harness that works across voice AI platforms. You import your agent graph from any supported platform (or define one from scratch), write test scenarios with expected behaviors, and voicetest simulates conversations and evaluates them with LLM judges that score each turn 0.0-1.0 with written reasoning. It also ships global compliance evaluators for things like HIPAA, PCI-DSS, and brand voice consistency. The core abstraction is an AgentGraph IR that normalizes across platform formats, so you can convert between Retell, VAPI, LiveKit, and Bland configs and test them all the same way.

Quick start:

``` uv tool install voicetest voicetest demo --serve ```

That gives you a web UI at localhost with a sample agent, test cases, and evaluation results you can poke at. There's also a CLI, a TUI, and a REST API. It integrates into CI/CD with GitHub Actions, uses DuckDB for persistence, and includes a Docker Compose dev environment with LiveKit, Whisper STT, and Kokoro TTS. If you have a Claude Code subscription, voicetest can pass through to it instead of requiring separate API keys for evaluation.

GitHub: https://github.com/voicetestdev/voicetest Docs: https://voicetest.dev API reference: https://voicetest.dev/api/

Instruction decoding in the Intel 8087 floating-point chip

https://www.righto.com/2026/02/8087-instruction-decoding.html
1•ibobev•38s ago•0 comments

Memories: Doing my PhD at Stanford, under John L Hennessy

https://lawrencecpaulson.github.io//2026/02/13/John_Hennessy.html
1•ibobev•54s ago•0 comments

The great computer science exodus (and where students are going instead)

https://techcrunch.com/2026/02/15/the-great-computer-science-exodus-and-where-students-are-going-...
1•sylvainkalache•1m ago•0 comments

Dimensional novel Validation protocol DAM Elara project

https://github.com/navigatorbuilds/elara-protocol
1•NenadVasic•1m ago•0 comments

TinyFish Accelerator: 9 Weeks Virtual Agent Accelerator, $2M Seed Pool

https://www.tinyfish.ai/accelerator
2•gargi_tinyfish•2m ago•1 comments

The One Woman Anthropic Trusts to Teach AI Morals

https://www.wsj.com/tech/ai/anthropic-amanda-askell-philosopher-ai-3c031883
1•judah•2m ago•0 comments

Heydawy DNS Changer v1 x64

https://github.com/davoo233/HeyDawy-DNS-Changer
1•HeyDawy•3m ago•0 comments

PlaceboBench: An LLM hallucination benchmark for pharma

https://www.blueguardrails.com/en
1•mathis-l•3m ago•0 comments

Amplified.dev: Developers amplified, not automated

https://amplified.dev
1•tydunn•3m ago•0 comments

Feed the AI Beast

https://brettcvz.com/posts/171-feed-the-ai-beast
1•brettcvz•3m ago•0 comments

Gentoo on Codeberg

https://www.gentoo.org/news/2026/02/16/codeberg.html
2•todsacerdoti•4m ago•0 comments

KDE Plasma 6.6

https://kde.org/announcements/plasma/6/6.6.0/
2•jrepinc•4m ago•0 comments

Cannabis Beverage Substitution for Alcohol: A Novel Harm Reduction Strategy

https://www.tandfonline.com/doi/full/10.1080/02791072.2026.2614506
1•PaulHoule•5m ago•0 comments

Show HN: StewReads – Turn Claude chats into Kindle ebooks

https://www.stewreads.com/help/mcp
1•rajma•6m ago•0 comments

Show HN: Agent Readiness Score – A real AI agent to test your website

https://trypillar.com/tools/agent-score
3•jjmaxwell4•6m ago•0 comments

Security Hardened OpenClaw

https://github.com/redscaresu/hardened-scaleway-openclaw
1•redscaresu•8m ago•0 comments

OpenAI axes exec for "sexual discrimination" after she objected GPT erotica plan

https://nypost.com/2026/02/11/business/openai-axes-exec-for-alleged-sexual-discrimination-after-s...
4•pera•8m ago•1 comments

Avoid IaaS Lock-In with a SAML Proxy (2025)

https://mikehadlow.com/posts/2025-07-17-avoid-identity-vendor-lock-in/
1•mooreds•10m ago•0 comments

Open SSH: Post-Quantum Cryptography

https://www.openssh.org/pq.html
1•hmokiguess•12m ago•0 comments

Show HN: Website Monitoring with Telegram Alerts

https://pingwithstick.ovh/
1•EgoriiSt•13m ago•0 comments

Waiting for the AI J-Curve

https://www.apolloacademy.com/waiting-for-the-ai-j-curve/
1•akyuu•14m ago•0 comments

Ask HN: How do you motivate your humans to stop AI-washing their emails?

3•causal•15m ago•1 comments

Show HN: Self-Hosted Task Scheduling System (Back End and UI and Python SDK)

https://github.com/Ghiles1010/Cratos-UI
2•rilesthefirst•15m ago•0 comments

Hybrid Search in PostgreSQL: The Missing Manual

https://www.paradedb.com/blog/hybrid-search-in-postgresql-the-missing-manual
1•jamesgresql•15m ago•1 comments

Grand Time: Time-Based Models in Decentralized Trust

2•AGsist•15m ago•0 comments

Show HN: I Forked Moltbook to Build a Hybrid Social Network (Humans and AI)

https://theeno-nine.vercel.app
1•shahidbilal6535•16m ago•0 comments

Retrotech YouTuber Sam Battle "Lookmumnocomputer" to Represent UK in Eurovision

https://www.theguardian.com/tv-and-radio/2026/feb/17/look-mum-no-computer-uk-entry-eurovision-2026
1•fortran77•16m ago•0 comments

WolfSSL Doesn't Suck

https://blog.feld.me/posts/2026/02/wolfssl-doesnt-suck/
2•thomasjb•16m ago•0 comments

Show HN: Continue – Source-controlled AI checks, enforceable in CI

https://docs.continue.dev
3•sestinj•17m ago•0 comments

Chess engines do weird stuff

https://girl.surgery/chess
4•admiringly•17m ago•0 comments