Vilberta is a speech-to-speech/text chatbot built on a pipeline that combines a voice activity detector (VAD), speech recognition (ASR), a large language model (LLM), tool calling (via MCP), and text-to-speech (TTS). This is a standard pipeline. Initially, I was going for a more novel approach where a multimodal LLM handled both speech recognition and the LLM role. But I found that the multimodal LLM struggled with certain capabilities like tool calling. So I ended up with VAD+ASR+LLM+TTS, where I can configure the model for each stage.
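To make the stage layout concrete, here is a minimal sketch of a VAD+ASR+LLM+TTS turn in Python. It is not Vilberta's actual code: the `Pipeline` class, the `run_turn` method, and the stub stages are all hypothetical, and tool calling is left out for brevity. The point is only that each stage is a swappable callable.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Pipeline:
    """Hypothetical container: one pluggable callable per stage."""
    vad: Callable[[bytes], bool]   # True if the audio chunk contains speech
    asr: Callable[[bytes], str]    # audio -> transcript
    llm: Callable[[str], str]      # transcript -> reply text
    tts: Callable[[str], bytes]    # reply text -> audio

    def run_turn(self, chunks: List[bytes]) -> bytes:
        # Keep only the chunks the VAD flags as speech.
        speech = b"".join(c for c in chunks if self.vad(c))
        if not speech:
            return b""  # nothing was said, so nothing to answer
        transcript = self.asr(speech)
        reply = self.llm(transcript)
        return self.tts(reply)


# Stub stages standing in for real models (assumptions for illustration).
pipeline = Pipeline(
    vad=lambda chunk: any(chunk),              # any non-zero byte counts as speech
    asr=lambda audio: audio.decode("utf-8"),   # pretend the audio is already text
    llm=lambda text: f"You said: {text}",
    tts=lambda text: text.encode("utf-8"),
)
```

Swapping a model then means replacing one callable, for example pointing `asr` at a different speech recognition backend while leaving the rest untouched.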

No echo cancellation, so you will need a headset.

"Not everything needs to be spoken" - this is the idea I wanted to capture with this project. It generates text for TTS (usually short) and displays relevant information on the screen (longer sections).
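One way the spoken/displayed split could work is to have the LLM tag each part of its reply and route the parts separately. The sketch below assumes hypothetical `<speak>` and `<display>` tags and a hypothetical `split_reply` helper; the project may use a different format entirely.

```python
import re
from typing import Tuple


def split_reply(reply: str) -> Tuple[str, str]:
    """Split a tagged LLM reply into (text for TTS, text for the screen).

    The <speak>/<display> tag names are an assumption for illustration.
    """
    spoken_parts = re.findall(r"<speak>(.*?)</speak>", reply, re.DOTALL)
    shown_parts = re.findall(r"<display>(.*?)</display>", reply, re.DOTALL)
    spoken = " ".join(part.strip() for part in spoken_parts)
    shown = "\n".join(part.strip() for part in shown_parts)
    return spoken, shown
```

With this shape, a short sentence goes to TTS while a longer list or table is only rendered on screen.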

A few things I learned on the journey:

- Collecting jargon from previous chats and passing it to the LLM-based ASR system greatly improves its ability to handle that jargon during transcription.

- OpenRouter is awesome!

- End-to-end speech-to-speech systems aren't all that great once tool calling is involved. For any serious use case, tool calling will be involved, so it has to go through speech -> text, text processing, text -> speech anyhow.

- Once you are serious about a project, Claude Code will consume the weekly quota rather quickly. I ended up with opencode + kimi 2.5. 90% of the code is done by chatbots.
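The jargon point from the list above can be sketched roughly as follows. This is a guess at the general idea, not the project's implementation: `collect_jargon`, `build_asr_prompt`, and the heuristic for what counts as jargon (acronyms, mixed-case names, tokens containing digits) are all assumptions.

```python
import re
from collections import Counter
from typing import Iterable, List


def collect_jargon(chats: Iterable[str], top_n: int = 20) -> List[str]:
    """Return the most frequent jargon-looking tokens from past chats."""
    counts: Counter = Counter()
    for text in chats:
        for tok in re.findall(r"[A-Za-z][A-Za-z0-9\-]+", text):
            is_acronym = tok.isupper()                       # e.g. MCP, VAD
            is_mixed_case = bool(re.search(r"[a-z][A-Z]", tok))  # e.g. OpenRouter
            has_digit = bool(re.search(r"\d", tok))          # e.g. k2
            if is_acronym or is_mixed_case or has_digit:
                counts[tok] += 1
    return [tok for tok, _ in counts.most_common(top_n)]


def build_asr_prompt(jargon: List[str]) -> str:
    """Build a hint to prepend to the prompt of an LLM-based ASR model."""
    if not jargon:
        return "Transcribe the audio."
    return "Transcribe the audio. Terms that may occur: " + ", ".join(jargon)
```

The resulting prompt string would be sent along with the audio, so the transcriber sees the likely spellings before it hears them.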

Usable, tested, vibe-coded PRs are welcome!

Show HN: Vilberta: speech to speech/text chatbot