Show HN: Built a conversational language learning app with Expo/WebGL/LLMs

https://convolive.com/

1•benjah•3h ago

Could I get some opinions on my experiential learning language app?

When I learned Japanese with new friends, I didn't know how to ask where we were going or where the bathroom was. It was really motivating. As in-country immersion isn't option in most cases, language exchange is next best option. However, in-person language exchange can be expensive or difficult to schedule. So I built ConvoLive to make language practice more engaging for myself.

Open Beta on Android and iOS https://convolive.com

Technically interesting bits:

  - Lip-synced avatars using WebGL.
  - Continuous speech recognition on recent phone models so you don't have to press to speak
  - Using device speech recognition and caching assets means that most external LLM calls are limited to the free-form chat mode.
  - Unfortunately, on-device local LLM ended up being too demanding and slow.
  - GPT 4o provided better speed and results than GPT 5.
  - Multimodal quiz system with drag-and-drop, fill-in-the-blank, and multi-choice exercises
  - Freeform conversation gives you suggestions to keep the chat going, but you can also ask how to say something.

What works:

  - People seem to love or hate the avatars.
  - Testers are saying they feel more comfortable speaking.
  - The avatars aren't groundbreaking but they do make it feel less like talking to a chatbot.
  - Being able to speak freely without clicking seems more natural.

What doesn't:

  - People seem to love or hate the avatars.
  - As an app that promotes speaking out loud, people might be wary of speaking as a beginner outside their home.
  - The app might be better suited for people who already understand fundamentals like tenses or different alphabets which don't fit into a per conversation approach.
  - Newer better quality TTS models do not provide the viseme lip sync data I need for animation.

Currently supports Spanish, Japanese, Italian, German, Portuguese, and French.

Curious what other language learners here think – is this approach useful or do we not want to talk to our phones in broken Spanish? Should I keep working on this?

Comments

pickettd•31m ago

Was the on-device local LLM stack that you tried llama.cpp or something like MLC? I've seen better performance with MLC than llama.cpp in the past - but it has been probably at a least a year since I tested iphones and androids for local inference

Hacking with AI SASTs: An Overview of 'AI Security Engineers'

Codemaps: Understand Code, Before You Vibe It

Floating Man

Avicenna

The Bombing of Gaza Sky Geeks

Show HN: Just launched AlterBase. Find cheap alternatives to expensive tools

Michael Burry a.k.a. "Big Short",discloses $1.1B bet against Nvidia&Palantir

Show HN: Curated Domain Name Marketplace

SocketAddrV6 is not roundtrip serializable

Halfway to Hell

The Inequity of Consumption-Based Tax Systems

Windsurf Codemaps: Understand Code, Before You Vibe It

Project Suncatcher, research moonshot to scale machine learning compute in space

Sentry – No Marketing Mode

Humans and neural networks show similar patterns of transfer and interference

72% of game devs believe Steam has a monopoly on PC games

Towards a future space-based, highly scalable AI infrastructure system design [pdf]

Mutiny on the Bounty

The Death of Traditional QA (Or: "AI Everywhere " Reaches SQA)

Comet 3I/Atlas Perihelion Update

Pi hosted interactive portfolio website

The AI Village Where Top Chatbots Collaborate–and Compete

Discovering a New Quantum Algorithm

'History won't forgive us' if UK falls behind in QC race, says Tony Blair

Show HN: Talking Moose Is Back Baby Mack the Moose

D-Wave Advantage2 Now Available for U.S. Gov. Apps at Davidson Technologies

Dashbrd: Simple Bootstrap 5 dashboard template

DHS proposes biometrics expansion for immigrants, dropping age restrictions

The Dumpster Dive Principle

Show HN: Pion/rtwatch – Watch video in sync with friends, pause/seek on back end