frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

My accent costs me 30 IQ points on Zoom. So we built an ML model to fix it

https://krisp.ai/blog/introducing-accent-conversion-for-the-listener/
27•artavazdsm•2h ago

Comments

artavazdsm•2h ago
Co-founder of Krisp here. 1.5B non-native English speakers in the workforce, 4x native — yet all comms infra is optimized for native accents. We spent 3 years building listener-side, on-device accent understanding. The hard parts: no parallel training data exists, the accent space is infinite, accent is entangled with voice identity, and it runs on CPU under 250ms latency. Built in Yerevan, Armenia. Beta is live and free. Happy to go deep on the ML side.
AlexeyBelov•2h ago
What do you think about the misuse potential (by scammers for example)?

Aside from that, I like that this exists now.

davitb•2m ago
This is for listener-side, not speaker-side. So no misuse case here.
astipili•1h ago
will it help the barista in Starbucks get my name right finally?
lu_mn•1h ago
Kinda wild to think accent friction is basically a tech problem. Doing this in real time on CPU sounds tough. Curious how well it holds up in messy, real calls.
snek26•1h ago
Curious whether wav2vec-style embeddings played a role in your representation learning.
sohanyan•1h ago
Accent space is effectively infinite. Generalization must rely on invariants rather than enumeration.
Flora_H42•1h ago
Streaming constraint under 200ms changes everything. Causal modeling in speech is brutal to get right.
bebelovejan•1h ago
I would like to use such model but only if it really preserves my voice, otherwise people would understand its not me or I have to use it all the time.
imuradyan•1h ago
On-device CPU inference is the real flex here! Optimization probably mattered as much as modeling.
sssnowgirl•1h ago
This is a game-changer! I remember each and every call I had with an investor and feeling shy asking "can you repeat?"... thanks krisp, you changed my life!!!
KarineS•1h ago
Finally Krisp built it! I will understand my users from interviews better, with no cognitive load and "could you please repeat that" phrasing.
gyumjibashyan•1h ago
How did you estimate the number of IQ points?
arshakarap•1h ago
This is built for international, privacy-first teams!
armsuro•1h ago
This feels adjacent to voice conversion research, but with stricter latency constraints.
amartiro•1h ago
The parallel data is a problem here — you can’t crowdsource ground truth because no one can record themselves with a different accent.
CyberSec86888•1h ago
This tackles a massive yet often overlooked gap in global communication.

A majority of professionals around the world operate in English as a second language, yet most voice technology has historically been designed with native speech patterns in mind. That imbalance creates subtle barriers in everyday conversations, from team syncs to high-stakes business calls.

Building real-time, on-device accent adaptation, without clean paired datasets, across countless speech variations, while separating pronunciation patterns from speaker identity and keeping latency ultra-low on CPU, is an extraordinary technical achievement.

Deep respect for taking on something this fundamental to inclusion and clarity in the modern workplace.

aharutyunyan•1h ago
Accent space is effectively infinite. Generalization must rely on invariants rather than enumeration.
nareksardaryann•1h ago
Great work. Natural + clear is the combo that matters.
Hripsimeh•1h ago
This is a huge game changer !
rasjonell•1h ago
Latency can destroy conversational rhythm. What’s your p95 inference time? also are there any benchmarks we can see?
armb21•1h ago
This works weirdly well — I’m honestly amazed by how good and fast it is!
Tatevik_H•1h ago
Streaming constraint under 200ms changes everything. Causal modeling in speech is brutal to get right.
aris_hovsepyan•52m ago
The real achievement here isn't just quality, it's doing it streaming with tight latency on CPU while preserving speaker identity. Most VC-style work looks great offline, then falls apart once you go real-time. Nice work getting this to hold up in streaming.
tritont•36m ago
Nice to finally see this direction of accent conversion (that is on incoming calls) in the Krisp app. This is a very meaningful feature.

Intel's make-or-break 18A process node debuts for data center with 288-core Xeon

https://www.tomshardware.com/pc-components/cpus/intels-make-or-break-18a-process-node-debuts-for-...
1•vanburen•2m ago•0 comments

Silent Backwards Compatibility Breaking Changes in PyTorch

https://blog.ezyang.com/2026/03/silent-bc-breaking-changes/
1•matt_d•5m ago•0 comments

Hacked traffic cameras & US Intel: How plot to kill Iran's leader came together

https://www.cnn.com/2026/03/03/middleeast/us-israel-plot-kill-iran-khamenei-latam-intl
1•CGMthrowaway•5m ago•0 comments

Claude Code escapes its own denylist and sandbox

https://ona.com/stories/how-claude-code-escapes-its-own-denylist-and-sandbox
1•tomvault•6m ago•1 comments

I Built a Spy Satellite Simulator in a Browser. Here's What I Learned

https://www.spatialintelligence.ai/p/i-built-a-spy-satellite-simulator
1•CGMthrowaway•7m ago•0 comments

LotusQ Cross platform voice dictation with free local Whisper(Mac/Windows/Linux)

1•nkodev•8m ago•1 comments

The gap between ICP documents and buyer understanding in B2B sales

https://artemisgtm.ai/blog/why-most-b2b-companies-get-icp-wrong
1•thegtmauditguy•9m ago•1 comments

Academics Need to Wake Up on AI

https://alexanderkustov.substack.com/p/academics-need-to-wake-up-on-ai
1•verdverm•9m ago•0 comments

Qwen Tech Lead Steps Down

https://twitter.com/JustinLin610/status/2028865835373359513
1•informal007•9m ago•0 comments

Fire the CEO, Introducing the AxO's

https://boringops.sh/articles/fire_the_ceo/
1•boringops-dan•9m ago•0 comments

Mpv Is the MVP of Video and Image Viewing

https://nickjanetakis.com/blog/mpv-is-the-mvp-of-video-and-image-viewing
1•nickjj•10m ago•0 comments

Deprecate confusing APIs like "os.path.commonprefix()"

https://sethmlarson.dev/deprecate-confusing-apis-like-os-path-commonprefix
1•todsacerdoti•10m ago•0 comments

Ask HN: Using AI at work is stupidity, or a good tool if used properly?

1•MrLey•15m ago•0 comments

How HN: DocAPI – HTTP 402 as designed: agents register, pay USDC, run forever

https://www.docapi.co
1•siwandev•17m ago•1 comments

Why exe.dev VMs are persistent

https://blog.exe.dev/persistent
2•tosh•17m ago•0 comments

Gram 1.0 Released

https://gram.liten.app/posts/first-release/
1•birdculture•19m ago•0 comments

OpenAI releases GPT-5.3 Instant update to make ChatGPT less 'cringe'

https://9to5mac.com/2026/03/03/openai-releases-gpt-5-3-instant-update-to-make-chatgpt-less-cringe/
1•HiroProtagonist•20m ago•0 comments

Beatport and Beatsource to Unite into One Premium DJ Platform

https://www.beatportal.com/articles/1291036-beatport-and-beatsource-to-unite-into-one-premium-dj-...
1•DocFeind•21m ago•0 comments

Identity Formation and the Politics of Belonging: Bengali Migrants in Kerala [pdf]

https://www.aijfr.com/papers/2025/5/1400.pdf
1•thunderbong•21m ago•0 comments

Ask HN: What are your go to sources for relatively unbiased global news?

1•Jimmc414•21m ago•0 comments

Show HN: Voquill, an open source and cross-platform alternative to wisprflow

https://github.com/josiahsrc/voquill
1•josiahsrc•22m ago•0 comments

The unfortunate need for an "age verification" API for legal compliance

https://lists.ubuntu.com/archives/ubuntu-devel/2026-March/043510.html
2•turrini•22m ago•1 comments

OpenclawwOpenClaw Partners with VirusTotal for Skill Security

https://openclaw.ai/blog/virustotal-partnership
1•breitkreutz•23m ago•0 comments

Blocking a brain receptor may calm blood pressure signals

https://medicalxpress.com/news/2026-02-clue-hypertension-blocking-brain-receptor.html
2•PaulHoule•25m ago•0 comments

Show HN: Mozilla.ai introduces Clawbolt, an AI Assistant for the trades

https://github.com/mozilla-ai/clawbolt
7•river_otter•25m ago•0 comments

Claude and Pentagon whole fight timeline

https://www.youtube.com/watch?v=Ph8CrTNlWbM
2•ashutosh0707•26m ago•0 comments

New tool for designing software architecture diagrams and presentations

https://savnet.co/networks/designer
1•oscarricardosan•26m ago•0 comments

Section 230 is the best protection we have from Trump's censorship

https://www.ms.now/opinion/section-230-trump-free-speech
1•01-_-•26m ago•0 comments

Cofounder search: An internet-native way to do ML and bio research

https://labless.bio
1•jeremykalfus•27m ago•1 comments

The Making of the Atomic Bomb book predicted the AI crisis before it happened

https://blog.adafruit.com/2026/03/03/the-making-of-the-atomic-bomb-1986-by-richard-rhodes/
1•ptorrone•27m ago•0 comments