frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

OpenCiv3: Open-source, cross-platform reimagining of Civilization III

https://openciv3.org/
631•klaussilveira•12h ago•187 comments

Start all of your commands with a comma

https://rhodesmill.org/brandon/2009/commands-with-comma/
18•theblazehen•2d ago•0 comments

The Waymo World Model

https://waymo.com/blog/2026/02/the-waymo-world-model-a-new-frontier-for-autonomous-driving-simula...
930•xnx•18h ago•547 comments

What Is Ruliology?

https://writings.stephenwolfram.com/2026/01/what-is-ruliology/
34•helloplanets•4d ago•26 comments

How we made geo joins 400× faster with H3 indexes

https://floedb.ai/blog/how-we-made-geo-joins-400-faster-with-h3-indexes
110•matheusalmeida•1d ago•28 comments

Unseen Footage of Atari Battlezone Arcade Cabinet Production

https://arcadeblogger.com/2026/02/02/unseen-footage-of-atari-battlezone-cabinet-production/
43•videotopia•4d ago•1 comments

Jeffrey Snover: "Welcome to the Room"

https://www.jsnover.com/blog/2026/02/01/welcome-to-the-room/
10•kaonwarb•3d ago•9 comments

Show HN: Look Ma, No Linux: Shell, App Installer, Vi, Cc on ESP32-S3 / BreezyBox

https://github.com/valdanylchuk/breezydemo
222•isitcontent•13h ago•25 comments

Monty: A minimal, secure Python interpreter written in Rust for use by AI

https://github.com/pydantic/monty
213•dmpetrov•13h ago•103 comments

Show HN: I spent 4 years building a UI design tool with only the features I use

https://vecti.com
323•vecti•15h ago•142 comments

Sheldon Brown's Bicycle Technical Info

https://www.sheldonbrown.com/
372•ostacke•19h ago•94 comments

Microsoft open-sources LiteBox, a security-focused library OS

https://github.com/microsoft/litebox
359•aktau•19h ago•181 comments

Hackers (1995) Animated Experience

https://hackers-1995.vercel.app/
478•todsacerdoti•21h ago•234 comments

Show HN: If you lose your memory, how to regain access to your computer?

https://eljojo.github.io/rememory/
275•eljojo•15h ago•164 comments

An Update on Heroku

https://www.heroku.com/blog/an-update-on-heroku/
404•lstoll•19h ago•273 comments

Dark Alley Mathematics

https://blog.szczepan.org/blog/three-points/
85•quibono•4d ago•21 comments

Delimited Continuations vs. Lwt for Threads

https://mirageos.org/blog/delimcc-vs-lwt
25•romes•4d ago•3 comments

PC Floppy Copy Protection: Vault Prolok

https://martypc.blogspot.com/2024/09/pc-floppy-copy-protection-vault-prolok.html
56•kmm•5d ago•3 comments

Vocal Guide – belt sing without killing yourself

https://jesperordrup.github.io/vocal-guide/
16•jesperordrup•3h ago•9 comments

How to effectively write quality code with AI

https://heidenstedt.org/posts/2026/how-to-effectively-write-quality-code-with-ai/
244•i5heu•16h ago•189 comments

Was Benoit Mandelbrot a hedgehog or a fox?

https://arxiv.org/abs/2602.01122
13•bikenaga•3d ago•2 comments

Introducing the Developer Knowledge API and MCP Server

https://developers.googleblog.com/introducing-the-developer-knowledge-api-and-mcp-server/
53•gfortaine•10h ago•22 comments

I spent 5 years in DevOps – Solutions engineering gave me what I was missing

https://infisical.com/blog/devops-to-solutions-engineering
141•vmatsiiako•18h ago•64 comments

Understanding Neural Network, Visually

https://visualrambling.space/neural-network/
281•surprisetalk•3d ago•37 comments

I now assume that all ads on Apple news are scams

https://kirkville.com/i-now-assume-that-all-ads-on-apple-news-are-scams/
1060•cdrnsf•22h ago•435 comments

Why I Joined OpenAI

https://www.brendangregg.com/blog/2026-02-07/why-i-joined-openai.html
133•SerCe•9h ago•118 comments

Learning from context is harder than we thought

https://hy.tencent.com/research/100025?langVersion=en
177•limoce•3d ago•96 comments

Show HN: R3forth, a ColorForth-inspired language with a tiny VM

https://github.com/phreda4/r3
70•phreda4•12h ago•14 comments

Female Asian Elephant Calf Born at the Smithsonian National Zoo

https://www.si.edu/newsdesk/releases/female-asian-elephant-calf-born-smithsonians-national-zoo-an...
28•gmays•8h ago•11 comments

FORTH? Really!?

https://rescrv.net/w/2026/02/06/associative
63•rescrv•20h ago•23 comments
Open in hackernews

How AI hears accents: An audible visualization of accent clusters

https://accent-explorer.boldvoice.com/
260•ilyausorov•3mo ago

Comments

dereknelson•3mo ago
really fun discovery clicking a dot and hearing the accent. neat visualization, lots to think about!
tmshapland•3mo ago
Fascinating! How did you decouple the speaker-specific vocal characteristics (timbre, pitch range) from the accent-defining phonetic and prosodic features in the latent space?
oscarfree•3mo ago
We didn't explicitly. Because we finetuned this model for accent classification, the later transformer layers appear to ignore non-accent vocal characteristics. I verified this for gender for example.
JakeLester•3mo ago
Thank you for sharing! the 3d visual was an interesting application of the UMAP technique.

Is there a way to subscribe to these blog posts for auto-notification?

nosrepa•3mo ago
Yeah, if only there was a protocol for that.
bheadmaster•3mo ago
It would have taken you a second more to type out "RSS", and turn a sarcastic comment into an informative one.

Obligatory xkcd: https://xkcd.com/1053/

JakeLester•3mo ago
1: me <- 1/10k

2: also thnx for the laugh

ahstilde•3mo ago
why is spanish so distributed?
ilyausorov•3mo ago
Good question! It's likely because there are lots of different accents of Spanish that are distinct from each other. Our labels only capture the native language of the speaker right now, so they're all grouped together but it's definitely on our to-do list to go deeper into the sub accents of each language family!
bikeshaving•3mo ago
Spanish is one of those languages I would love to see as a breakdown by country. I’m sure Chilean Spanish looks very different from Catalonian Spanish.
rkomorn•3mo ago
Did you mean Catalan (which is not Spanish) or Castilian Spanish?
bikeshaving•3mo ago
Yes the Spanish spoken in Spain, especially the one that’s like /ˈɡɾaθjas/ and /baɾθeˈlona/.
djmips•3mo ago
But Spanish sounds very different in Spain depending on what region of the country you are talking about.
david-gpu•3mo ago
Yeah, and not all Spaniards have a distinct pronunciation for "c" and "s". For those curious: https://en.wikipedia.org/wiki/Phonological_history_of_Spanis...
oscarfree•3mo ago
Not sure, could be the large number of Spanish dialects represented in the dataset, label noise, or something else. There may just be too much diversity in the class to fit neatly in a cluster.

Also, the training dataset is highly imbalanced and Spanish is the most common class, so the model predicts it as a sort of default when it isn't confident -- this could lead to artifacts in the reduced 3d space.

zaouiamine•3mo ago
This is a fascinating look at how AI interprets accents! It reminds me of some recent advancements in speech recognition tech, like Google's Dialect Recognition feature, which also attempts to adapt to different accents. I wonder how these models could be improved further to not just recognize but also appreciate the nuances of regional
afiodorov•3mo ago
Apparently Persian and Russian are close. Which is surprising to say the least. I know people keep getting confused about how Portuguese from Portugal and Russian sound close yet the Persian is new to me.
zehaeva•3mo ago
When I went to Portugal I was struck by how much Portuguese there does sound like Spanish with a Russian accent!
oscarfree•3mo ago
Part of this is the "dark L" sound
BalinKing•3mo ago
I’d guess that the sibilants, consonant clusters, and/or vowel reduction would play a big role.
binary132•3mo ago
I thought I was the only one who perceived an audible similarity between Portuguese and Russian.
mh-•3mo ago
I speak neither, and both also sound similar to me depending on the accents of the speakers.
djmips•3mo ago
I had that too but it was Brazillian Portuguese where I noticed it.
maleldil•3mo ago
The characteristics that make pt-PT sound similar to Russian are largely absent in pt-BR.
dgan•3mo ago
I am native Russian speaker, and work/visited Portugal. It definitely tricks me when not paying attention, its very similar sounding
CGMthrowaway•3mo ago
Idea: Farsi and Russian both have simple list of vowel sounds and no diphtongs. Making it hard/obvious when attempting to speak english, which is rife with them and many different vowel sounds
dashtiarian•3mo ago
While Persian has only two diphtongs and 6-8 vowels, Other Languages of Iran are full of them(e.g. Southern Kurdish speakers can pronounce 12+1 vowels and 11 diphtongs). I find it funny if all Iranians are speaking English with the Persian accent.
ilyausorov•3mo ago
Yeh they seem to be in the same "major" cluster, although Serbian/Croatian, Romanian, Bulgarian, Turkish, Polish and Czech are all close.

Turkish and Persian seem to be the nearest neighbors.

zman0225•3mo ago
Going mono-tonal to that of an expressive ebook increased my "American English" score from a 52% to 92%.

I'd suggest training a little less on audio books.

djmips•3mo ago
What does it mean mono-tonal and what is an expressive ebook? I assume you are not American born? I had been of the understanding that rythm was more important than the exact sounds in comprehension.
fkyoureadthedoc•3mo ago
He's saying to read it like Jeff Hays not Travis Baldree
djmips•3mo ago
Why do I get downvoted for asking an innocent question? That's weird
bikeshaving•3mo ago
The source code for this is unminified and very readable if you’re one of the rare few who has interesting latent spaces to visualize.

https://accent-explorer.boldvoice.com/script.js?v=5

ilyausorov•3mo ago
Nothing too secret in there! We anonymized everything and anyway it's just a basic Plotly plot. Feel free to check it out.
3abiton•3mo ago
Good catch. I really hate javascript so i never got into d3js, so plptly was such a life saver.
ilyausorov•3mo ago
Plotly is great! Much love.
agrnet•3mo ago
could you explain what it means for someone to “have interesting latent spaces”? curious how you’re using that metaphor here
bikeshaving•3mo ago
I don’t think I’m using it as a metaphor? To “have interesting latent spaces” just means you have access to the actual weights and biases, the artifact produced by fine-tuning/training models, or you can somehow “see” activations as you feed input through the model. This can be turned into interesting 3D visualizations and reveal “latent” connections in the data which often align with and allow us to articulate similarities in the actual phenomena which these “spaces” classify.

Not many people have the privilege of access to these artifacts, or the skill to interpret these abstract, multi-dimensional spaces. I want more of these visualizations, with more spaces which encode different modalities.

https://en.wikipedia.org/wiki/Latent_space

dcreater•3mo ago
whats the dimensionality of the latent space? How were the 3 dimensions visualized selected?
oscarfree•3mo ago
12 layers of 768-dim each. The 3 dimensions visualized are chosen by UMAP.
lynchdt•3mo ago
Irish accent appears to break it.
oscarfree•3mo ago
We are working on this - we don't have quite enough Irish speech data.
NLips•3mo ago
How do you know? Your “UK” set is liable to have some Irish accents in it. You need to break down regions more
oscar120•3mo ago
Exactly - by "data" I meant labelled Irish speech.

>>You need to break down regions more Yes we do; shipping this soon

diegolas•3mo ago
it would've been nice to be able to visualize the differences between the different accents in the spanish language, really cool tho
ilyausorov•3mo ago
Yeh, we would've loved to see that too. It's on our roadmap for sure. Same for some of the other languages with a large amount of unique accents like e.g. French, Chinese, Arabic, etc...
johnwatson11218•3mo ago
I just got a project running whereby I used python + pdfplumber to read in 1100 pdf files, most of my humble bundle collection. I extracted the text and dumped it into a 'documents' table in postgresql. Then I used sentence transformers to reduce each 1K chunk to a single 384D vector which I wrote back to the db. Then I averaged these to produce a document level embedding as a single vector.

Then I was able to apply UMAP + HDBSCAN to this dataset and it produced a 2D plot of all my books. Later I put the discovered topic back in the db and used that to compute tf-idf for my clusters from which I could pick the top 5 terms to serve as a crude cluster label.

It took about 20 to 30 hours to finish all these steps and I was very impressed with the results. I could see my cookbooks clearly separated from my programming and math books. I could drill in and see subclusters for baking, bbq, salads etc.

Currently I'm putting it into a 2 container docker compose file, base postgresql + a python container I'm working on.

kallistisoft•3mo ago
This sounds like an interesting project. Do you have any plans to publish a tutorial/journal article + source code?
mertbozkir•3mo ago
i love boldvoice
ilyausorov•3mo ago
Thanks, we love you too
ccheever•3mo ago
Very interesting
double_espresso•3mo ago
this is super cool!
AprilArcus•3mo ago
The Australian-Vietnamese continuum is well-explained by Australia being the geographically nearest region which can supply native English language teachers to English language learners in Vietnam, rather than by any intrinsic phonetic resemblance between Vietnamese and Australian English.
oscar120•3mo ago
+1
gmurphy•3mo ago
Since our own accents generally sound neutral to ourselves, I would love someone to make an accent-doubler - take the differences between two accents and expand them, so an Australian can hear what they sound like to an American, or vice-versa
jclulow•3mo ago
If we assume this model is accurate, I sound to Americans like I'm South African!
ema•3mo ago
I've found that when I'm listening to recordings of me my accent really sticks out to me in a way that's completely inaudible when listening to myself live. This happens with both English and my native German.
wkjagt•3mo ago
My own accent doesn't sound neutral to myself at all. It sounds incredibly Dutch (which makes sense, since I'm Dutch).
nmeofthestate•3mo ago
I agree. I think there are places in the world where people consider their accent to be 'neutral', but I'm pretty sure no-one from my neck of the woods would think that.
efskap•3mo ago
BERT still making headlines in 2025, you love to see it.
pinkmuffinere•3mo ago
Why do the voices all sound so similar? I'm not talking about accent, I'm talking about the pitch, timbre, and other qualities of the voice themselves. For instance, all the phrases I heard sounded like they were said by a medium-set 45 year old man. Nothing from kids, the elderly, or people with lower / higher-pitch voices. I assume this expected from the dataset for some reason, but am really curious about that reason. Did they just get many people with similar vocal qualities but wide ranges of accents?
dwohnitmok•3mo ago
From the article:

> By clicking or tapping on a point, you will hear a standardized version of the corresponding recording. The reason for voice standardization is two-fold: first, it anonymizes the speaker in the original recordings in order to protect their privacy. Second, it allows us to hear each accent projected onto a neutral voice, making it easier to hear the accent differences and ignore extraneous differences like gender, recording quality, and background noise. However, there is no free lunch: it does not perfectly preserve the source accent and introduces some audible phonetic artifacts.

> This voice standardization model is an in-house accent-preserving voice conversion model.

hencq•3mo ago
I'm kind of curious if it would be possible for it to use my own voice but decoupled from accent. I.e. could it translate a recording from my voice to a different accent but still with my voice. If so, I wonder if that makes it easier for accent training if you can hear yourself say things in a different accent.
glandium•3mo ago
That would be interesting for sure, but considering you don't hear yourself the same way someone else or a mic does, I'm not sure it would have the benefit you're expecting.
hencq•3mo ago
Haha yeah not sure how useful it would be in practice, but mostly curious.
pinkmuffinere•3mo ago
Ah thanks, missed that somehow
crazygringo•3mo ago
This is fascinating in theory, but I'm confused in practice.

When I play the different recordings, which I understand have the accent "re-applied" to a neutral voice, it's very difficult to hear any actual differences in vowels, let alone prosody. Like if I click on "French", there's something vaguely different, but it's quite... off. It certainly doesn't sound like any native French speaker I've ever heard. And after all, a huge part of accent is prosody. So I'm not sure what vocal features they're considering as "accent"?

I'm also curious what the three dimensions are supposed to represent? Obviously there's no objective answer, but if they've listened to all the samples, surely they could explain the main constrasting features each dimension seems to encode?

retrac•3mo ago
I'm deaf. Something close to standard Canadian English is my native language. Most native English speakers claim my speech is unmarked but I think they're being polite; it's slightly marked as unusual and some with a good ear can easily tell it's because of hearing loss.

Using the accent guesser, I have a Swedish accent. Danish and Australian English follow as a close tie.

It's not just the AI. Non-native speakers of English often think I have a foreign accent, too. Often they guess at English or Australian. Like I must have been born there and moved here when I was younger, right? I've also been asked if I was Scandinavian.

Interestingly I've noticed that native speakers never make this mistake. They sometimes recognize that I have a speech impediment but there's something about how I talk that is recognized with confidence as a native accent. That leads me to the (probably obvious) inference that whatever it is that non-native speakers use to judge accent and competency, it is different from what native speakers use. I'm guessing in my case, phrase-length tone contour. (Which I can sort of hear, and presumably reproduce well, even if I have trouble with the consonants.)

AI also really has trouble with transcribing my speech. I noticed that as early as the '90s with early speech recognition software. It was completely unusable. Even now AI transcription has much more trouble with me than with most people. Yet aside from a habit of sometimes mumbling, I'm told I speak quite clearly, by humans.

Hearing different things, as it were.

chupchap•3mo ago
To judge local or not, I would consider use of phrases and word as well, and not just the accent. Perhaps, that's what is working for you?
overfeed•3mo ago
> AI also really has trouble with transcribing my speech. I noticed that as early as the '90s with early speech recognition software. It was completely unusable.

I don't know what your transcription use cases are, but you may be able to get an improvement by fine-tuning Whisper. This would require about $4 in training costs[1], and a dataset with 5-10 hours of your labeled (transcribed) speech, which may be the bigger hurdle[2].

1. 2000 steps took me 6 hours on an A100 on Collab, fine-tuning openai/whisper-large-v3 on 12 hours of data. I can shar my notebook/script with you if you'd like.

2. I am working on a PWA that makes it simple for humans to edit initial, automated transcriptions with mistakes for feeding the correct dataset back into the pipeline for fine-tuning, but its not ready yet

mjburgess•3mo ago
Any chance you could github your script for public use anyway?

It's an interesting self-contained example

erikerikson•3mo ago
We have a PWA for this at:

https://www.psyome.com/annotator

overfeed•3mo ago
It is desktop-only - do you have plans to support mobile browsers? My PWA is mobile-first.
erikerikson•3mo ago
Oh nice! No plans at the moment
erikerikson•3mo ago
FWIW, it might be usable on mobile but I haven't tried it tested it
thebruce87m•3mo ago
> with 5-10 hours of your labeled (transcribed) speech, which may be the bigger hurdle[2].

Can’t you just read from a known script?

Jemaclus•3mo ago
I'm also deaf, and I took 14 years of speech therapy. I grew up in Alabama. The only way you would know I'm from the South is because of the pin-pen merger[1]. Otherwise, you'd think I grew up in the American Midwest, due to how my speech therapy went. Almost nobody picks up on it, unless they are linguists that already knew about the pin-pen merger.

[1]https://www.acelinguist.com/2020/01/the-pin-pen-merger.html

nocoiner•3mo ago
I’m aware of the merger, but I literally can’t hear a difference between the words. I certainly pronounce them the same way.

I also think merry-marry-Mary are all pronounced identically. The only way I can conceive of a difference between them is to think of an exaggerated Long Island accent, which, yeah, I guess is what makes it an accent.

wongarsu•3mo ago
As someone with a German accent, to me the difference between merry and marry is the same as between German e (in this case ɛ in ipa) and ä (æ in ipa). Those two sounds are extremely close, but not quite the same. According to the Oxford dictionary that is true in British English, while it shows the same pronunciation (ɛ) for both in American English
mapt•3mo ago
They are the same phoneme for me in US Eastern suburbia, the only difference is in a subtle shift in the length that you drag it out. "merry" is faster than "marry" which is sometimes but not always faster than "Mary". Most UK accents seems to drag the proper name out an additional beat, and for some of them there's a slight pitch shift that sounds like "ma-ery", at its most extreme in Ireland (this is one early shibboleth by which I recognized Irish people before I really picked up on the other parts of the accent).
Jemaclus•3mo ago
That's exactly what the pin-pen merger is! As you know, it's not limited to pin/pen, and hearing ability (in my case, profound hearing loss) is not related to the ability to hear the difference. I don't understand the linguistics, but my very bad understanding is that there's actual brain chemistry here that means that you _can't_ hear the difference because you never learned it, never spoke it, and you pronounce them the same.

My partner is from the PNW and she pronounces "egg" as "ayg" (like "ayyyy-g") but when I say "egg" she can't hear the difference between what I'm saying and what she says. And she has perfect hearing. But she CAN hear the difference between "pin" and "pen", and she gets upset when i say them the same way. lol

But yeah, that's one of the things that makes accents accents. It's not just the sounds that come out of our mouths but the way we hear things, too. Kinda crazy. :)

nocoiner•3mo ago
When I was listening to some of the samples on the page you linked (pronunciation of “when”), it really seemed to me like the difference they were highlighting was how much the “h” was pronounced. Even knowing what I was listening for, it was very like my brain was just refusing to recognize the vowel sound distinction. So I think you must be right about it being a matter of basic brain chemistry.

In the example of the reverse pen/pin merger (HMS Pinafore) on that page, I couldn’t hear “penafore” to save my life. Fascinating stuff.

I used to think of the movie “Fargo” and think “haha comical upper midwestern accents.” And then at some point I realized that the characters in “No Country for Old Men” probably must sound similarly ridiculous to anyone whose grandparents and great grandparents didn’t all speak with a deep, rural West Texas accent - which mine did, so watching the movie it just seemed completely natural for the place and time at a deeply subconscious level.

gertlex•3mo ago
Hard of hearing, from the midwest; also identified as Swedish by the accent guesser.
gucci-on-fleek•3mo ago
Wow, I'm not deaf, but almost everything you mentioned applies to me too. I've never met anyone else who has experienced this before, yet all of your following points apply exactly to me:

> standard Canadian English is my native language

> Most native English speakers claim my speech is unmarked

> Non-native speakers of English often think I have a foreign accent, too. Often they guess at English or Australian. Like I must have been born there and moved here when I was younger, right?

> They sometimes recognize that I have a speech impediment but there's something about how I talk that is recognized with confidence as a native accent.

At least 2 or 3 times a year, someone asks me if I'm British, but me and my parents were born in Canada, and I've never even been to England, so I'm not really sure why some people think that I have a British accent. Interestingly, the accent checker guesses that my accent is

  American English    89%
  Australian English  3%
  French              3%
which is pretty close to correct.
fortran77•3mo ago
I was born in Brooklyn, to Yiddish speaking parents and Yiddish was my first language. I now spend half my time in California and half in Israel. The accent checker said 80% American English, 16% Spanish, and 4% Brazilian Portuguese. In Israel they ask if I’m Russian when I speak Hebrew. In the US, people ask where I’m from all the time because my accent—and especially my grammar—is odd. The accent checker doesn’t look for grammatical oddities but that’s where a lot of my “accent” comes from.
SkyeCA•3mo ago
I'm a maritimer and I'm constantly getting asked if I'm from South England, even by brits them self.

More bizarrely? Locals often assume I'm not from around here as well. I actually don't understand it.

gucci-on-fleek•3mo ago
> I'm a maritimer and I'm constantly getting asked if I'm from South England, even by brits them self.

I'm assuming that you're from NS/NB? Because it would be pretty fair for someone to mix up a British and a Newfoundland accent. (I'm from Alberta)

ehnto•3mo ago
Some variants of Australian English are very similar to Canadian English. I can't always immediately tell if someone is from Canada or home.

This is probably because some states in Aus use Queens English passed down from the colonies.

shade•3mo ago
Yep, I'm also deaf (since age 6), went through a lot of speech therapy, and have a very pronounced deaf accent. I live in the midwestern US (specifically, Ohio) and at least once a year I get asked where I'm from - England being the most common guess, but I've also had folks ask if I'm Scottish or Australian.

AI struggles massively with my accent. I've gotten the best results out of Whisper Large v2 and even that is only perhaps 60% accurate. It's been on my todo list to experiment with using LLMs to try to clean it up further - mostly so I can do things like dictate blog post outlines to my phone on long car rides - but I haven't had as much time as I'd like to mess around with it.

dmevich1•3mo ago
Fascinating work — especially how geography and history influence accent clustering more than language families. Brilliant visualization!
TomNomNom•3mo ago
I tried the oracle and got this:

> Your accent is Dutch, my friend. I identified your accent based on subtle details in your pronunciation. Want to sound like a native English speaker?

I'm British; from Yorkshire.

When letting it know how it got it wrong there's no option more specific than "English - United Kingdom". That's kind of funny, if not absurd, to anyone who knows anything of the incredible range of accents across the UK.

I also think the question "Do you have an accent when speaking English?" is an odd one. Everyone has an accent when speaking any language.

david-gpu•3mo ago
> I also think the question "Do you have an accent when speaking English?" is an odd one. Everyone has an accent when speaking any language.

Sure, I agree. But look at it from the perspective of a foreigner living in an English-speaking country, which is probably their target demographic.

We know that as soon as we open our mouth the locals will instantly pigeonhole us as "a foreigner". No matter how good we might be in other areas, we will never be one of "them". The degree of prejudice that may or may not exist against us doesn't matter as much as the ever present knowledge that the locals know that we are not one of them, and the fear of being dismissed because of that.

Nobody likes to stand out like that, particularly when it so clearly puts you at a disadvantage. That sort of insecurity is what this product is aimed at.

wizzwizz4•3mo ago
It's not ethical to lie to people about whether they need something you're selling, especially if you're playing on their fears of vulnerability to make the sale. Laundering the lies through an AI model doesn't make it any less bad.

BoldVoice is very clear about being an American accent "training app", so that's not (necessarily) what's happening here, but the point remains.

f7f3•3mo ago
It's quite offensive. English is my native tongue, I got a perfect IELTS score, and one of my parents was an English professor. But my accent makes me less than "native".
Antibabelic•3mo ago
IELTS is a test for non-native English speakers. Why did you have to take it?
physicsguy•3mo ago
It's often required for immigration purposes. Countries/Universities will let you off where you're coming from a country that has english as it's main language or have studied a degree in the language, but they often won't if you're a native English speaker living elsewhere.
chronci739•3mo ago
> Countries/Universities will let you off where you're coming from a country that has english as it's main language

Singapore is a “native” English speaking country yet has an extremely distinctive accent.

(usually seen as a negative by both Singaporeans and non-Singaporeans)

stared•3mo ago
I had a 3-month stay in Singapore (at CQT, NUS).

The first two days were a shock, as I felt it was a different language. But just after some time, god adjusted. And I find endearing both Singlish pronunciation and phrases.

For example, the first time I hear "ondah-cah?" I was puzzled. Then understood that it is "Monday can?". Which, as I learned, means "Would Monday work for you?".

f7f3•3mo ago
Migration purposes. I'm from South Africa, so it was required.

I would have done it regardless, because it got me extra points.

walthamstow•3mo ago
I agree there is no such thing as a "British accent", though I'm lucky that my mockney lilt is considered to be one, but Dutch, Danish and Yorkshire are very similar for historical reasons so it's somewhat understandable for you to be detected as Dutch in this app.

I find Danes speaking Danish to sound like a soft Yorkshire accent, and the vowels that Yorkies use are better written in Danish, like phøne.

vintermann•3mo ago
Well in Danish e's at the end of words aren't silent, so you may get som føni results:)
suddenlybananas•3mo ago
>That's kind of funny, if not absurd, to anyone who knows anything of the incredible range of accents across the UK.

Yeah I was disappointed when I realised this post was about foreign accents and not regional accents in English across the world.

nedt•3mo ago
Yeah it's the same for having just one accent "German". Swiss, Austrians but also north vs middle vs south Germans do still sound different - even when they talk English.
Measter•3mo ago
For me it doesn't think I'm reading the prompt correctly, and refuses to accept the input.

I'm also British, from Devon.

sailingparrot•3mo ago
> This voice standardization model is an in-house accent-preserving voice conversion model.

Not sure this model works really well. As a french/spanish native speaker, I can immediately recognize an actual French or Spanish person speaking in english, but the examples here are completly foreign to me. If I had to guess where the "french" accent was from I would have guessed something like Nigeria. For example spanish have a very distinct way of pronouncing "r" in english that is just not present here. I would have been unable to correctly guess French or Spanish for the ~10 examples present in each language (mayyybe 1 for French).

ilyausorov•3mo ago
For sure the voice standardization model is not perfect, but it was important for us to do especially for the voice privacy. It’s still pretty early tech.
vintermann•3mo ago
It's probably an artifact of them lumping together all varieties/dialects of a given language. I don't speak Spanish, but I know that the R is one of the things that's different in e.g. Argentina.
suddenlybananas•3mo ago
I wonder if they have a large population of African French speakers in the dataset?
kazinator•3mo ago
"Audible visualization" is a visualization enhanced by auditorization. :)
adeptima•3mo ago
Did research on accent, pronunciation improvement, phoneme recognition, kaldi ecosystem, etc … nothing really changed in the public domain past few years. There’s no even accurate open source dataset. All self claimedccc manually labelled dataset with 10k+ hours was partly done with automation. Next issue, model models operates in different latent space often with 50ms chunks while pronunciation assessment requires much better accuracy. Just try to say B loud - silent part gathering energy in the lips, loud part, and everything what resonates after. Worst part there are too many ml papers from the last year students or junior phd folks claiming success or fake improvements, etc

The article itself is just a vector projection in 3d space … the actual reality is much complex.

Any comments on pronunciation assessment models are greatly appreciated

oezi•3mo ago
You are right and I don't think incentives exist to solve the issues you describe, because currently many of the building blocks people are building are aligned to erase subtleaccent differences: the neural codecs, transcription systems such as whisper want to output clean/compressed representations of their inputs.
adeptima•3mo ago
100% agree
sota_pop•3mo ago
Very nice viz. it reminds me of the visualizations people used to do of the mnist data set in the days when the quintessential ML project was “training a hand writing digits classifier”: https://projector.tensorflow.org/
glandium•3mo ago
Note: this is related to https://news.ycombinator.com/item?id=42392088 from a few months ago.
bashtoni•3mo ago
The fact that they believe there to be a single 'British' accent means this can be quickly discounted as nonsense.
tavavex•3mo ago
When people mention a single "British accent", in 99% of the cases it's just a more widely understood shorthand for Received Pronunciation. I don't see how that's bad or wrong, considering how common it is in education.
noja•3mo ago
I would say in 99% of cases people mean Estuary English.
CamouflagedKiwi•3mo ago
It's not common any more, very few people really speak RP these days. The more usual thing accent that people might think of is sometimes called "Standard Southern British" (I've heard "BBC English" as well).
TurkTurkleton•3mo ago
I mean, if you want to be like that, you could generalize that statement to "the fact that they believe there to be a single `$LANGUAGE_OR_REGION` accent means this can be quickly discounted as nonsense". Other languages, and other varieties of English, have regional variation as well, after all--although in the case of other languages, I'll grant that the accents of, say, two German speakers from different regions might not be as distinct from each other in English as they are in German.

At any rate, I was looking forward to finding out what the accent oracle thought of my native US English accent, which sounds northern to southerners and southern to northerners, but I guess it'd probably just flag it as "American".

fnands•3mo ago
Try yourself here: https://start.boldvoice.com/accent-oracle

It wrongly pegged me as Swedish.

It's second choice was the place I live, and third place was where I'm from, so not too bad overall. I have been told I have a very ambiguous accent though.

fnands•3mo ago
Tried again and this time it got me. Second place is still Swedish. Looking at the UMAP visualisation, there is a South African cluster overlapping with a Swedish cluster, so makes sense I guess.
physicsguy•3mo ago
It got me, native English speaker with British accent.

I was hoping it might drill down into regional accents though, there is a huge variety in the UK. I have a Midlands accent which can occasionally confuse non-native speakers.

blixt•3mo ago
It would be interesting to do a wider test like this but instead of trying to clump people together into "American English" and "British English" it would be interesting if the data point was "in which city do people speak like you do?" and create a geographic map of accents.

I'm from the south of Sweden and I've had my "accent" made fun of by people from Malmö just because I grew up outside of Helsingborg, because the accent changes that much in just 60 kilometers.

tananan•3mo ago
It would be really cool if it could highlight the parts of the speech that gave you away your accent. It guesses mine correctly most of the time (though not the first time I tried), but also lets me know my accent is pretty light.
stared•3mo ago
I am curious - why UMAP not t-SNE? (See https://pair-code.github.io/understanding-umap/) When I saw the vis, there is a a collection of lines, which look as an artifact. t-SNE (typically) gives more "organic" results of blobs, provided you set perplexity high enough.

Also, while I admire examples of instances, it would be interesting to the map or original laguages - which is close to which, in terms of their English accents.

alex-moon•3mo ago
Fascinated by the cluster of Australian, British and South African. As an Australian living in UK, I hear an enormous difference between these accents - even just in the British ones, the Yorkshireman and the Geordie stick out like a sore thumb to me - the narcissism of small differences perhaps. Interestingly, my partner, who is from England, often says, of various Australians we hear (either on TV or my friends), that they sound British to her. I, meanwhile, can pick an Australian from very few words. What are we hearing differently? It is a mystery to me.
epolanski•3mo ago
I'm a native italian speaker with polish ancestry.

I've tried to do the accent oracle test few times and it catches me being Italian with a 90%+ confidence.

The interesting thing is that if I try to fake a more english accent like American...it tells me I'm polish.

Which is odd because I don't really have a polish accent and don't speak it that well. I sound Italian even in Polish.

https://start.boldvoice.com/accent-oracle

ahoka•3mo ago
Apparently my accent is "Application Error".
nmeofthestate•3mo ago
All the accents sound like somebody from... somewhere in the third world...? but with a small trace of the named accent. I don't know if that's intended - maybe the different recordings are not supposed to sound like their label but like a foreigner who learned English while around people with that accent?