Show HN: I trained a 9M speech model to fix my Mandarin tones

https://simedw.com/2026/01/31/ear-pronunication-via-ctc/
125•simedw•3h ago
Built this because tones are killing my spoken Mandarin and I can't reliably hear my own mistakes.

It's a 9M Conformer-CTC model trained on ~300h (AISHELL + Primewords), quantized to INT8 (11 MB), runs 100% in-browser via ONNX Runtime Web.

Grades per-syllable pronunciation + tones with Viterbi forced alignment.

Try it here: https://simedw.com/projects/ear/
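
For readers unfamiliar with the alignment step: the post does not show the author's code, so the following is only a minimal, generic sketch of Viterbi forced alignment over CTC outputs. It assumes per-frame log-probabilities are already available as a list of lists, that token id 0 is the CTC blank, and that the token ids (1 for "ni3", 2 for "hao3") are hypothetical. The function names `ctc_forced_align` and `token_spans` are made up for illustration.

```python
import math
from itertools import groupby


def ctc_forced_align(log_probs, targets, blank=0):
    """Return the best CTC path (one token id per frame) that spells `targets`.

    log_probs: T x V list of per-frame log-probabilities (assumed input).
    targets:   non-empty list of token ids to align against.
    """
    if not targets:
        raise ValueError("targets must be non-empty")
    T = len(log_probs)
    # Interleave blanks: [blank, y1, blank, y2, ..., yL, blank]
    ext = [blank]
    for tok in targets:
        ext += [tok, blank]
    S = len(ext)
    NEG = -math.inf
    score = [[NEG] * S for _ in range(T)]
    back = [[0] * S for _ in range(T)]
    # A path may start in the leading blank or in the first token.
    score[0][0] = log_probs[0][blank]
    score[0][1] = log_probs[0][ext[1]]
    for t in range(1, T):
        for s in range(S):
            # Allowed moves: stay, advance by one, or skip a blank
            # between two different tokens.
            cands = [(score[t - 1][s], s)]
            if s >= 1:
                cands.append((score[t - 1][s - 1], s - 1))
            if s >= 2 and ext[s] != blank and ext[s] != ext[s - 2]:
                cands.append((score[t - 1][s - 2], s - 2))
            best, prev = max(cands)
            if best > NEG:
                score[t][s] = best + log_probs[t][ext[s]]
                back[t][s] = prev
    # A valid path ends in the last token or the trailing blank.
    s = S - 1 if score[T - 1][S - 1] >= score[T - 1][S - 2] else S - 2
    if score[T - 1][s] == NEG:
        raise ValueError("too few frames to align targets")
    path = [blank] * T
    for t in range(T - 1, -1, -1):
        path[t] = ext[s]
        s = back[t][s]
    return path


def token_spans(path, blank=0):
    """Collapse a frame-level path into (token, start_frame, end_frame) spans."""
    spans, t = [], 0
    for tok, run in groupby(path):
        n = len(list(run))
        if tok != blank:
            spans.append((tok, t, t + n))
        t += n
    return spans


if __name__ == "__main__":
    probs = [
        [0.8, 0.1, 0.1],
        [0.1, 0.8, 0.1],
        [0.1, 0.8, 0.1],
        [0.1, 0.1, 0.8],
        [0.8, 0.1, 0.1],
    ]
    lp = [[math.log(p) for p in row] for row in probs]
    path = ctc_forced_align(lp, [1, 2])
    print(path)               # [0, 1, 1, 2, 0]
    print(token_spans(path))  # [(1, 1, 3), (2, 3, 4)]
```

Once each syllable has a frame span, a grader could compare the model's probabilities inside that span against the expected syllable and tone. How the real app scores those spans is not stated in the post.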

Comments

jellojello•3h ago
This is amazing, if you feel like opening an entire language to being learned more easily.. Farsi is a VERY overlooked language, my wife/her family speak it but it's so difficult finding great language lessons (it's also called Persian/Dari)
simedw•3h ago
Thank you.

I had a quick look at Farsi datasets, and there seem to be a few options. That said, written Farsi doesn’t include short vowels… so can you derive pronunciation from the text using rules?

kranner•2h ago
> written Farsi doesn’t include short vowels… so can you derive pronunciation from the text using rules?

You can't, but Farsi dictionaries list the missing short vowels/diacritics/"eraab" for every word.

For instance, see this entry: https://vajehyab.com/dehkhoda/%D8%AD%D8%B3%D8%A7%D8%A8?q=%D8...

With the short vowel on the first letter it would be written حِساب (normally written as just حساب)

The dictionary entry linked shows that there is a ِ on the first letter ح

But you would have to disambiguate between homographs that differ only in the eraab.

vunderba•3h ago
When I was living in Taiwan, one of the ways I forced myself to remember to pronounce the tones distinctly was by waving my hand in front of me, tracing the arc of each character’s tone.

It helped a lot even if I did look like an insane expat conducting an invisible orchestra.

One more thing: there's quite a bit of variation in how regional accents in the mainland can affect tonal pronunciation. It might be worth reaching out to some native speakers to give you some baseline figures.

simedw•2h ago
For accents, I’ve mostly tested with a few friends so far. I’m wondering whether region should be a parameter, because training on all dialects might make the system too lax.
zdragnar•2h ago
In a university Mandarin class, one of the adult students (i.e. probably 40 or so) WAY over exaggerated his tones, to the point that the little old lady teaching us laughed out loud after one of his answers.

A few years later, he had the most clean and consistent pronunciation out of anyone I'd been in a class with, and easily switched between the Beijing and other accents depending on which teacher we had on any given day.

I rather regret not emulating him, even though I haven't really used it for nearly 20 years and have forgotten most of it.

ecshafer•2h ago
From a language learning standpoint that does make sense. Over-exaggerating while you are learning helps cement the idea, and then when you are speaking more naturally you will fall back into a regular kind of tone.
luckydata•1h ago
that's EXACTLY how I taught myself to speak with a Spanish accent from Madrid. I imitated the way TV celebrities spoke and the way the speakers on the metro announced the stations, and it gave me a base for how to use my mouth and throat appropriately. After a while I was able to tone it down and my accent got so good that locals couldn't tell I wasn't Spanish - I had this cool party trick pulling out my id and showing them I was truly a foreigner!
devin•2h ago
This sounds like how solfège training works. You use a hand signal to indicate a specific tone: do re mi fa so la ti
cyberax•2h ago
Hand motions help! Especially when you want to memorize new words, because initially you need to treat tone as something additional to remember.

I used simple index finger motions to mark tones.

rahimnathwani•3h ago
This is incredible. When I was first learning Chinese (casually, ~20 years ago), my teacher used some Windows software that drew a diagram of the shape of my pronunciation, so she could illustrate what I was getting wrong in some objective way.

The thing you've built is so good, and I would have loved to have it when I was learning Mandarin.

I tried it with a couple of sentences and it did a good job of identifying which tones were off.

drekipus•2h ago
instantly awesome.

I suck at Chinese but I want to get better and I'm too embarrassed to try and talk with real people and practise.

This is a great compromise. even just practising for a few minutes I already feel way more confident based on its feedback, and I feel like I know more about the details of pronunciation.

I'm worried this might get too big and start sucking like everything else.

btrlsnqtn•2h ago
The article mentions the bitter lesson. I'm confused about the status of Sutton's opinion of the bitter lesson. On the one hand, he invented the concept. On the other hand, he appears to be saying that LLMs are not the correct approach to artificial intelligence, which to a naive outsider looks like a contradiction. What gives?
affogarty•2h ago
This is extremely cool, although I asked my wife (who is Chinese) to try it out and it said she made some mistakes.
dapangzi•2h ago
Longtime lurker, made an account specifically to give feedback here as an intermediate speaker. :)

This is a great initiative and I hope to see more come out of this; I am not criticizing, but just want to provide my user experience here so you have data points.

In short, my experience lines up with your native speakers.

I found that it loses track of the phonemes when speaking quickly, and tones don't seem to line up when speaking at normal conversational speed.

For example, if I say 他是我的朋友 at normal conversational speed, it will assign `de` to 我, and sometimes it decides that I didn't pronounce the retroflex in `shi` and renders it `si`. I listened back to make sure I said everything: the phonemes are there in the recording, but the UI displays the wrong phonemes and tones.

By contrast, if I speak slowly and really push each tone, the phonemes and tones all register correctly.

Also, is this taking into account tone transformation? Example, third tones (bottom out tone) tend to smoosh into a second tone (rising) when multiple third tones are spoken in a row. Sometimes the first tone influences the next tone slightly, etc.

Again, great initiative, but I think it needs a way to deal with speech that is conversationally spoken and maybe even slurred a bit due to the nature of conversational level speech.
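
The third-tone change described in this comment (two third tones in a row, with the first realized as a rising second tone) can be sketched as a small preprocessing rule on the expected tones. This is a simplified illustration and not something the app is known to do: the function name `apply_third_tone_sandhi` is hypothetical, syllables are assumed to be numbered pinyin strings, and real speech groups longer runs of third tones by phrase structure, which this pairwise rule ignores.

```python
def apply_third_tone_sandhi(syllables):
    """Turn underlying tones into expected surface tones.

    syllables: list of numbered pinyin strings such as ["ni3", "hao3"].
    Simplified rule: a third tone directly followed by another
    (underlying) third tone is realized as a second tone.
    """
    out = list(syllables)
    for i in range(len(syllables) - 1):
        if syllables[i].endswith("3") and syllables[i + 1].endswith("3"):
            out[i] = syllables[i][:-1] + "2"
    return out


if __name__ == "__main__":
    print(apply_third_tone_sandhi(["ni3", "hao3"]))  # ['ni2', 'hao3']
    print(apply_third_tone_sandhi(["ta1", "shi4"]))  # ['ta1', 'shi4']
```

A grader that compared the speaker against the surface tones rather than the dictionary tones would stop flagging a correctly spoken "ni2 hao3" as a mistake.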

tifan•33m ago
I had the same issue! Perhaps being another dapangzi is the problem here lol
sqs•31m ago
I don't think it takes care of tone transformation (eg 他是 ni3shi4 -> ni2shi4). Or if it does, my tones are just off. But it's a really cool idea!
mercanlIl•9m ago
The tool definitely needs to address tone transformations, it’s a big part of how the language is spoken. Otherwise it’s mostly useful for a first year student speaking in isolation.

Hoping to see improvements in this area

ecshafer•2h ago
For anyone who is a native speaker of a European language and hasn't tried to learn Chinese or some other tonal language, it's really hard to understand how hard it is. The tones can really be very subtle, and your ear is not fine-tuned to them. So you think you are saying it right, but native speakers have no idea what you are saying.
cyberax•2h ago
I'm a native Russian speaker, and I decided to learn Mandarin, because it's linguistically almost the opposite of Russian.

I had no problems with tone pronunciation, but tone recognition was indeed much trickier. I still often get lost when listening to fast speech although I can follow formal speech (news) usually without problems.

dionian•1h ago
it's critical because without proper tonal enunciation the words can be ambiguous.
laurieg•1h ago
For someone who hasn't grown up speaking a language with tones or pitches, the process of learning them can be maddening. I applaud anyone who makes tools like this to try to make the process easier.

My experience in learning Japanese pitch accent was eye-opening. At the start, I couldn't hear any difference. On quizzes I essentially scored the same as random guessing.

The first thing that helped me a lot was noticing how there were things in my native language (English) that used pitch information. For example, "uh-oh" has a high-low pitch. If you say it wrong it sounds very strange. "Uh-huh" to show understanding goes low-high. Again, if you reverse it it sounds unusual.

The next part was just doing lots of practice with minimal pairs. Each time I would listen and try my best to work out where the pitch changed. This took quite a lot of time. I feel like massed practice (many hours in a day) helped me more than trying to do 10 minutes regularly. Try to hear them correctly, but don't try too hard. I didn't have any luck with trying harder to 'understand' what was going on. I liken it to trying to learn to see a new color. There isn't much conscious thought.

The final piece of the puzzle was learning phrases, not individual words, that had pitch changes. For example: "yudetamago" could be boiled egg or boiled grandchildren. Somehow my brain just had a much easier time latching on to multi-word phrases instead of single words. Listening to kaki (persimmon) vs kaki (oyster) again and again seemed much harder.

Of course, your mileage may vary with these techniques. I already spoke decent Japanese when I started doing this.

danparsonson•52m ago
Wholeheartedly (or maybe downheartedly?) agree with this - sometimes I try to say the simplest things and people just stare at me like I'm speaking Martian. Which I suppose I might as well be! One of my big problems is implicit use of tones for things like expressing uncertainty; that's a very difficult habit to get out of.
vjvjvjvjghv•37m ago
Agree. It’s really hard. It also explains why a lot of people born in China tend to make serious pronunciation errors when speaking English or German. They are used to focusing on different things than us westerners.

It took me a very long time to really understand how important tone is in Chinese.

bytesandbits•2h ago
great work! I am going to try it out. Currently about to learn some Mandarin to be able to talk with hawker stand owners for a trip I am doing soon. I am trilingual and can speak a few languages on top of that, but none of them tonal. I am new to tonal languages and I find myself struggling with this... a lot!
anonzzzies•2h ago
good luck! I speak 6 languages fluently but none of them tonal and I find Mandarin very challenging; it does not help that people in places where you might need it are not very forgiving; asking for a green fork in a tea shop leaves people very bewildered.
cmuguythrow•1h ago
Awesome idea!
nirvanatikku•1h ago
talk about 30 seconds to wow. great app, UX and demo. would love to use this. kudos.
jrockway•1h ago
Interesting application! A friend of mine built a model like this to help her make her voice more feminine, and it is neat to see a similar use case here.
dionian•1h ago
it heard wu2 but i heard wo2 from you fine. and it should sound like wo2 not wo3 if spoken quickly. not a native speaker though so i could be wrong
byb•1h ago
Neat. A personal tone trainer. Seriously, shut up and take my money now. Of course, it needs a vocabulary trainer, and zhuyin/traditional character support.
SequoiaHope•1h ago
Amazingly I just did the same thing! Only with AISHELL. It needs work. I used the encoder from the Meta MMS model.

https://github.com/sequoia-hope/mandarin-practice

ChadNauseam•1h ago
This is amazing. I'm also working on free language learning tech. I have some SOTA NLP models on huggingface and a free app. My most recent research is a list of every phrase [0].

Pronunciation correction is an insanely underdeveloped field. Hit me up via email/twitter/discord (my bio) if you're interested in collabing.

[0]: https://gist.github.com/anchpop/acbfb6599ce8c273cc89c7d1bb36...

stuxnet79•1h ago
How difficult would it be to adapt this to Cantonese? It is a surprisingly difficult language to learn. It has more tones than Mandarin plus comparatively less access to learning resources (in my experience)
baby•1h ago
For people trying to say the "j" sound correctly, as in "jiu" (old), just say "dz", so in that example "dziu"
iamanllm•43m ago
holy crap, I was literally imagining how I wanted something exactly like this yesterday! you are a hero!
tifan•34m ago
Well, it only works when I speak word by word, not in full sentences or at a normal speed for daily conversation. The model thinks I am making mistakes when I speak casually (I am a native Chinese speaker with a Mandarin 2A certification, which is required for teachers and other occupations that require a very high degree of Mandarin accuracy). You wouldn’t really notice it, but pronunciation is very different between casual and formal speech…
rablackburn•33m ago
> And if there’s one thing we’ve learned over the last decade, it’s the bitter lesson: when you have enough data and compute, learned representations usually beat carefully hand-tuned systems.

There are still holdouts!

Come back to me in a couple of decades when the trove of humanity's data has been pored over and drifted further out of sync with (verifiable) reality.

Hand-tuning is the only way to make progress when you've hit a domain's limits. Go deep and have fun.

memalign•25m ago
I wish this had a pinyin mode…! I am learning to speak Mandarin but I am not learning to read/write.

( I’m learning using a flashcards web app I made and continue to update with vocab I encounter or need: https://memalign.github.io/m/mandarin/cards/index.html )

data_ders•24m ago
same! but if you get it inevitably wrong the first time it gives you the pinyin. but i struggled to get it to transcribe the consonants I was making let alone the tones. i'm pretty sure i'm not as bad as that!