frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

UTF-8 history (2003)

https://doc.cat-v.org/bell_labs/utf-8_history
62•mikecarlton•3d ago

Comments

theologic•1h ago
Great story and brought up on Hacker News on a regular cycle: https://news.ycombinator.com/item?id=26735958

While I love the Hacker News purity, takes me back to Usenet, it makes me wonder if a little AI could take a repost and auto insert previous postings to allow people to see previous discussions.

flohofwoe•1h ago
WinNT missing out on UTF-8 and instead going with UCS-2 for their UNICODE text encoding might have been "the other" billion dollar mistake in the history of computing ;)

There was a 9 month time window between the invention of UTF-8 and the first release of WinNT (Sep 1992 to Jul 1993).

But ok fine, UTF-8 didn't really become popular until the web became popular.

But then missing the other opportunity to make the transition with the release of the first consumer version of WinNT (WinXP) nearly a decade later is inexcusable.

rfl890•1h ago
And nowadays developers have to deal with the "A/W" suffix bullshit.
ivanjermakov•56m ago
Windows using CP-125X encoding by default in many countries instead of a UTF-8 did a lot of damage, at least in my experience.
anonymars•55m ago
"UTF-8 was first officially presented at the USENIX conference in San Diego, from January 25 to 29, 1993" (https://en.wikipedia.org/wiki/UTF-8)

Hey team, we're working to release an ambitious new operating system in about 6 months, but I've decided we should burn the midnight oil to rip out and redo all of the text handling we worked on to replace it with something that was just introduced at a conference..

Oh and all the folks building their software against the beta for the last few months, well they knew what they were getting themselves into, after all it is a beta (https://books.google.com/books?id=elEEAAAAMBAJ&pg=PA1#v=onep...)

As for Windows XP, so now we're going to add a third version of the A/W APIs?

More background: https://devblogs.microsoft.com/oldnewthing/20190830-00/?p=10...

nostrademons•24m ago
Interestingly, there is another story on the HN front page about Steve Wozniak doing exactly that for the Apple I:

https://news.ycombinator.com/item?id=45265240

bobmcnamara•48m ago
Can't imagine they would've wanted to change encoding between Win3.1 and NT3.1.
masfuerte•16m ago
But they did?
xenadu02•1h ago
It is worth reading the history of the proposal. The final form is superior to the others so someone was doing a lot of editing!

Take the final and second form where the use of multiple letters was eliminated, instead using "v" to indicate bits of the encoded character.

I also chuckle at the initial implementation's note about the desire to delete support for 4/5/6 byte versions. Someone was still laboring under the UCS/UTF-16 delusion that 16-bits was sufficient.

bikeshaving•1h ago
There are socio-economic reasons why the early computing boom (ENIAC, UNIVAC, IBM mainframes, early programming languages like Fortran and COBOL) was dominated by the US: massive wartime R&D, university infrastructure, and a large domestic market. But I wonder if the Anglophone world also had an orthographic advantage as well. English uses 26 letters with no diacritics, compared to other languages like Chinese (thousands of characters), Hindi (50+ letters), or French/German (latin with diacritics).

That simplicity made early character encodings like 7-bit ASCII feasible, which in turn lowered the hardware and software barriers for building computers, keyboards, and programming languages. In other words, the Latin alphabet’s compactness may have given English-speaking engineers a “low-friction” environment for both computation and communication. And now it’s the lingua franca for most computing on top of which support for other languages is now built.

It’s very interesting to think about how written scripts give different cultures advantages in computing and elsewhere. I wonder for instance how scripts and AI interact, like LLMs trained in Chinese are working with a high-density orthography with a stable, 3500 year dataset.

ummonk•54m ago
The same applies to why China had all the building blocks (pun intended) of the printing press but it was perfected by Gutenberg in Europe, where the number of glyphs was much more manageable.
zahlman•10m ago
Indeed. Even if you try to split hanzi into parts it's far more unwieldy (https://en.wikipedia.org/wiki/Kangxi_radicals).
pklausler•7m ago
We got lots done with 6-bit pre-ASCII encodings, actually, like CDC Display Code and Univac's Fieldata. It's more than enough for 26 letters, 10 digits, and lots of punctuation. And there are faint echoes of these early character sets remaining in ASCII -- a zero byte is ^@, for example, because @ was the zero-valued Fieldata "master space" character, which distinguished EXEC 8 control cards from source code and data cards.
kps•3m ago
Computer character codes descended directly from pre-computer codes, either teletype or punched card. The advantage holds back through printing to writing itself; having a small, fixed set of glyphs that can represent anything is just better.

Waymo has received our pilot permit allowing for commercial operations at SFO

https://waymo.com/blog/#short-all-systems-go-at-sfo-waymo-has-received-our-pilot-permit
307•ChrisArchitect•2h ago•194 comments

Things you can do with a Software Defined Radio (2024)

https://blinry.org/50-things-with-sdr/
373•mihau•4h ago•75 comments

Adios Chicos, 25 Years of KDE

https://jriddell.org/2025/09/14/adios-chicos-25-years-of-kde/
32•thangqt•22m ago•4 comments

Live Updates: Shai-Hulud, the Most Dangerous NPM Breach in History

https://www.koi.security/blog/shai-hulud-npm-supply-chain-attack-crowdstrike-tinycolor
10•chha•18m ago•0 comments

Plugin System

https://iina.io/plugins/
86•xnhbx•2h ago•19 comments

Launch HN: Rowboat (YC S24) – Open-source IDE for multi-agent systems

https://github.com/rowboatlabs/rowboat
29•segmenta•1h ago•16 comments

Bertrand Russell to Oswald Mosley (1962)

https://lettersofnote.com/2016/02/02/every-ounce-of-my-energy/
102•giraffe_lady•2h ago•48 comments

A new experimental Google app for Windows

https://blog.google/products/search/google-app-windows-labs/
59•meetpateltech•3h ago•94 comments

CIA Freedom of Information Act Electronic Reading Room

https://www.cia.gov/readingroom
121•bookofjoe•5h ago•23 comments

UTF-8 history (2003)

https://doc.cat-v.org/bell_labs/utf-8_history
62•mikecarlton•3d ago•14 comments

I built my own phone because innovation is sad rn [video]

https://www.youtube.com/watch?v=qy_9w_c2ub0
12•Timothee•1d ago•0 comments

Self Propagating NPM Malware

https://www.stepsecurity.io/blog/ctrl-tinycolor-and-40-npm-packages-compromised
489•jamesberthoty•7h ago•377 comments

Implicit ODE Solvers Are Not Universally More Robust Than Explicit ODE Solvers

https://www.stochasticlifestyle.com/implicit-ode-solvers-are-not-universally-more-robust-than-exp...
66•cbolton•5h ago•21 comments

Development of the MOS Technology 6502: A Historical Perspective (2022)

https://www.EmbeddedRelated.com/showarticle/1453.php
32•jason_s•3h ago•5 comments

Paper Folding Assembly Line [video]

https://www.youtube.com/watch?v=XhUuhl9iWpQ
9•peteforde•1w ago•0 comments

The Linux Process Journey [pdf]

https://thelearningjourneyebooks.com/wp-content/uploads/2023/09/TheLinuxProcessJourney_v6_Sep2023...
7•maxmoehl•43m ago•0 comments

How Container Filesystem Works: Building a Docker-Like Container from Scratch

https://labs.iximiuz.com/tutorials/container-filesystem-from-scratch
6•lgunsch•3d ago•0 comments

Scammed out of $130K via fake Google call, spoofed Google email and auth sync

https://bewildered.substack.com/p/i-was-scammed-out-of-130000-and-google
62•davidscoville•1h ago•95 comments

Show HN: AI Code Detector – detect AI-generated code with 95% accuracy

https://code-detector.ai/
16•henryl•25m ago•6 comments

Generative AI as Seniority-Biased Technological Change

https://papers.ssrn.com/sol3/papers.cfm?abstract_id=5425555
178•zeuch•5h ago•155 comments

Writing an operating system kernel from scratch – RISC-V/OpenSBI/Zig

https://popovicu.com/posts/writing-an-operating-system-kernel-from-scratch/
40•popovicu•2d ago•1 comments

Microsoft Favors Anthropic over OpenAI for Visual Studio Code

https://www.theverge.com/report/778641/microsoft-visual-studio-code-anthropic-claude-4
134•corvad•3h ago•63 comments

60 years after Gemini, newly processed images reveal details

https://arstechnica.com/space/2025/09/60-years-after-gemini-newly-processed-images-reveal-incredi...
216•sohkamyung•3d ago•59 comments

Will I run Boston 2026?

https://getfast.ai/blogs/boston-2026
23•steadyelk•2h ago•24 comments

Teen safety, freedom, and privacy

https://openai.com/index/teen-safety-freedom-and-privacy
61•meetpateltech•5h ago•65 comments

Java 25 officially released

https://mail.openjdk.org/pipermail/announce/2025-September/000360.html
180•mkurz•5h ago•75 comments

Scientists uncover extreme life inside the Arctic ice

https://news.stanford.edu/stories/2025/09/extreme-life-arctic-ice-diatoms-ecological-discovery
64•hhs•3d ago•23 comments

Robert Redford has died

https://www.nytimes.com/2025/09/16/movies/robert-redford-dead.html
402•uptown•6h ago•124 comments

Migrating to React Native's new architecture

https://shopify.engineering/react-native-new-architecture
96•vidyesh•4d ago•59 comments

"Your" vs. "My" in user interfaces

https://adamsilver.io/blog/your-vs-my-in-user-interfaces/
420•Twixes•15h ago•205 comments