
AI companies charge you 60% more based on your language, BPE tokens

https://tokenstree.com/newsletter-article-5.html
22•vfalbor•1h ago

Comments

vfalbor•1h ago
The Biggest Con of the 21st Century: Tokens

How AI Companies Are Charging You More Without You Even Realizing It

You pay for what you use. That's the deal. Except it's not.

When you use an AI model — GPT-4, Claude, Gemini — you do not pay per word. You pay per token. And that tiny technical detail is quietly costing you, depending on which company you choose, up to 60% more for the exact same request.

prophesi•1h ago
Wait until you hear that most models tend to perform worse for non-English languages.
aprentic•1h ago
Do you know if that's true of non-English models?

As I said elsewhere, Deepseek injects Chinese characters into responses. Anecdotally, that seems to happen when the context gets longer. That suggests that they're primarily trained in Chinese and I would expect them to use fewer tokens for Chinese than English.

Mindless2112•1h ago
Funny they didn't include any CJK languages on their list.
bobbiechen•1h ago
I heard an anecdote that Qwen Coder works better when prompted in Korean - haven't tested it for myself though.
aprentic•1h ago
Deepseek will regularly spit out Chinese (汉字) during English sessions. They generally seem to be syntactically related but it makes me think that there's some overhead of using English with an engine that's primarily trained in Chinese.
simianwords•1h ago
This has to be one of the worst things I have read. If this is not satire idk what counts
lxgr•1h ago
“Pay by token” is priced by token, not word or semantic unit; news at 11?

The product itself seems genuinely useful, but the article reads very sensationalist about something that should be pretty obvious.

In other news: French publishers are paying 30% more for paper than English publishers!!

charcircuit•1h ago
The companies didn't arbitrarily choose to bill by tokens. The cost to serve the models scales linearly with tokens which makes it a reasonable pricing strategy. The reality is that you are charged more because it was more expensive to handle the request.
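
To make the linear-pricing point concrete, here is a minimal sketch. The price and both token counts are made-up numbers chosen only to illustrate the arithmetic; they are not taken from any provider or from the article.

```python
def cost_usd(tokens: int, usd_per_million_tokens: float) -> float:
    """Linear pricing: the bill is proportional to the token count."""
    return tokens * usd_per_million_tokens / 1_000_000


def overhead_percent(baseline_tokens: int, other_tokens: int) -> float:
    """How much more the second request costs than the first, in percent."""
    return (other_tokens - baseline_tokens) * 100 / baseline_tokens


# Hypothetical values, for illustration only.
PRICE_PER_MILLION = 10.0   # USD per million tokens (assumed)
ENGLISH_TOKENS = 1000      # same request written in English (assumed)
OTHER_TOKENS = 1600        # same request in another language (assumed)

print(f"English: ${cost_usd(ENGLISH_TOKENS, PRICE_PER_MILLION):.4f}")
print(f"Other:   ${cost_usd(OTHER_TOKENS, PRICE_PER_MILLION):.4f}")
print(f"Overhead: {overhead_percent(ENGLISH_TOKENS, OTHER_TOKENS):.0f}%")
```

Under linear pricing the percentage overhead depends only on the ratio of token counts, not on the price itself.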
lxgr•1h ago
I guess token length is indirectly determined by language frequency in the training set, and it would be possible to train a model on machine translated training data only to combat that (or maybe to force tokenization to overrepresent languages other than English?), but there’s no way that would be economical, and inference would just be accordingly more expensive to recoup that effort.
Animats•1h ago
It's an ad. "The Solution: TokensTree". From tokenstree.com

I was expecting a secondary market in tokens, perhaps crypto-powered, but no.

The cost difference for languages roughly correlates with how much text it takes to say something in that language. English is relatively terse. (This is a common annoyance when internationalizing dialog boxes. If sized for English, boxes need to be expanded.) They don't list any of the ideographic languages, which would be interesting.

lxgr•1h ago
That would cause the opposite effect of what we’re actually seeing (i.e. “more redundant languages” would be using comparatively fewer tokens).

The real reason is that tokens are probably strictly based on n-gram frequency of the training data, and English is the most common language in the training data.
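
A toy illustration of that frequency effect, using a bare-bones byte-pair-encoding (BPE) trainer. This is a sketch and not any vendor's tokenizer: the two-word corpus, the 90/10 split, and the merge budget are all assumptions picked to make the effect visible.

```python
from collections import Counter


def apply_merge(symbols: tuple, pair: tuple) -> tuple:
    """Fuse every adjacent occurrence of `pair` into a single symbol."""
    out, i = [], 0
    while i < len(symbols):
        if i + 1 < len(symbols) and (symbols[i], symbols[i + 1]) == pair:
            out.append(symbols[i] + symbols[i + 1])
            i += 2
        else:
            out.append(symbols[i])
            i += 1
    return tuple(out)


def train_bpe(corpus: list, num_merges: int) -> list:
    """Learn merges by repeatedly fusing the most frequent adjacent pair."""
    words = Counter(tuple(word) for word in corpus)  # words start as characters
    merges = []
    for _ in range(num_merges):
        pairs = Counter()
        for symbols, freq in words.items():
            for pair in zip(symbols, symbols[1:]):
                pairs[pair] += freq
        if not pairs:
            break
        # Most frequent pair wins; ties are broken deterministically.
        best = max(pairs, key=lambda p: (pairs[p], p))
        merges.append(best)
        merged = Counter()
        for symbols, freq in words.items():
            merged[apply_merge(symbols, best)] += freq
        words = merged
    return merges


def tokenize(word: str, merges: list) -> list:
    """Encode a word by replaying the learned merges in order."""
    symbols = tuple(word)
    for pair in merges:
        symbols = apply_merge(symbols, pair)
    return list(symbols)


# Assumed corpus: 90% English, 10% German, one word each.
corpus = ["language"] * 90 + ["sprache"] * 10
merges = train_bpe(corpus, num_merges=7)

print(tokenize("language", merges))
print(tokenize("sprache", merges))
```

With a limited merge budget, every merge goes to the more frequent word, so the rarer language is left split into many more pieces.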

telotortium•1h ago
My impression of dialog box size from least to greatest is CJK (Chinese < Korean < Japanese) < English < everything else
vfalbor•1h ago
This is not crypto or anything like that; it's a platform for token reduction. You can try it and then post about it before making assumptions. :)
simianwords•1h ago
Europeans be like:

AI commits a racism.

AI commits an environmentalism.

Now use my product (that won't solve either)

simonw•1h ago
The title of this piece differs from the HN title, but the HN title is a lot better. The original title is "The Biggest Con of the 21st Century: Tokens", subhead "How AI Companies Are Charging You More Without You Even Realizing It" - which is an absurd title because tokens are NOT the "biggest con" of anything, and AI companies make it very clear exactly how their pricing works.

I also don't like how this article presents numbers for language differences - in the "The Language Tax" section - but fails to clarify which tokenizer and where those numbers came from.

aprentic•1h ago
There's certainly an interesting question here, even if Tokenstree doesn't provide a solution or even define the problem well.

The broader questions are still interesting.

If an AI is trained more on language A than language B but has some training in translating B to A, what is the overhead of that translation?

If the abilities are combined in the same model, how much lower is the overhead than doing it as separate operations?

i.e., is f(a) < f(b) < f(t(B,A))? Here a and b are prompts in A and B, f() is the cost of processing a prompt, and t() is the cost of translating a prompt.
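
The comparison above can be written out a little more explicitly. This is only one reading of it, and the extra notation is an assumption rather than the comment's own: $T_{B \to A}(b)$ is the prompt $b$ translated into language $A$, $t(b)$ is the cost of producing that translation, and the costs are assumed to simply add.

```latex
% Left: direct processing in each language.
% Right: translate b into A first, then process the result.
f(a) \;<\; f(b) \;\overset{?}{<}\; t(b) + f\bigl(T_{B \to A}(b)\bigr)
```

If the right-hand inequality fails, translating first would be the cheaper route under these assumptions.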

Then there's the additional question of what happens with character-based languages. It's not obvious how it would make sense to assign multiple tokens to a single character, but there's the question of how much information there is in character-based vs. phonetic-based words and what the information content of sentences in either one is.
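
On multiple tokens for a single character: tokenizers that work on UTF-8 bytes (byte-level BPE, as used by GPT-2) start from bytes rather than characters, so a character with no learned merge can end up spanning several tokens. A minimal sketch of the byte counts alone follows. Whether any particular model mentioned in this thread tokenizes this way is an assumption.

```python
def utf8_byte_count(text: str) -> int:
    """Number of bytes the text occupies when encoded as UTF-8."""
    return len(text.encode("utf-8"))


# One ASCII letter, one accented Latin letter, two Chinese characters.
for ch in ["a", "\u00e9", "汉", "字"]:
    print(repr(ch), utf8_byte_count(ch))
```

A byte-level tokenizer sees each of those Chinese characters as three units until training merges them, which is one way a single character can cost more than one token.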

cyberge99•10m ago
English Teachers: “Proper grammar is cost effective!”

EmDash – a spiritual successor to WordPress that solves plugin security

https://blog.cloudflare.com/emdash-wordpress/
284•elithrar•3h ago•178 comments

TurboQuant KV Compression and SSD Expert Streaming for M5 Pro and IOS

https://github.com/SharpAI/SwiftLM
34•aegis_camera•1h ago•10 comments

Ask HN: Who is hiring? (April 2026)

127•whoishiring•4h ago•101 comments

StepFun 3.5 Flash is #1 cost-effective model for OpenClaw tasks (300 battles)

https://app.uniclaw.ai/arena?tab=costEffectiveness&via=hn
88•skysniper•3h ago•33 comments

AI for American-produced cement and concrete

https://engineering.fb.com/2026/03/30/data-center-engineering/ai-for-american-produced-cement-and...
71•latchkey•2h ago•64 comments

NASA Artemis II moon mission live launch broadcast

https://plus.nasa.gov/scheduled-video/nasas-artemis-ii-crew-launches-to-the-moon-official-broadcast/
184•apitman•2h ago•91 comments

Show HN: Git bayesect – Bayesian Git bisection for non-deterministic bugs

https://github.com/hauntsaninja/git_bayesect
20•hauntsaninja•3d ago•5 comments

An Introduction to Writing Systems and Unicode

https://r12a.github.io/scripts/tutorial/part2
18•mariuz•3d ago•5 comments

CERN levels up with new superconducting karts

https://home.cern/news/news/engineering/cern-levels-new-superconducting-karts
352•fnands•12h ago•79 comments

Show HN: Zerobox – Sandbox any command with file, network, credential controls

https://github.com/afshinm/zerobox
43•afshinmeh•2d ago•43 comments

The OpenAI Graveyard: All the Deals and Products That Haven't Happened

https://www.forbes.com/sites/phoebeliu/2026/03/31/openai-graveyard-deals-and-products-havent-happ...
144•dherls•3h ago•111 comments

The AI Marketing BS Index

https://bastian.rieck.me/blog/2026/bs/
43•speckx•1h ago•4 comments

Show HN: Real-time dashboard for Claude Code agent teams

https://github.com/simple10/agents-observe
49•simple10•3h ago•17 comments

Is BGP safe yet?

https://isbgpsafeyet.com/
199•janandonly•6h ago•69 comments

Random numbers, Persian code: A mysterious signal transfixes radio sleuths

https://www.rferl.org/a/mystery-numbers-station-persian-signal-iran-war/33700659.html
79•thinkingemote•7h ago•82 comments

Apple at 50

https://www.apple.com/
20•janandonly•43m ago•2 comments

Ada and Spark on ARM Cortex-M – A Tutorial with Arduino and Nucleo Examples

http://inspirel.com/articles/Ada_On_Cortex.html
38•swq115•4d ago•9 comments

Intuiting Pratt Parsing

https://louis.co.nz/2026/03/26/pratt-parsing.html
122•signa11•2d ago•40 comments

Randomness on Apple Platforms (2024)

https://blog.xoria.org/randomness-on-apple-platforms/
40•surprisetalk•5d ago•1 comment

Wasmer (YC S19) Is Hiring – Rust and DevRel Positions

https://www.workatastartup.com/companies/wasmer
1•syrusakbary•7h ago

Consider the Greenland Shark (2020)

https://www.lrb.co.uk/the-paper/v42/n09/katherine-rundell/consider-the-greenland-shark
70•mooreds•5d ago•28 comments

Claude Wrote a Full FreeBSD Remote Kernel RCE with Root Shell (CVE-2026-4747)

https://github.com/califio/publications/blob/main/MADBugs/CVE-2026-4747/write-up.md
205•ishqdehlvi•14h ago•91 comments

Playing Wolfenstein 3D with one hand in 2026

https://arstechnica.com/gaming/2026/03/playing-wolfenstein-3d-with-one-hand-in-2026/
23•Brajeshwar•4d ago•9 comments

Show HN: CLI to order groceries via reverse-engineered REWE API (Haskell)

https://github.com/yannick-cw/korb
181•wazHFsRy•2d ago•78 comments

Ukrainian Drone Holds Position for 6 Weeks

https://defenceleaders.com/news/ukrainian-combat-robot-holds-frontline-position-for-six-weeks-in-...
59•AftHurrahWinch•1h ago•20 comments

SpaceX Files to Go Public

https://www.nytimes.com/2026/04/01/technology/spacex-ipo-elon-musk.html
28•nutjob2•1h ago•16 comments

The Document Foundation ejects its core developers

https://www.collaboraonline.com/blog/tdf-ejects-its-core-developers/
73•hackernewsblues•8h ago•37 comments

A dot a day keeps the clutter away

https://scottlawsonbc.com/post/dot-system
510•scottlawson•22h ago•155 comments

Claude Code Unpacked : A visual guide

https://ccunpacked.dev/
960•autocracy101•14h ago•347 comments

Chess in SQL

https://www.dbpro.app/blog/chess-in-pure-sql
166•upmostly•3d ago•40 comments