frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Actual Claude Tokenizer

https://tokenizer.robkopel.me
2•robkop•1h ago
I've seen a few "Claude tokenizers" floating around lately with all the 4.7 chatter, but most of them just hit the count_tokens endpoint and hand you back a number. You don't actually see how your text gets split or understand the changes from 4.6 to 4.7.

I built this a while back for doing some mech interp research. It faithfully represents Claude token splitting - showing hidden tokens, real boundaries and so on. It is not cheap to run - essentially n^2 cost - you could optimise for longer sequences but you are not guaranteed a faithful representation if so.

Open Source: https://github.com/R0bk/claude-tokenizer

Feedback welcome, let me know if there are any edge cases that look wrong.

P.S. I'd expect this to face a similar fate as streaming chunk and prefill based token extraction methods did. I do worry about the ability to do independent research once it's fully closed off and would love it if there was more public frontier tokenizers.

So What If They Have My Data?

https://cardcatalogforlife.substack.com/p/so-what-if-they-have-my-data
1•speckx•1m ago•0 comments

Kimi K2.6: Advancing Open-Source Coding

https://twitter.com/Kimi_Moonshot/status/2046249571882500354
1•nekofneko•2m ago•0 comments

Licensing Best Practices for the Sharing of Scientific Data

https://creativecommons.org/2026/04/20/licensing-best-practices-for-the-sharing-of-scientific-data/
1•Tomte•2m ago•0 comments

The printing press for biological data (Sterling Hooten)

https://www.owlposting.com/p/the-printing-press-for-biological
1•crescit_eundo•3m ago•0 comments

MoA-X: Mixture of Agents Orchestration Framework

https://github.com/drivelineresearch/moa-x
1•icelancer•5m ago•0 comments

Top Gun 3 Is Happening: The Need for Speed Lives On

https://avgeekery.com/top-gun-3-is-happening/
1•freediver•5m ago•0 comments

Anthropic tests user trust with ID and selfie checks for Claude

https://www.helpnetsecurity.com/2026/04/16/anthropic-claude-identity-verification-government-id/
1•mooreds•6m ago•0 comments

The "AI Vulnerability Storm": Building a "Mythos- Ready" Security Program [pdf]

https://labs.cloudsecurityalliance.org/wp-content/uploads/2026/04/mythosreadyv4.pdf
1•JumpCrisscross•7m ago•0 comments

I'm never buying another Kindle, and neither should you

https://www.androidauthority.com/amazon-kindle-2026-3657863/
3•mikhael•7m ago•0 comments

TIL: Checksumming Files Recursively with Rclone

https://heitorpb.github.io/bla/recursive-checksum/
1•hpb42•8m ago•0 comments

Badvibes – Lint for Vibe Coders

https://www.npmjs.com/package/badvibes
1•muoco-01•8m ago•0 comments

Show HN: A web-based replacement for Nvidia's CUDA occupancy spreadsheet

https://toolbelt.widgita.xyz/cuda-occupancy-calculator/
1•fairlight1337•8m ago•0 comments

Tech CEOs Think AI Will Let Them Be Everywhere at Once

https://www.wired.com/story/tech-ceos-using-ai-to-be-everywhere-at-once/
1•Brajeshwar•8m ago•0 comments

Has cosmic philosophy conjectures infiltrated AI?

https://medium.com/@f9121212/has-cosmic-philosophy-conjectures-infiltrated-ai-15559d03b8e9
1•ortrich•9m ago•0 comments

Astronaut's astounding iPhone 17 Pro Max video shows 'Earthset' from space

https://9to5mac.com/2026/04/20/astronauts-astounding-iphone-17-pro-max-video-shows-earthset-from-...
1•omer_k•9m ago•0 comments

Baltic nations brace for impact of Iran war delaying US weapons shipments

https://www.defensenews.com/global/europe/2026/04/20/baltic-nations-brace-for-impact-of-iran-war-...
2•Teever•9m ago•0 comments

Kimi K2.6: Advancing Open-Source Coding

https://www.kimi.com/blog/kimi-k2-6
4•meetpateltech•10m ago•0 comments

Why I De-Googled

https://kevinboone.me/why_i_degoogled.html
3•HotGarbage•10m ago•0 comments

How Cybercrime Became a Leading Industry in 'Scambodia'

https://www.wsj.com/world/asia/cambodia-cybercrime-rise-why-2f2c03cc
1•thm•10m ago•0 comments

My practitioner view of program analysis

https://sawyer.dev/posts/practitioner-program-analysis/
1•evakhoury•10m ago•0 comments

Are browsers (fully) Opus ready yet?

https://opusready.netlify.app/
1•midzer•11m ago•0 comments

Wife Acceptance Factor

https://en.wikipedia.org/wiki/Wife_acceptance_factor
2•neko_ranger•12m ago•0 comments

Best Wispr Flow Alternatives for Android

https://www.yaps.ai/blog/wispr-flow-alternative-android
1•RichAwo•14m ago•0 comments

Books Are Not Remotely Too Expensive

https://www.millersbookreview.com/p/no-books-are-not-remotely-too-expensive
1•herbertl•16m ago•0 comments

I prompted ChatGPT, Claude, Perplexity, and Gemini and watched my Nginx logs

https://surfacedby.com/blog/nginx-logs-ai-traffic-vs-referral-traffic
6•startages•16m ago•0 comments

You Can Purchase Non-Smart TVs from Samsung

https://www.samsung.com/us/business/displays/commercial-tvs/
2•speckx•16m ago•1 comments

The Mystery in the Medicine Cabinet: Acetaminophen, ibuprofen, and what to know

https://asteriskmag.com/issues/14/the-mystery-in-the-medicine-cabinet
2•nkurz•16m ago•0 comments

Kimi K2.6

https://huggingface.co/moonshotai/Kimi-K2.6
3•kbumsik•17m ago•0 comments

'Reefer Madness,' the PSA That Backfired Spectacularly

https://www.nytimes.com/2026/04/20/movies/reefer-madness-psa-1936.html
1•Teever•17m ago•0 comments

Cancelling ARM deployments in Topaz – what it means for an emulator

https://topaz.thecloudtheory.com/blog/arm-deployment-cancel/
1•kamilmrzyglod•18m ago•0 comments