Hi HN,

I built Splintr, a BPE tokenizer in Rust (with Python bindings), because I found existing Python-based tokenizers were bottlenecking my data processing pipelines.

While OpenAI's tiktoken is the gold standard for correctness, I found I could get significantly better throughput on modern multi-core CPUs by rethinking how parallelism is applied.

Splintr achieves ~111 MB/s batch throughput (vs ~9 MB/s for tiktoken).

The Design Choice: "Sequential by Default"

One of the most interesting findings during development was that naive parallelism actually hurts performance for typical LLM inputs. Thread pool overhead is significant for texts under 1 MB.

I implemented a hybrid strategy:

Single Text (encode): Purely sequential. It’s 3-4x faster than tiktoken simply by using pcre2 with JIT instead of standard regex handling.

Batch Processing (encode_batch): Parallelizes across texts using Rayon, rather than within a text. This saturates all cores without the overhead of splitting small strings.
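To make the two code paths above concrete, here is a minimal Python sketch of the same shape. It is not Splintr's implementation: the `encode` below is a toy stand-in that emits one "token" per UTF-8 byte, and the standard library's `ThreadPoolExecutor` is swapped in for Rayon purely for illustration. In pure Python the GIL means this will not show a real speedup; the point is only the structure, where each text is encoded sequentially and the batch is spread across workers.

```python
from concurrent.futures import ThreadPoolExecutor
from typing import List, Optional, Sequence


def encode(text: str) -> List[int]:
    """Toy sequential encoder (assumption): one token per UTF-8 byte.

    A real BPE encoder would do regex pre-splitting and merges here,
    but it would still process a single text on a single thread.
    """
    return list(text.encode("utf-8"))


def encode_batch(texts: Sequence[str],
                 max_workers: Optional[int] = None) -> List[List[int]]:
    """Parallelize across texts, never within one text.

    Each worker runs the plain sequential `encode` on whole texts,
    so short strings are never split into sub-chunks.
    """
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        # pool.map preserves input order in its results.
        return list(pool.map(encode, texts))


if __name__ == "__main__":
    print(encode("hi"))
    print(encode_batch(["hello", "world"]))
```

The unit of work is a whole text, so scheduling cost is paid once per text rather than once per fragment of a text.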

Other Features:

Safety: Strict UTF-8 compliance, including a streaming decoder that correctly buffers incomplete multi-byte characters.

Compatibility: Drop-in support for cl100k_base (GPT-4), o200k_base (GPT-4o), and llama3 vocabularies.
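The streaming decoder behavior mentioned under Safety can be sketched as follows. This is a hypothetical illustration, not Splintr's code: the class name `StreamingUtf8Decoder` and its `feed` method are made up for this example. The idea is to hold back any trailing bytes that start a multi-byte character but do not yet complete it, and emit them once the rest arrives.

```python
class StreamingUtf8Decoder:
    """Hypothetical sketch: buffers an incomplete trailing UTF-8 sequence."""

    def __init__(self) -> None:
        self._pending = b""

    def feed(self, chunk: bytes) -> str:
        data = self._pending + chunk
        cut = len(data)
        # Look at up to the last 4 bytes for a lead byte whose
        # sequence is not yet complete.
        for back in range(1, min(4, len(data)) + 1):
            byte = data[-back]
            if byte & 0xC0 == 0x80:
                continue  # continuation byte, keep looking for its lead
            if byte >= 0xF0:
                need = 4
            elif byte >= 0xE0:
                need = 3
            elif byte >= 0xC0:
                need = 2
            else:
                need = 1  # ASCII
            if need > back:
                cut = len(data) - back  # hold back the partial character
            break
        self._pending = data[cut:]
        # Strict decoding: genuinely invalid bytes raise UnicodeDecodeError.
        return data[:cut].decode("utf-8")


if __name__ == "__main__":
    dec = StreamingUtf8Decoder()
    raw = "héllo €".encode("utf-8")
    # Feed one byte at a time to simulate token-by-token output.
    print("".join(dec.feed(bytes([b])) for b in raw))
```

In Python specifically, `codecs.getincrementaldecoder("utf-8")` from the standard library provides the same buffering behavior; the manual version is shown only to make the mechanism visible.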

The repo is written in Rust with PyO3 bindings. I’d love feedback on the implementation or other potential optimization tricks for BPE.

Thanks!

Show HN: Splintr – Rust BPE tokenizer, 12x faster than tiktoken for batches