Building a Zero-Allocation, SIMD-Accelerated CSV Parser in Zig

1•peymo•1h ago

Comments

peymo•1h ago

CSV looks simple until you try to parse it fast.

What started as a small utility for a personal project turned into a month-long deep dive into performance engineering, SIMD, and the surprisingly sharp edges of CSV parsing. My goal was straightforward: build a simple, zero-allocation CSV iterator and writer in Zig that could handle real-world inputs without sacrificing performance.

Along the way, I explored a number of parsing strategies, including approaches inspired by a well-known paper on SIMD-accelerated JSON parsing. While that technique is elegant and highly effective for JSON, I found it didn’t translate cleanly to CSV—at least not without giving up more performance than I was willing to accept. CSV’s delimiter-heavy structure and quoting rules demanded a different approach.

After iterating through several designs and benchmarking them against each other, I eventually converged on a technique that consistently outperformed my earlier implementations. When I compared the final version against some of the fastest CSV libraries I could find, the results were better than I expected.

To make those comparisons reproducible, I put together a small benchmark suite here: https://github.com/peymanmortazavi/csv-race

And the actual implementation is here: https://github.com/peymanmortazavi/csv-zero

This post walks through the design decisions behind csv-zero, the tradeoffs I made, and the techniques that ended up mattering the most. It’s also a bit of a love letter to Zig: working in the language made it much easier to reason about memory, data layout, and performance, and pushed me to tackle problems I would have otherwise avoided.

If you’re curious about SIMD-based parsing, zero-allocation APIs, or just want to see how far you can push CSV, read on.

I don't like imports

Does Higher VO₂ Max Make You More Attractive?

I went back to Linux and it was a mistake

Learning by hand is better than learning by AI

A Note on Flat Abstract Syntax Trees

AI After Drug Development

We Just Discovered Why Light Does This [video]

Irish man with valid US work permit held in ICE detention for five months

Olympics.com cookie acceptance button text: "Yes, I am happy"

Ask HN: What's blocking you from trusting AI agents with your real data?

AI Front end QA tester

Show HN: Claude Code from your phone via Telegram

CPUs Are Back: The Datacenter CPU Landscape in 2026

Show HN: Tool to Visualize Claude Code and Agents SDK Executions

Mrinank Sharma Resigns from Anthropic

A open source pageindex implementation

GPT-5.3 Codex vs. Claude Opus 4.6

Hackers and Painters (2004)

The Most Popular Agentic Open-Source Tools (2026 Edition)

Shared LoRA Subspaces for Almost Strict Continual Learning

The Markets of Old London

Databricks Grows >65% YoY, Surpasses $5.4B Revenue

Thank you HN: For the daily fire

GPT-5.3 Codex is now available in Cursor

Why the Humble Capacitor Is the Electric Car Industry's New Crisis

The importance of human touch in AI-driven development

Decoding China's New Space Philosophy

A Software Engineer's Wish List for CS Research

I built an AI operating system for car shopping and Research

Eddie Bauer, venerable outdoor apparel retailer, declares bankruptcy