Show HN: Fast and Quality Code Chunking with Chonkie

1•snyy•1y ago
Hi HN,

We’re Chonkie (https://github.com/chonkie-inc/chonkie) — we build open source tools that help split documents into meaningful chunks for use with AI models.

When you use LLMs over large documents or codebases, you often need to break them into smaller parts that fit the model’s context window. Our chunkers split along structural and semantic boundaries, so each chunk stays coherent and only the most relevant pieces need to be passed to the model. This helps reduce hallucinations and improves accuracy.

Today we’re launching our Code Chunker — a fast, structure-aware way to break down source code into high-quality, token-aware chunks.

How it works:

(See the code: https://github.com/chonkie-inc/chonkie/blob/main/src/chonkie...)

Code Chunker uses tree-sitter (https://tree-sitter.github.io/tree-sitter/) to parse your code into an abstract syntax tree (AST). It then recursively merges and groups nodes in a way that respects both code structure and token limits.

It supports all languages that tree-sitter supports, and is designed to preserve formatting and semantics. A large function or class definition is never cut at an arbitrary point in the middle of a block. Instead, we descend recursively into the AST and split at node boundaries to produce clean, coherent chunks that fit your configured token budget.
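To make the recursive idea concrete, here is a minimal sketch. It is not Chonkie's implementation. It swaps in Python's built-in `ast` module for tree-sitter (so it only handles Python source) and counts whitespace-separated words instead of real tokens. The names `count_tokens`, `split_points`, and `chunk_code` are hypothetical and exist only for illustration.

```python
import ast


def count_tokens(text: str) -> int:
    # Assumption: whitespace-separated words stand in for a real tokenizer.
    return len(text.split())


def split_points(node: ast.AST, lines: list[str], limit: int) -> list[int]:
    """Collect 1-based line numbers where a chunk is allowed to start.

    Every statement is a candidate boundary. A statement whose text exceeds
    the budget is descended into so its children become boundaries as well.
    """
    starts = []
    for child in ast.iter_child_nodes(node):
        if not isinstance(child, (ast.stmt, ast.excepthandler)):
            continue
        # Keep decorators attached to the definition they decorate.
        decorators = getattr(child, "decorator_list", [])
        start = min([child.lineno] + [d.lineno for d in decorators])
        starts.append(start)
        text = "".join(lines[start - 1 : child.end_lineno])
        if count_tokens(text) > limit:
            starts.extend(split_points(child, lines, limit))
    return starts


def chunk_code(source: str, limit: int) -> list[str]:
    """Split source into chunks that concatenate back to the original text."""
    lines = source.splitlines(keepends=True)
    starts = sorted(set(split_points(ast.parse(source), lines, limit)) | {1})
    bounds = starts + [len(lines) + 1]
    # Each unit runs from one boundary up to the next, so blank lines and
    # comments between statements are kept rather than dropped.
    units = ["".join(lines[a - 1 : b - 1]) for a, b in zip(bounds, bounds[1:])]

    chunks, current = [], ""
    for unit in units:
        # Greedily merge neighbouring units while the budget allows it.
        # A single statement larger than the budget still becomes one
        # oversized chunk; this sketch does not split below statement level.
        if current and count_tokens(current + unit) > limit:
            chunks.append(current)
            current = unit
        else:
            current += unit
    if current:
        chunks.append(current)
    return chunks


SAMPLE = '''import os

def small():
    return 1

def big(x):
    a = x + 1
    b = a * 2
    c = b - 3
    return a + b + c
'''

for i, chunk in enumerate(chunk_code(SAMPLE, limit=12)):
    print(f"--- chunk {i} ({count_tokens(chunk)} tokens) ---")
    print(chunk, end="")
```

Because the units are contiguous line ranges, joining the chunks reproduces the input exactly, which is one simple way to get the formatting-preserving behaviour described above. A tree-sitter version would follow the same shape but work on byte offsets of syntax nodes rather than line numbers.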

What it’s useful for:

  - Embedding-based code search

  - RAG (retrieval-augmented generation) over codebases

  - Long-context analysis of code

  - Preparing repos for fine-tuning or pretraining

Try it out:

  - Open source package: https://docs.chonkie.ai/chunkers/code-chunker

  - Hosted playground (free with account): https://cloud.chonkie.ai

Happy Chonking!
