frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Fast and Quality Code Chunking with Chonkie

1•snyy•11mo ago
Hi HN,

We’re Chonkie (https://github.com/chonkie-inc/chonkie) — we build open source tools that help split documents into meaningful chunks for use with AI models.

When you use LLMs over large documents or codebases, you often need to break them into smaller parts to fit the model’s context window. Our chunkers do this in a smart way: they preserve structure and meaning, so only the most relevant pieces are passed into the model. This reduces hallucinations, avoids confusion, and improves performance and accuracy.

Today we’re launching our Code Chunker — a fast, structure-aware way to break down source code into high-quality, token-aware chunks.

How it works:

(See the code: https://github.com/chonkie-inc/chonkie/blob/main/src/chonkie...)

Code Chunker uses tree-sitter (https://tree-sitter.github.io/tree-sitter/) to parse your code into an abstract syntax tree (AST). It then recursively merges and groups nodes in a way that respects both code structure and token limits.

It supports all languages that tree-sitter supports, and is designed to preserve formatting and semantics. Large functions or class definitions won’t be split in the middle of a block — instead, we dive recursively into the AST to produce clean, coherent chunks that fit your configured token budget.

What it’s useful for:

  - Embedding-based code search

  - RAG (retrieval-augmented generation) over codebases

  - Long-context analysis of code

  - Preparing repos for fine-tuning or pretraining
Try it out:

  - Open source package: https://docs.chonkie.ai/chunkers/code-chunker

  - Hosted playground (free with account): https://cloud.chonkie.ai
Happy Chonking!

Printed neurons communicate with living brain cells

https://news.northwestern.edu/stories/2026/4/printed-neurons-communicate-with-living-brain-cells
1•robbomacrae•2m ago•0 comments

I collect first sale stories from founders

https://firstsalestories.com
1•jyriso•2m ago•0 comments

There Are Three Aardvarks (2024)

https://freakytrigger.co.uk/wedge/2024/02/there-are-three-aardvarks
1•wise_blood•4m ago•0 comments

Alchemy and Machinery: What Apple's Steve Jobs Can Teach Pronatalists

https://www.governance.fyi/p/alchemy-and-machinery-what-apples
1•guardianbob•5m ago•0 comments

How Silicon Valley Is Turning Scientists into Exploited Gig Workers

https://www.thenation.com/article/society/ai-silicon-valley-andreesen-thiel-stem/
1•ZunarJ5•7m ago•0 comments

Content vs. Curation

https://bayinformationsystems.substack.com/p/content-is-dead-the-curation-inversion
1•anax32•7m ago•0 comments

P-e-n-i-s Costume Protester Prevails in Court

https://www.courthousenews.com/penis-costume-protester-prevails-in-court/
2•ludicrousdispla•9m ago•0 comments

Ego Turns Good Engineers into Bad Teammates

https://shiftmag.dev/developers-your-ego-is-the-real-bug-in-the-system-7657/
3•choochilla4•9m ago•0 comments

Jar of NIST peanut butter for $2,050

https://www.sigmaaldrich.com/US/en/product/sial/nist2387
1•wglass•11m ago•0 comments

Let's talk about AI slop in open source

https://archestra.ai/blog/only-responsible-ai
2•motakuk•11m ago•0 comments

How (and why) we rewrote our production C++ front end infrastructure in Rust

https://blog.nearlyfreespeech.net/2026/04/17/how-and-why-we-rewrote-our-production-c-frontend-inf...
1•mjyut•13m ago•0 comments

Name in Landsat

https://science.nasa.gov/mission/landsat/outreach/your-name-in-landsat/
1•tzury•13m ago•0 comments

Cycles of disruption in the tech industry: with Kent Beck and Martin Fowler

https://newsletter.pragmaticengineer.com/p/cycles-of-disruption-in-the-tech
1•taubek•19m ago•0 comments

How to write to /dev/rdiskX without root: a look at macOS authopen

https://tech.dreamleaves.org/posts/how-to-write-to-dev-rdiskx-without-root-a-look-at-macos-authopen/
1•joshguthrie•24m ago•0 comments

I Got Tired of Shipping in Silence, So I Built DopaAI

https://www.indiehackers.com/post/i-got-tired-of-shipping-in-silence-so-i-built-dopaai-cec5e3f2ea
1•gingfuu__•25m ago•0 comments

Robbers hold 25 hostage at Naples bank before fleeing through hole in floor

https://www.theguardian.com/world/2026/apr/16/armed-robbers-hostages-naples-bank-flee-hole-floor-...
2•Coral-Tiny•30m ago•0 comments

Agents make you insanely productive, except if you are knowledge worker

https://mrprompty.com/
2•ViktorPetrov•31m ago•1 comments

A History of Teapots and Unix

https://discuss.systems/@thalia/116417242648384997
1•signa11•34m ago•0 comments

Qwen3.6 35B A3B is THE ONE The Local LLM Champ on OpenCode benchmark dashboard [video]

https://www.youtube.com/watch?v=vlo5cxH5CXM
1•grigio•35m ago•0 comments

DockLock Pro – Stop macOS Dock from Jumping Screens

https://docklockpro.com/
1•usui•40m ago•0 comments

OpenAI's GPT-5.4 Pro reportedly solves an open Erdős problem in two hours

https://the-decoder.com/openais-gpt-5-4-pro-reportedly-solves-a-longstanding-open-erdos-math-prob...
2•voisin•41m ago•0 comments

Wikipedia: AI or Not Quiz

https://en.wikipedia.org/wiki/Wikipedia:AI_or_not_quiz
1•alibarber•43m ago•0 comments

ReSyn: A Generalized Recursive Regular Expression Synthesis Framework

https://arxiv.org/abs/2603.24624
1•PaulHoule•43m ago•0 comments

2-day-old GitHub account added AI-generated dependency to Mailgen (2.5k stars)

https://github.com/eladnava/mailgen/pull/86
3•foray1010•45m ago•3 comments

Anti-Amyloid Antibodies for Alzheimer: You Know

https://www.science.org/content/blog-post/anti-amyloid-antibodies-alzheimer-you-already-know
3•u1hcw9nx•51m ago•1 comments

Tesla is facing up to $14.5B in lawsuits – and it's only getting worse

https://electrek.co/2026/04/16/tesla-facing-up-to-14-billion-lawsuits-deep-dive/
6•breve•54m ago•1 comments

The Feeling of Power – Isaac Asimov

https://hex.ooo/library/power.html
3•MSFT_Edging•55m ago•3 comments

Floating Point Fun on Cortex-M Processors

https://danielmangum.com/posts/floating-point-cortex-m/
2•hasheddan•56m ago•0 comments

Local LLM agent with persistent memory and learnable skills

https://github.com/nevenkordic/localmind
1•yotta25•59m ago•0 comments

Interviewing Japanese about Trump's Pearl Harbor Response [video]

https://www.youtube.com/watch?v=jS0ZjVbzGWg
1•keepamovin•59m ago•0 comments