frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Fast and Quality Code Chunking with Chonkie

1•snyy•12mo ago
Hi HN,

We’re Chonkie (https://github.com/chonkie-inc/chonkie) — we build open source tools that help split documents into meaningful chunks for use with AI models.

When you use LLMs over large documents or codebases, you often need to break them into smaller parts to fit the model’s context window. Our chunkers do this in a smart way: they preserve structure and meaning, so only the most relevant pieces are passed into the model. This reduces hallucinations, avoids confusion, and improves performance and accuracy.

Today we’re launching our Code Chunker — a fast, structure-aware way to break down source code into high-quality, token-aware chunks.

How it works:

(See the code: https://github.com/chonkie-inc/chonkie/blob/main/src/chonkie...)

Code Chunker uses tree-sitter (https://tree-sitter.github.io/tree-sitter/) to parse your code into an abstract syntax tree (AST). It then recursively merges and groups nodes in a way that respects both code structure and token limits.

It supports all languages that tree-sitter supports, and is designed to preserve formatting and semantics. Large functions or class definitions won’t be split in the middle of a block — instead, we dive recursively into the AST to produce clean, coherent chunks that fit your configured token budget.

What it’s useful for:

  - Embedding-based code search

  - RAG (retrieval-augmented generation) over codebases

  - Long-context analysis of code

  - Preparing repos for fine-tuning or pretraining
Try it out:

  - Open source package: https://docs.chonkie.ai/chunkers/code-chunker

  - Hosted playground (free with account): https://cloud.chonkie.ai
Happy Chonking!

Help Me with Multiverse OS

https://chatgpt.com/gg/v/69e7247e7e408196817bdce7534163a8?token=4gu4Y6NwS3xz3phbYpOpeA
1•liljoe•42s ago•0 comments

CC: A P2P Network for Reproducible Autoresearch Code Experiments

https://community.computer/
1•aiw1nt3rs•51s ago•0 comments

All your agents are going async

https://zknill.io/posts/all-your-agents-are-going-async/
1•zknill•1m ago•0 comments

The Anatomy of Tool Calling in LLMs: A Deep Dive

https://martinuke0.github.io/posts/2026-01-07-the-anatomy-of-tool-calling-in-llms-a-deep-dive/
1•tonyl•1m ago•0 comments

Xonsh shell 0.23 REFORGED – not just a release

https://github.com/xonsh/xonsh/releases/tag/0.23.0
1•combisearch•3m ago•1 comments

Show HN: DSS, a new human-readable and plain format for XLS and spreadsheets

https://github.com/Datastripes/DataSheetStandard/
1•vinserello•4m ago•0 comments

Show HN: OpenBridge – turn web chat access into an OpenAI-compatible endpoint

2•linuz•6m ago•0 comments

The Pirate Bay's Oldest Torrent Turned 22

https://torrentfreak.com/the-pirate-bays-oldest-torrent-turned-22/
3•franczesko•11m ago•0 comments

Heatwaves in the Indo-Gangetic Plains: Why Local Conditions Matter

https://www.iitb.ac.in/research-highlight/heatwaves-indo-gangetic-plains-why-local-land-and-atmos...
1•akbarnama•15m ago•0 comments

A Hot-Air Balloon Landed in a California Backyard. The Owner Says It's A '

https://www.wired.com/story/a-brief-interview-with-the-owner-of-the-hot-air-balloon-that-landed-i...
1•joozio•17m ago•0 comments

Less human AI agents, please

https://nial.se/blog/less-human-ai-agents-please/
5•nialse•22m ago•1 comments

Show HN: Alignear – Client communication layer for Linear teams

https://alignear.com/
3•madatbay•23m ago•0 comments

The Fencing Visualization System

https://bsky.app/profile/kcimc.bsky.social/post/3mjxchuwkzs2v
4•mariuz•36m ago•0 comments

Berea college makes tuition free with its endowment

https://www.theatlantic.com/education/archive/2018/10/how-berea-college-makes-tuition-free-with-i...
3•KnuthIsGod•42m ago•0 comments

Iran claims US backdoors knocked out networking equipment

https://www.theregister.com/2026/04/21/iran_claims_us_used_backdoors/
4•defrost•47m ago•1 comments

Writing Node.js Addons with .NET Native AOT

https://devblogs.microsoft.com/dotnet/writing-nodejs-addons-with-dotnet-native-aot/
3•soheilpro•53m ago•0 comments

Using Changesets in a polyglot monorepo

https://luke.hsiao.dev/blog/changesets-polyglot-monorepo/
4•lwhsiao•54m ago•0 comments

Louis Zocchi, inventor of the d100, has died

https://icv2.com/articles/news/view/62176/r-i-p-louis-zocchi-the-godfather-dice
11•sgbeal•1h ago•1 comments

Ask HN: What are some of your favorite dedication pages in a book?

3•chistev•1h ago•0 comments

Palantir manifesto – 'ramblings of a supervillain' amid UK contract fears

https://www.theguardian.com/technology/2026/apr/21/palantir-manifesto-uk-contract-fears-mps
4•mindracer•1h ago•0 comments

WinShader: Curated GL Shaders in a Screensaver

https://github.com/PsyChip/WinShader
3•psychip•1h ago•0 comments

A mad undertaking: An undefinitive guide to the Aadam Jacobs collection

https://aadamjacobscollection.org/
3•wise_blood•1h ago•0 comments

OMGfixMD – Comment on Markdown like it's a doc

https://omgfixmd.com/
3•ladiamant•1h ago•0 comments

Cocaine alters the movement of salmon in a large natural lake

https://www.sciencedirect.com/science/article/pii/S0960982226003155
3•tobr•1h ago•1 comments

Google Cloud in the list of 4 EU sovereign cloud providers

https://www.theregister.com/2026/04/20/europe_picks_4_sovereign_cloud/
1•kouzant•1h ago•1 comments

Amazon 'strong-armed' Levi's, Hanes to hike prices on rival sites, DA says

https://www.cnbc.com/2026/04/20/california-da-amazon-price-fixing-walmart-target.html
5•1vuio0pswjnm7•1h ago•0 comments

Canada has banned employers from ghosting job candidates

https://www.positive.news/society/canada-has-banned-employers-from-ghosting-job-candidates/
7•jethronethro•1h ago•0 comments

Types and Neural Networks

https://www.brunogavranovic.com/posts/2026-04-20-types-and-neural-networks.html
5•bgavran•1h ago•1 comments

With Orban Out, the Pianist András Schiff Plans a Return to Hungary

https://www.nytimes.com/2026/04/20/arts/music/andras-schiff-piano-viktor-orban-hungary.html
2•mykowebhn•1h ago•0 comments

In major policy shift, Japan scraps limits on lethal arms exports

https://www.japantimes.co.jp/news/2026/04/21/japan/politics/japan-lethal-weapons-export-rules-eased/
2•geox•1h ago•0 comments