frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Fast and Quality Code Chunking with Chonkie

1•snyy•1y ago
Hi HN,

We’re Chonkie (https://github.com/chonkie-inc/chonkie) — we build open source tools that help split documents into meaningful chunks for use with AI models.

When you use LLMs over large documents or codebases, you often need to break them into smaller parts to fit the model’s context window. Our chunkers do this in a smart way: they preserve structure and meaning, so only the most relevant pieces are passed into the model. This reduces hallucinations, avoids confusion, and improves performance and accuracy.

Today we’re launching our Code Chunker — a fast, structure-aware way to break down source code into high-quality, token-aware chunks.

How it works:

(See the code: https://github.com/chonkie-inc/chonkie/blob/main/src/chonkie...)

Code Chunker uses tree-sitter (https://tree-sitter.github.io/tree-sitter/) to parse your code into an abstract syntax tree (AST). It then recursively merges and groups nodes in a way that respects both code structure and token limits.

It supports all languages that tree-sitter supports, and is designed to preserve formatting and semantics. Large functions or class definitions won’t be split in the middle of a block — instead, we dive recursively into the AST to produce clean, coherent chunks that fit your configured token budget.

What it’s useful for:

  - Embedding-based code search

  - RAG (retrieval-augmented generation) over codebases

  - Long-context analysis of code

  - Preparing repos for fine-tuning or pretraining
Try it out:

  - Open source package: https://docs.chonkie.ai/chunkers/code-chunker

  - Hosted playground (free with account): https://cloud.chonkie.ai
Happy Chonking!

With Claude: Less Coding, More Testing

https://henrikwarne.com/2026/05/31/with-claude-less-coding-more-testing/
1•ingve•2m ago•0 comments

The 2026-07-28 MCP Specification Release Candidate

https://blog.modelcontextprotocol.io/posts/2026-07-28-release-candidate/
1•gmays•2m ago•0 comments

In Malaysia, there was the first violent reaction against age verification laws

https://old.reddit.com/r/privacy/comments/1tq9e24/in_malaysia_there_was_a_bomb_threat_incident/
1•mostcallmeyt•5m ago•0 comments

People Getting Falsely Accused of Using AI to Write

https://nymag.com/intelligencer/article/the-people-getting-falsely-accused-of-using-ai-to-write.html
1•ColinWright•8m ago•0 comments

SICP Video Lectures (1986)

https://groups.csail.mit.edu/mac/classes/6.001/abelson-sussman-lectures/
2•tosh•8m ago•0 comments

Europe versus America: A Response to the Critics

https://paulkrugman.substack.com/p/europe-versus-america-a-response
1•_tk_•12m ago•0 comments

The Impact of AI-Assisted Development on Software Security

https://arxiv.org/abs/2603.15298
1•lucamark•14m ago•1 comments

Dav2d

https://jbkempf.com/blog/2026/dav2d/
2•captain_bender•14m ago•0 comments

The weatherman crucial to D-Day victory

https://www.bbc.com/culture/article/20260528-the-crucial-decision-that-helped-win-d-day
1•mellosouls•18m ago•0 comments

Robots are redefining the war in Ukraine – and forcing Russia onto the back foot

https://www.cnn.com/2026/05/30/europe/ukraine-robots-drones-russia-war-intl
1•rustoo•20m ago•0 comments

Screwing Up

https://www.seangoedecke.com/screwing-up/
2•gfysfm•21m ago•0 comments

Show HN: The Tired Engineer

https://www.thetiredengineer.com/
2•devtanna•21m ago•1 comments

The Despair of the Professor in the Age of A.I

https://www.newyorker.com/news/fault-lines/the-despair-of-the-professor-in-the-age-of-ai
1•NewCzech•22m ago•0 comments

Unlocking the Working Memory of Large Language Models for Latent Reasoning

https://arxiv.org/abs/2605.30343
1•korbip•23m ago•0 comments

Servo 0.2 Release

https://github.com/servo/servo/releases/tag/v0.2.0
1•pimeys•27m ago•1 comments

Selling to China's Muslims

https://www.middleeastbriefing.com/news/selling-to-chinas-muslims/
1•teleforce•30m ago•0 comments

Google planning to release 32M mosquitoes across Florida and California

https://twitter.com/BullTheoryio/status/2060810332831129782
1•king_zee•30m ago•1 comments

HeartMuLa – Open-Source Music Generation Model

https://github.com/HeartMuLa/heartlib
3•modinfo•31m ago•0 comments

Curated list of AI apps for visual creation

https://gist.github.com/seinecle/689a53bceca96147a04e93bdc5f83940
2•seinecle•32m ago•0 comments

James Webb spots the most chemically primitive galaxy in the ancient universe

https://www.universetoday.com/articles/astronomers-observe-the-most-chemically-primitive-galaxy-i...
1•flaburgan•32m ago•0 comments

Scenarios That Will Not Happen

https://radekmie.dev/blog/on-scenarios-that-will-not-happen/
1•birdculture•33m ago•0 comments

Japan's 2025 census reflects steepest fall in population on record, data shows

https://www.japantimes.co.jp/news/2026/05/29/japan/japan-population-largest-decline/
2•Teever•34m ago•0 comments

(An ((Even Better) Lisp) Interpreter (In Python))

http://norvig.com/lispy2.html
1•vismit2000•42m ago•0 comments

Australia to buy three second-hand United States submarines under AUKUS shake-up

https://www.abc.net.au/news/2026-05-31/australia-to-buy-second-hand-united-states-submarines-auku...
2•Teever•45m ago•0 comments

AI Agent that at inference time updates it's harness and model weights

https://github.com/hexo-ai/sia
4•martianvoid•45m ago•0 comments

The complexity of songs – Knuth (1984)

https://dl.acm.org/doi/pdf/10.1145/358027.358042
1•__patchbit__•45m ago•0 comments

Jonathan Daly's fav books about individuals who changed the way we see the world

https://bookdna.com/best-books/individuals-who-followed-their-muse-and-changed-th
1•bwb•50m ago•1 comments

Show HN: Free cloud-based tool for managing AI agents across multiple hosts

https://nodecartel.com/
3•itrinity•51m ago•0 comments

Telnet Song – Guy L Steele, Jr

http://wonderingminstrels.blogspot.com/2004/01/telnet-song-guy-l-steele-jr.html
2•__patchbit__•52m ago•0 comments

Mark Warren's favorite novels about a child's immersion into wilderness

https://bookdna.com/best-books/childs-immersion-into-wilderness
1•bwb•52m ago•1 comments