Show HN: Fast and Quality Code Chunking with Chonkie

1•snyy•11mo ago

Hi HN,

We’re Chonkie (https://github.com/chonkie-inc/chonkie) — we build open source tools that help split documents into meaningful chunks for use with AI models.

When you use LLMs over large documents or codebases, you often need to break them into smaller parts to fit the model’s context window. Our chunkers split along the document’s own structure, so each chunk stays meaningful on its own and only the most relevant pieces need to be passed into the model. This reduces hallucinations, avoids confusion, and improves performance and accuracy.

Today we’re launching our Code Chunker — a fast, structure-aware way to break down source code into high-quality, token-aware chunks.

How it works:

(See the code: https://github.com/chonkie-inc/chonkie/blob/main/src/chonkie...)

Code Chunker uses tree-sitter (https://tree-sitter.github.io/tree-sitter/) to parse your code into an abstract syntax tree (AST). It then recursively merges and groups nodes in a way that respects both code structure and token limits.

It supports all languages that tree-sitter supports, and is designed to preserve formatting and semantics. Large functions or class definitions won’t be cut at an arbitrary point in the middle of a block. Instead, when a definition exceeds the token budget, we dive recursively into the AST and split along child-node boundaries to produce clean, coherent chunks that fit your configured token budget.
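To make the merge-and-recurse idea concrete, here is a minimal sketch. It is not Chonkie's implementation: it assumes Python's built-in `ast` module as a stand-in for tree-sitter (so it only handles Python source), and it counts whitespace-separated words instead of real tokens. The names `count_tokens` and `chunk_source` are hypothetical.

```python
import ast
from typing import List, Tuple


def count_tokens(text: str) -> int:
    # Assumption: whitespace-separated words stand in for a real tokenizer.
    return len(text.split())


def chunk_source(source: str, max_tokens: int) -> List[str]:
    """Greedily merge sibling AST nodes into chunks of at most max_tokens,
    descending into a node's children when the node alone is too large."""
    lines = source.splitlines(keepends=True)
    tree = ast.parse(source)
    if not tree.body:
        # Nothing to parse into statements (empty or comment-only source).
        return [source] if source.strip() else []

    def text(start: int, end: int) -> str:
        # Line numbers are 1-indexed and inclusive, as in the ast module.
        return "".join(lines[start - 1:end])

    def fits(start: int, end: int) -> bool:
        return count_tokens(text(start, end)) <= max_tokens

    def split(nodes, start: int, end: int) -> List[Tuple[int, int]]:
        chunks = []
        cur_start = start  # first line of the group being built
        pos = start        # first line not yet assigned to any node
        for i, node in enumerate(nodes):
            # Each node owns the lines from `pos` to its own end, so blank
            # lines, comments, and headers stay attached to the next node.
            node_end = end if i == len(nodes) - 1 else node.end_lineno
            if node_end < pos:
                continue  # shares a line with the previous node
            if fits(cur_start, node_end):
                pos = node_end + 1  # merge the node into the current group
                continue
            if pos > cur_start:
                chunks.append((cur_start, pos - 1))  # flush the group
            if fits(pos, node_end):
                cur_start = pos  # the node starts a new group
            else:
                children = [
                    child for child in ast.iter_child_nodes(node)
                    if isinstance(child, (ast.stmt, ast.excepthandler))
                ]
                if children:
                    chunks.extend(split(children, pos, node_end))
                else:
                    # A single statement over budget is emitted as-is.
                    chunks.append((pos, node_end))
                cur_start = node_end + 1
            pos = node_end + 1
        if pos > cur_start:
            chunks.append((cur_start, pos - 1))
        return chunks

    return [text(s, e) for s, e in split(tree.body, 1, len(lines))]
```

Because every chunk is a contiguous range of source lines, joining the chunks back together reproduces the input exactly, which is one simple way to preserve formatting. A single statement that is larger than the budget is still returned whole in this sketch rather than being cut mid-statement.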

What it’s useful for:

  - Embedding-based code search

  - RAG (retrieval-augmented generation) over codebases

  - Long-context analysis of code

  - Preparing repos for fine-tuning or pretraining

Try it out:

  - Open source package: https://docs.chonkie.ai/chunkers/code-chunker

  - Hosted playground (free with account): https://cloud.chonkie.ai

Happy Chonking!
