Show HN: Fast and Quality Code Chunking with Chonkie

1•snyy•1y ago

Hi HN,

We’re Chonkie (https://github.com/chonkie-inc/chonkie) — we build open source tools that help split documents into meaningful chunks for use with AI models.

When you run LLMs over large documents or codebases, you often need to break them into smaller parts to fit the model’s context window. Our chunkers split along the document’s own structure, so each chunk stays coherent and a retrieval step can pass only the relevant pieces to the model. This helps reduce hallucinations and improves answer accuracy.

Today we’re launching our Code Chunker — a fast, structure-aware way to break down source code into high-quality, token-aware chunks.

How it works:

(See the code: https://github.com/chonkie-inc/chonkie/blob/main/src/chonkie...)

Code Chunker uses tree-sitter (https://tree-sitter.github.io/tree-sitter/) to parse your code into an abstract syntax tree (AST). It then recursively merges and groups nodes in a way that respects both code structure and token limits.

It supports all languages that tree-sitter supports, and is designed to preserve formatting and semantics. Large functions or class definitions won’t be split in the middle of a block — instead, we dive recursively into the AST to produce clean, coherent chunks that fit your configured token budget.
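To make the recursive merge-and-group idea concrete, here is a minimal sketch. It is not Chonkie's implementation: it swaps tree-sitter for Python's standard-library `ast` module so it runs without extra dependencies, and it only handles Python source. The names `count_tokens`, `leaf_spans`, and `chunk_source` are hypothetical, and the whitespace word count is only a stand-in for a real tokenizer.

```python
import ast


def count_tokens(text):
    # Hypothetical token counter: whitespace-separated words stand in
    # for a real tokenizer.
    return len(text.split())


def text_of(lines, start, end):
    # Join 1-based, inclusive line numbers back into source text.
    return "\n".join(lines[start - 1:end])


def node_start(node):
    # Decorators sit above the line reported in node.lineno.
    decorators = getattr(node, "decorator_list", [])
    return min([node.lineno] + [d.lineno for d in decorators])


def leaf_spans(nodes, lines, budget):
    # Return (start, end) line spans; descend into any node that is over budget.
    spans = []
    for node in nodes:
        start, end = node_start(node), node.end_lineno
        body = getattr(node, "body", None)
        too_big = count_tokens(text_of(lines, start, end)) > budget
        if too_big and isinstance(body, list) and body:
            first_child = node_start(body[0])
            if first_child > start:
                # Header lines such as "class Big:" form their own span.
                spans.append((start, first_child - 1))
            spans.extend(leaf_spans(body, lines, budget))
        else:
            # Fits the budget, or cannot be split further in this sketch.
            spans.append((start, end))
    return spans


def chunk_source(source, budget):
    lines = source.splitlines()
    spans = leaf_spans(ast.parse(source).body, lines, budget)

    # Make spans contiguous so comments, blank lines, and any lines the
    # recursion skipped (for example else branches) are still kept.
    pieces, prev_end = [], 0
    for _, end in spans:
        if end > prev_end:
            pieces.append((prev_end + 1, end))
            prev_end = end
    if pieces:
        pieces[-1] = (pieces[-1][0], len(lines))
    elif lines:
        pieces = [(1, len(lines))]

    # Greedily merge neighbouring pieces while the merged text fits the budget.
    chunks, current = [], None
    for start, end in pieces:
        if current and count_tokens(text_of(lines, current[0], end)) <= budget:
            current = (current[0], end)
        else:
            if current:
                chunks.append(text_of(lines, *current))
            current = (start, end)
    if current:
        chunks.append(text_of(lines, *current))
    return chunks


SAMPLE = '''import os

def small(a, b):
    return a + b

class Big:
    """Docstring."""

    def first(self):
        x = 1
        return x

    def second(self):
        y = 2
        return y
'''

chunks = chunk_source(SAMPLE, budget=12)
for i, chunk in enumerate(chunks):
    print(f"--- chunk {i} ({count_tokens(chunk)} tokens) ---")
    print(chunk)
```

In this sketch the class is over budget, so it is opened up and its methods end up in separate chunks, while the small import and function are merged together. Joining the chunks with newlines reproduces the original text, which is one simple way to check that formatting is preserved. A single statement that exceeds the budget is emitted as an oversized chunk here; a real chunker would need a fallback for that case.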

What it’s useful for:

  - Embedding-based code search

  - RAG (retrieval-augmented generation) over codebases

  - Long-context analysis of code

  - Preparing repos for fine-tuning or pretraining

Try it out:

  - Open source package: https://docs.chonkie.ai/chunkers/code-chunker

  - Hosted playground (free with account): https://cloud.chonkie.ai

Happy Chonking!
