frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Fast and Quality Code Chunking with Chonkie

1•snyy•1y ago
Hi HN,

We’re Chonkie (https://github.com/chonkie-inc/chonkie) — we build open source tools that help split documents into meaningful chunks for use with AI models.

When you use LLMs over large documents or codebases, you often need to break them into smaller parts to fit the model’s context window. Our chunkers do this in a smart way: they preserve structure and meaning, so only the most relevant pieces are passed into the model. This reduces hallucinations, avoids confusion, and improves performance and accuracy.

Today we’re launching our Code Chunker — a fast, structure-aware way to break down source code into high-quality, token-aware chunks.

How it works:

(See the code: https://github.com/chonkie-inc/chonkie/blob/main/src/chonkie...)

Code Chunker uses tree-sitter (https://tree-sitter.github.io/tree-sitter/) to parse your code into an abstract syntax tree (AST). It then recursively merges and groups nodes in a way that respects both code structure and token limits.

It supports all languages that tree-sitter supports, and is designed to preserve formatting and semantics. Large functions or class definitions won’t be split in the middle of a block — instead, we dive recursively into the AST to produce clean, coherent chunks that fit your configured token budget.

What it’s useful for:

  - Embedding-based code search

  - RAG (retrieval-augmented generation) over codebases

  - Long-context analysis of code

  - Preparing repos for fine-tuning or pretraining
Try it out:

  - Open source package: https://docs.chonkie.ai/chunkers/code-chunker

  - Hosted playground (free with account): https://cloud.chonkie.ai
Happy Chonking!

What A.I. Did to My College Class

https://www.nytimes.com/2026/05/17/opinion/chatgpt-ai-college-school-graduation.html
1•thm•14s ago•0 comments

How to Write Something Wise (Maria Popova Interview) [video]

https://www.youtube.com/watch?v=yb9Tz-RQFN4
1•freediver•40s ago•0 comments

I automated opt-outs for 500 data broker sites (open source)

https://github.com/stephenlthorn/auto-identity-remove
1•stephenlthorn•2m ago•0 comments

AI agent harnesses like OpenClaw are changing LLMs, inference, and CPUs

https://www.theregister.com/ai-ml/2026/05/17/how-ai-agent-harnesses-like-openclaw-are-changing-ll...
1•abdelhousni•6m ago•0 comments

The Global Fertility Crisis Is Worse Than You Probably Think

https://www.derekthompson.org/p/why-the-whole-world-stopped-having
2•momentmaker•9m ago•0 comments

How Trump's crypto venture and Iran's top exchange tapped into the same networks

https://www.reuters.com/investigations/how-trumps-crypto-venture-irans-top-exchange-tapped-into-s...
1•notagoodidea•11m ago•1 comments

Show HN: Chrome extension that hides YouTube shorts and other distractions

https://chromewebstore.google.com/detail/distraction-free-youtube/ckkcdcieljicflmkokdekbfpkclmmibp
2•mikax•12m ago•1 comments

Now that code is cheap, personal and open software is next

https://blog.stromflix.com/personal-software-is-next
1•StromFLIX•13m ago•0 comments

How to Create Your Own Bespoke, Artisanal, Hand-Drawn PCBs

https://www.hackster.io/news/how-to-create-your-own-bespoke-artisanal-hand-drawn-pcbs-d96d6978a4fb
2•CTOSian•17m ago•0 comments

Japanese-style free pdf editor

https://katanapdf.com/
1•samuraiduckling•18m ago•1 comments

The Backward Logic of Chickenpox Parties

https://www.wired.com/story/chickenpox-parties-and-the-pre-vaccine-internet/
1•joozio•18m ago•0 comments

Indexing code by behavior not imports – tested on large repos, seeking feedback

1•afxuh•18m ago•0 comments

Ask HN: Which books do you wish you'd read earlier in life?

1•jimsojim•22m ago•0 comments

I made a machine that burns money to prove it doesnt exists [video]

https://www.youtube.com/watch?v=2UM4j1_xEs0
3•tzvc•22m ago•1 comments

Spec-Driven Development with math-glyph compression

https://github.com/kborovik/pilot-skills/
1•kborovik•23m ago•0 comments

Zero a Language for Humans and Robots

https://zero-lang.com/
1•dcu•23m ago•0 comments

Show HN: Alder: Dynamic Code Execution Without Roslyn

1•MartiSilvio•25m ago•0 comments

A Danish Couple's Maverick African Research Finds Its Moment in RFK Jr.'S Vacci

https://www.wired.com/story/a-danish-couples-maverick-african-research-finds-its-moment-in-rfk-jr...
2•joozio•26m ago•0 comments

Tim – A High-Performance Template Engine and Markup Language

https://github.com/openpeeps/tim
1•TheWiggles•27m ago•0 comments

Show HN: I built an easy to manage, sharable personal memory for my AI agents

https://ai.actingweb.io/
1•gregertw•32m ago•1 comments

Show HN: Shiftpaper – native parallax wallpaper engine for Wayland

https://github.com/CPritch/shiftpaper
2•PxldLtd•33m ago•0 comments

An ICE Firearms Trainer Was Involved in at Least 4 Deadly Shootings

https://www.wired.com/story/an-ice-firearms-trainer-was-involved-in-at-least-4-deadly-shootings/
4•joozio•34m ago•0 comments

Am I part of the luckiest generation in history?

https://www.bbc.co.uk/news/articles/cj6pyk7e3w4o
2•mmarian•35m ago•0 comments

The Silver Swan Automaton (1773)

https://thebowesmuseum.org.uk/collections/the-silver-swan/
1•pseudolus•35m ago•0 comments

Key Python 3.15 Updates to Make Your Coding Faster, Cleaner, and Easier

https://medium.com/techtofreedom/9-key-python-3-15-updates-to-make-your-coding-faster-cleaner-and...
1•yangzhou•35m ago•0 comments

Nora is now open source

https://www.withnora.run
2•d_cherrington•37m ago•0 comments

Zenk Space raises $26M, targets June debut launch – SpaceNews

https://spacenews.com/zenk-space-raises-26-million-targets-june-debut-launch/
2•rbanffy•40m ago•0 comments

Awesome DESIGN.md

https://github.com/VoltAgent/awesome-design-md/
3•DeathArrow•41m ago•0 comments

Pebble production update and How I use my Index 01

https://repebble.com/blog/how-i-use-my-index-01-production-update
2•smig0•42m ago•0 comments

Ask HN: How can I get interviews when "who wants to be hired" isn't working?

2•LoganDark•42m ago•1 comments