Show HN: Fast and Quality Code Chunking with Chonkie

1•snyy•1y ago
Hi HN,

We’re Chonkie (https://github.com/chonkie-inc/chonkie) — we build open source tools that help split documents into meaningful chunks for use with AI models.

When you use LLMs over large documents or codebases, you often need to break them into smaller parts to fit the model’s context window. Our chunkers split along the boundaries that carry structure and meaning, so each chunk stays coherent and only the most relevant pieces need to be passed to the model. That helps reduce hallucinations and confusion, and improves accuracy.

Today we’re launching our Code Chunker — a fast, structure-aware way to break down source code into high-quality, token-aware chunks.

How it works:

(See the code: https://github.com/chonkie-inc/chonkie/blob/main/src/chonkie...)

Code Chunker uses tree-sitter (https://tree-sitter.github.io/tree-sitter/) to parse your code into an abstract syntax tree (AST). It then recursively merges and groups nodes in a way that respects both code structure and token limits.

It supports every language that tree-sitter supports and is designed to preserve formatting and semantics. Large function or class definitions aren’t cut at an arbitrary point in the middle of a block. Instead, we descend recursively into the AST to produce clean, coherent chunks that fit your configured token budget.
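To make the idea concrete, here is a minimal sketch of recursive, budget-aware merging. It is not Chonkie's implementation: it assumes Python input and swaps in Python's built-in ast module for tree-sitter, and count_tokens is a hypothetical stand-in that counts whitespace-separated words rather than real tokens. The names chunk_code, split, and SAMPLE are made up for illustration.

```python
import ast


def count_tokens(text: str) -> int:
    """Hypothetical token counter: whitespace-separated words, not a real tokenizer."""
    return len(text.split())


def chunk_code(source: str, budget: int) -> list[str]:
    """Split Python source into chunks of whole lines, guided by the AST."""
    lines = source.splitlines(keepends=True)

    def text(start: int, end: int) -> str:
        # Line numbers are 1-indexed and inclusive, matching ast's lineno.
        return "".join(lines[start - 1:end])

    def split(nodes, start, end):
        # Give every sibling a line span. Blank lines, comments, and the
        # parent's header line attach to the node that follows them, and the
        # last sibling absorbs whatever trails it, so no line is dropped.
        spans, cursor = [], start
        for i, node in enumerate(nodes):
            last = end if i == len(nodes) - 1 else node.end_lineno
            if last >= cursor:
                spans.append((cursor, last, node))
                cursor = last + 1
        if not spans:
            return [(start, end)] if start <= end else []

        chunks, current = [], None
        for s, e, node in spans:
            # Greedily merge this sibling into the open group if it still fits.
            if current and count_tokens(text(current[0], e)) <= budget:
                current = (current[0], e)
                continue
            if current:
                chunks.append(current)
                current = None
            if count_tokens(text(s, e)) <= budget:
                current = (s, e)
                continue
            # Too big on its own: descend into its child statements, if any.
            children = [c for c in getattr(node, "body", []) if isinstance(c, ast.stmt)]
            if children:
                chunks.extend(split(children, s, e))
            else:
                chunks.append((s, e))  # oversized leaf, emitted whole
        if current:
            chunks.append(current)
        return chunks

    tree = ast.parse(source)
    return [text(s, e) for s, e in split(tree.body, 1, len(lines))]


SAMPLE = '''import os

def small():
    return 1

class Big:
    def a(self):
        x = 1
        return x

    def b(self):
        y = 2
        return y
'''

for i, chunk in enumerate(chunk_code(SAMPLE, budget=10)):
    print(f"--- chunk {i} ({count_tokens(chunk)} tokens) ---")
    print(chunk, end="")
```

Every line lands in exactly one chunk, so joining the chunks reproduces the input unchanged. A single statement that exceeds the budget and has no child statements is emitted whole in this sketch; a real chunker would need a fallback for that case.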

What it’s useful for:

  - Embedding-based code search
  - RAG (retrieval-augmented generation) over codebases
  - Long-context analysis of code
  - Preparing repos for fine-tuning or pretraining

Try it out:

  - Open source package: https://docs.chonkie.ai/chunkers/code-chunker
  - Hosted playground (free with account): https://cloud.chonkie.ai

Happy Chonking!
