Show HN: Fast and Quality Code Chunking with Chonkie

1•snyy•10mo ago
Hi HN,

We’re Chonkie (https://github.com/chonkie-inc/chonkie) — we build open source tools that help split documents into meaningful chunks for use with AI models.

When you use LLMs over large documents or codebases, you often need to break them into smaller parts to fit the model’s context window. Our chunkers do this in a smart way: they preserve structure and meaning, so only the most relevant pieces are passed into the model. This reduces hallucinations, avoids confusion, and improves performance and accuracy.

Today we’re launching our Code Chunker — a fast, structure-aware way to break down source code into high-quality, token-aware chunks.

How it works:

(See the code: https://github.com/chonkie-inc/chonkie/blob/main/src/chonkie...)

Code Chunker uses tree-sitter (https://tree-sitter.github.io/tree-sitter/) to parse your code into a syntax tree (strictly a concrete syntax tree, referred to here as the AST). It then recursively merges and groups nodes in a way that respects both code structure and token limits.

It supports all languages that tree-sitter supports, and is designed to preserve formatting and semantics. Large functions or class definitions won’t be split in the middle of a block — instead, we dive recursively into the AST to produce clean, coherent chunks that fit your configured token budget.
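To make the recursive merge-and-group idea concrete, here is a minimal sketch. It is not Chonkie's implementation: it swaps tree-sitter for Python's standard-library `ast` module (so it only handles Python source), and it counts tokens by splitting on whitespace. The names `count_tokens`, `first_line`, `split_spans`, `chunk_code` and `SAMPLE` are all made up for illustration.

```python
import ast


def count_tokens(text: str) -> int:
    """Stand-in token counter (assumption): whitespace-separated words."""
    return len(text.split())


def first_line(stmt: ast.stmt) -> int:
    """1-based first line of a statement, including any decorators."""
    decorators = getattr(stmt, "decorator_list", [])
    return min([stmt.lineno] + [d.lineno for d in decorators])


def split_spans(body, start, end, lines, budget):
    """Cover lines[start:end] with (start, end) spans, one per statement in
    `body`, descending into compound statements that exceed the budget."""
    spans, cursor = [], start
    for i, stmt in enumerate(body):
        # A span runs up to the next statement, so blank lines and comments
        # between statements stay attached and no text is dropped.
        stop = first_line(body[i + 1]) - 1 if i + 1 < len(body) else end
        if stop <= cursor:  # several statements sharing one line
            continue
        text = "".join(lines[cursor:stop])
        children = getattr(stmt, "body", None)
        if count_tokens(text) > budget and isinstance(children, list):
            # Too big: recurse into the block instead of cutting mid-block.
            spans.extend(split_spans(children, cursor, stop, lines, budget))
        else:
            # Fits, or is a simple statement that cannot be split further
            # (such a statement may still exceed the budget in this sketch).
            spans.append((cursor, stop))
        cursor = stop
    return spans


def chunk_code(source: str, budget: int) -> list[str]:
    """Greedily merge adjacent spans into chunks of at most `budget` tokens."""
    lines = source.splitlines(keepends=True)
    spans = split_spans(ast.parse(source).body, 0, len(lines), lines, budget)
    chunks, current = [], ""
    for s, e in spans:
        piece = "".join(lines[s:e])
        if current and count_tokens(current + piece) > budget:
            chunks.append(current)
            current = ""
        current += piece
    if current:
        chunks.append(current)
    return chunks


SAMPLE = '''import os

def small():
    return 1

class Big:
    def a(self):
        x = 1
        return x

    def b(self):
        y = 2
        return y
'''

if __name__ == "__main__":
    for n, chunk in enumerate(chunk_code(SAMPLE, budget=10)):
        print(f"--- chunk {n} ({count_tokens(chunk)} tokens) ---")
        print(chunk, end="")
```

Because every chunk is a contiguous slice of the original lines, joining the chunks reproduces the input exactly, which is one simple way to keep formatting intact. A real tokenizer and a language-agnostic parser would replace the two stand-ins.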

What it’s useful for:

  - Embedding-based code search

  - RAG (retrieval-augmented generation) over codebases

  - Long-context analysis of code

  - Preparing repos for fine-tuning or pretraining

Try it out:

  - Open source package: https://docs.chonkie.ai/chunkers/code-chunker

  - Hosted playground (free with account): https://cloud.chonkie.ai

Happy Chonking!
