frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Fast and Quality Code Chunking with Chonkie

1•snyy•11mo ago
Hi HN,

We’re Chonkie (https://github.com/chonkie-inc/chonkie) — we build open source tools that help split documents into meaningful chunks for use with AI models.

When you use LLMs over large documents or codebases, you often need to break them into smaller parts to fit the model’s context window. Our chunkers do this in a smart way: they preserve structure and meaning, so only the most relevant pieces are passed into the model. This reduces hallucinations, avoids confusion, and improves performance and accuracy.

Today we’re launching our Code Chunker — a fast, structure-aware way to break down source code into high-quality, token-aware chunks.

How it works:

(See the code: https://github.com/chonkie-inc/chonkie/blob/main/src/chonkie...)

Code Chunker uses tree-sitter (https://tree-sitter.github.io/tree-sitter/) to parse your code into an abstract syntax tree (AST). It then recursively merges and groups nodes in a way that respects both code structure and token limits.

It supports all languages that tree-sitter supports, and is designed to preserve formatting and semantics. Large functions or class definitions won’t be split in the middle of a block — instead, we dive recursively into the AST to produce clean, coherent chunks that fit your configured token budget.

What it’s useful for:

  - Embedding-based code search

  - RAG (retrieval-augmented generation) over codebases

  - Long-context analysis of code

  - Preparing repos for fine-tuning or pretraining
Try it out:

  - Open source package: https://docs.chonkie.ai/chunkers/code-chunker

  - Hosted playground (free with account): https://cloud.chonkie.ai
Happy Chonking!

Synaphe – A type-safe language for hybrid AI and quantum computing

https://github.com/martus-spinther/synaphe-project
1•martus-spinther•8m ago•0 comments

Mindwtr – Open-source, local-first GTD app (Tauri and React Native)

https://github.com/dongdongbh/Mindwtr
1•dongdongbh•9m ago•0 comments

Quantum mechanics simulation Python library for research and learning

https://github.com/iDEA-org/iDEA
1•jw1294•11m ago•1 comments

Proof Theory and Logic Programming

https://www.lix.polytechnique.fr/Labo/Dale.Miller/ptlp/
1•remywang•12m ago•0 comments

Tell HN: Microsoft365 "Convert to Paid" checkout silently default to 25 licenses

2•davidstarkjava•16m ago•0 comments

Show HN: Passport Globe (See where your passport takes you)

https://hariharan.uno/globe
1•hariharan_uno•24m ago•0 comments

Show HN: TMA1 – Local-first observability for LLM agents

https://tma1.ai/
2•killme2008•25m ago•0 comments

Show HN: Yeet – Throw AI tasks at hardware and walk away (Nomad and OpenShell)

https://github.com/wan0net/yeet
1•wan0net•26m ago•0 comments

Phase Transitions and Computation

https://theory.org/complexity/cdpt/html/node5.html
1•downboots•30m ago•0 comments

Show HN: Banish: A declarative framework for rule-based state machines in Rust

https://github.com/LoganFlaherty/banish/releases/tag/v1.3.0
1•LoganFlaherty•30m ago•0 comments

Bitcoin mining difficulty drops 7.8% as miner exodus accelerates amid AI pivot

https://www.theblock.co/post/394579/bitcoin-mining-difficulty-drops-7-8-as-miner-exodus-accelerat...
2•adrianwaj•30m ago•1 comments

Review: Why Evolution Is True

https://ncse.ngo/review-why-evolution-true
1•akbarnama•34m ago•0 comments

We Read What Delve Ships to the Browser

https://security.redeux.ai/research/delve-compliance-posture
1•chasewarren•43m ago•0 comments

Isometric exercise: The most efficient fitness regime?

https://www.bbc.com/future/article/20260319-isometric-exercise-the-most-efficient-fitness-regime
1•akbarnama•47m ago•0 comments

A Rant about Resolutions

https://blog.brixit.nl/rant-about-resolutions/
1•vinhnx•49m ago•0 comments

Is Simple Good?

https://darth.games/posts/is-simple-good/
1•vinhnx•49m ago•0 comments

Delve Accused of Fraud

https://techcrunch.com/2026/03/21/delve-accused-of-misleading-customers-with-fake-compliance/
4•zlu•54m ago•0 comments

Sashiko: AI code review system for the Linux kernel spots bugs humans miss

https://www.theregister.com/2026/03/20/sashiko_code_review_linux/
2•maxloh•54m ago•0 comments

AI Disrupts Talent Evaluation Before It Disrupts Talent

https://substack.com/home/post/p-191732116
2•cactaceae•56m ago•0 comments

A Reason to Ditch Jira: AI Agents

https://age-of-product.com/jira-ai-agents/
1•swolpers•57m ago•0 comments

Show HN:Entroly – Compress codebase context for LLMs by 78% using Rust

https://github.com/juyterman1000/entroly
1•savetokens•59m ago•0 comments

The History and Business of Formula 1

https://www.acquired.fm/episodes/formula-1
1•vismit2000•1h ago•0 comments

Why Even Smart People Believe AI Is Thinking

https://www.wsj.com/tech/ai/ai-tools-sentience-b98fc6e6
1•1vuio0pswjnm7•1h ago•0 comments

OpenAI to introduce ads to all ChatGPT free and Go users in US

https://www.reuters.com/business/media-telecom/openai-expand-ads-chatgpt-all-free-low-cost-users-...
1•1vuio0pswjnm7•1h ago•0 comments

Anthropic just shipped an OpenClaw killer

https://venturebeat.com/orchestration/anthropic-just-shipped-an-openclaw-killer-called-claude-cod...
3•qwertmax•1h ago•1 comments

Not Even Elon Musk Can Get Nvidia Stock Moving

https://www.barrons.com/articles/nvidia-stock-price-musk-ai-1a20fbaf?
1•1vuio0pswjnm7•1h ago•0 comments

David Botstein, RIP

https://www.nytimes.com/2026/03/20/science/david-botstein-dead.html
2•paulpauper•1h ago•0 comments

My Time with Jürgen Habermas, Europe's 'Last Intellectual'

https://www.politico.com/news/magazine/2026/03/20/karp-habermas-remembrance-00838398
1•paulpauper•1h ago•0 comments

The M4 Apple Neural Engine, Part 3: Training

https://maderix.substack.com/p/inside-the-m4-apple-neural-engine-c8b
1•wslh•1h ago•0 comments

Did Delve (Compliance) Commit Securities Fraud?

1•ManuelSuarez•1h ago•0 comments