frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Fast and Quality Code Chunking with Chonkie

1•snyy•11mo ago
Hi HN,

We’re Chonkie (https://github.com/chonkie-inc/chonkie) — we build open source tools that help split documents into meaningful chunks for use with AI models.

When you use LLMs over large documents or codebases, you often need to break them into smaller parts to fit the model’s context window. Our chunkers do this in a smart way: they preserve structure and meaning, so only the most relevant pieces are passed into the model. This reduces hallucinations, avoids confusion, and improves performance and accuracy.

Today we’re launching our Code Chunker — a fast, structure-aware way to break down source code into high-quality, token-aware chunks.

How it works:

(See the code: https://github.com/chonkie-inc/chonkie/blob/main/src/chonkie...)

Code Chunker uses tree-sitter (https://tree-sitter.github.io/tree-sitter/) to parse your code into an abstract syntax tree (AST). It then recursively merges and groups nodes in a way that respects both code structure and token limits.

It supports all languages that tree-sitter supports, and is designed to preserve formatting and semantics. Large functions or class definitions won’t be split in the middle of a block — instead, we dive recursively into the AST to produce clean, coherent chunks that fit your configured token budget.

What it’s useful for:

  - Embedding-based code search

  - RAG (retrieval-augmented generation) over codebases

  - Long-context analysis of code

  - Preparing repos for fine-tuning or pretraining
Try it out:

  - Open source package: https://docs.chonkie.ai/chunkers/code-chunker

  - Hosted playground (free with account): https://cloud.chonkie.ai
Happy Chonking!

Streamfold Is Joining Cursor

https://rotel.dev/blog/streamfold-joining-cursor/
1•bryanmikaelian•56s ago•0 comments

Performance Implications of AArch64 Atomics

https://www.researchgate.net/publication/370682772_A_Study_on_the_Performance_Implications_of_AAr...
1•fanf2•1m ago•0 comments

ASIC Chip Routing GIF

https://old.reddit.com/r/chipdesign/comments/1s21n30/global_routing_in_action/
1•random__duck•1m ago•0 comments

InterviewSim: A Scalable Framework for Interview-Grounded Personality Simulation

https://arxiv.org/abs/2602.20294
1•PaulHoule•1m ago•0 comments

Python CFD simulator matching atomic resting radii vs. CODATA

https://github.com/agus79amm-dotcom/scalar-cartographer-cfd
1•Nitsuga0•1m ago•0 comments

RIP Orelhao

https://www.designative.info/2026/01/12/r-i-p-orelhao/
1•pearlsontheroad•1m ago•1 comments

Issue Tracking Is Dead

https://linear.app/next
1•cristinacordova•2m ago•0 comments

Show HN: I built a party game that makes fun of corporate culture

https://cubiclegame.com/
1•pianobrothers•4m ago•0 comments

Bermuda Triangle Search Led Divers to Challenger Wreckage

https://modernengineeringmarvels.com/2026/03/23/bermuda-triangle-search-led-divers-to-challenger-...
1•Brajeshwar•5m ago•0 comments

Atomic Display Switching: Solving

https://github.com/piot5/displayflow_cli
1•pyotq•5m ago•0 comments

Fixed Python Autocomplete

https://matan-h.com/better-python-autocomplete
1•karakoram•6m ago•0 comments

What I mean when I say that I hate GenAI

https://michal.sapka.pl/weblog/2026/what-i-mean-when-i-say-that-i-hate-genai/
3•speckx•8m ago•0 comments

Em-dash – open-source HIPAA compliance that runs locally

https://github.com/aanishs/em-dash
1•aanishs•8m ago•0 comments

Wine 11 rewrites how Linux runs Windows games at kernel with massive speed gains

https://www.xda-developers.com/wine-11-rewrites-linux-runs-windows-games-speed-gains/
3•felineflock•8m ago•0 comments

Show HN: Cranki – Crosswords meet Anki flashcards

https://cranki.app
1•petargyurov•9m ago•0 comments

Should religious AI chatbots be treated differently from all others?

https://mfioretti.substack.com/p/should-religious-ai-chatbots-be-treated
2•oopsiremembered•10m ago•1 comments

Cognitive OS – Prediction-error learning layer for AI agents

https://github.com/eugenexonr/cognitive-os
1•dovmant•10m ago•1 comments

The Danger Behind Meta Killing End-to-End Encryption for Instagram DMs

https://www.wired.com/story/the-danger-behind-metas-decision-to-kill-end-to-end-encrypted-instagr...
3•pulisse•14m ago•0 comments

US Gov Investigators Found No EU Internet Censorship, and Ignored the Findings

https://www.techdirt.com/2026/03/24/the-trump-admins-own-investigators-found-no-eu-internet-censo...
3•hn_acker•16m ago•2 comments

Anduril Industries – Senior Software Engineer – JavaScript, React, AWS

1•Floss•16m ago•0 comments

Delta suspends special congressional services amid shutdown

https://thehill.com/policy/transportation/5797907-delta-suspends-special-congressional-desk-service/
4•JumpCrisscross•19m ago•0 comments

New cars allegedly include "eye of sauron" to monitor drivers

https://twitter.com/VladTheInflator/status/2036174124180185260
3•bilsbie•19m ago•0 comments

Show HN: Skub – a sliding puzzle browser game

https://skub.app
2•kasperstorgaard•21m ago•5 comments

Airstrikes may have destroyed Iran's last F-14s

https://www.npr.org/2026/03/24/nx-s1-5752380/f14-tomcats-iran-us-israel-airstrikes-top-gun
2•divbzero•21m ago•0 comments

LiteLLM's SOC 2 auditor was - Delve

https://trustcompliance.xyz/blog/supply-chain-trust
2•fadijob•22m ago•0 comments

LLMs don't think outside the box

https://zeyrie.blog/posts/technology/llms-dont-think-outside-the-box/
3•speckx•23m ago•0 comments

Victorian service stations run out of fuel as Middle East war spikes demand

https://www.abc.net.au/news/2026-03-24/victorian-petrol-stations-run-out-diesel-fuel-iran-conflic...
4•geox•24m ago•1 comments

Yes, AI is intelligent. Prove me wrong

https://bertrandmeyer.com/2026/02/26/yes-ai-is-intelligent-prove-me-wrong/
3•reillyse•24m ago•3 comments

Gmail AI Productivity Hacks

https://consul.so/blog/gmail-productivity-tips-ai-2026
1•goldkey•24m ago•0 comments

Hark – The most advanced personal intelligence [video]

https://www.youtube.com/watch?v=0H1LSLipOVI
1•dangtony98•25m ago•0 comments