Show HN: Fast and Quality Code Chunking with Chonkie

1•snyy•11mo ago
Hi HN,

We’re Chonkie (https://github.com/chonkie-inc/chonkie) — we build open source tools that help split documents into meaningful chunks for use with AI models.

When you use LLMs over large documents or codebases, you often need to break them into smaller parts to fit the model’s context window. Our chunkers do this in a smart way: they preserve structure and meaning, so only the most relevant pieces are passed into the model. This reduces hallucinations, avoids confusion, and improves performance and accuracy.

Today we’re launching our Code Chunker — a fast, structure-aware way to break down source code into high-quality, token-aware chunks.

How it works:

(See the code: https://github.com/chonkie-inc/chonkie/blob/main/src/chonkie...)

Code Chunker uses tree-sitter (https://tree-sitter.github.io/tree-sitter/) to parse your code into an abstract syntax tree (AST). It then recursively merges and groups nodes in a way that respects both code structure and token limits.

It supports all languages that tree-sitter supports, and is designed to preserve formatting and semantics. Large functions or class definitions won’t be split in the middle of a block — instead, we dive recursively into the AST to produce clean, coherent chunks that fit your configured token budget.
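
To make the recursive merge-and-split idea concrete, here is a minimal sketch in Python. It is not Chonkie's implementation. It swaps tree-sitter for Python's built-in `ast` module (so it only handles Python source), counts tokens by whitespace splitting instead of using a real tokenizer, and the names `count_tokens` and `chunk_code` are made up for illustration. A statement that exceeds the budget and has no nested statements is emitted as-is.

```python
import ast


def count_tokens(text: str) -> int:
    # Assumption: whitespace-separated words stand in for a real tokenizer.
    return len(text.split())


def chunk_code(source: str, budget: int) -> list[str]:
    """Split Python source into chunks of at most `budget` tokens where possible."""
    lines = source.splitlines(keepends=True)

    def text(first: int, last: int) -> str:
        # Line numbers are 1-indexed and inclusive, matching the ast module.
        return "".join(lines[first - 1:last])

    def split(stmts, first, last):
        # Tile lines first..last with one span per statement. Blank lines,
        # comments, decorators and block headers attach to the statement
        # that follows them.
        pieces, cursor = [], first
        for node in sorted(stmts, key=lambda n: n.lineno):
            if node.end_lineno >= cursor:
                pieces.append([node, cursor, node.end_lineno])
                cursor = node.end_lineno + 1
        if not pieces:
            return [(first, last)] if first <= last else []
        pieces[-1][2] = last  # trailing lines go to the final statement

        spans = []
        for node, lo, hi in pieces:
            body = [c for c in ast.iter_child_nodes(node) if isinstance(c, ast.stmt)]
            if count_tokens(text(lo, hi)) <= budget or not body:
                spans.append((lo, hi))  # fits, or cannot be split further
            else:
                spans.extend(split(body, lo, hi))  # too big: descend into the block
        return spans

    chunks = []
    for lo, hi in split(ast.parse(source).body, 1, len(lines)):
        piece = text(lo, hi)
        if chunks and count_tokens(chunks[-1] + piece) <= budget:
            chunks[-1] += piece  # merge neighbours while the budget allows
        else:
            chunks.append(piece)
    return chunks
```

Because the spans tile the file line by line, joining the returned chunks reproduces the input exactly, which is one simple way to keep formatting intact. A class that is too large for the budget gets split between its methods before any method body is touched.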

What it’s useful for:

  - Embedding-based code search

  - RAG (retrieval-augmented generation) over codebases

  - Long-context analysis of code

  - Preparing repos for fine-tuning or pretraining

Try it out:

  - Open source package: https://docs.chonkie.ai/chunkers/code-chunker

  - Hosted playground (free with account): https://cloud.chonkie.ai

Happy Chonking!
