frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Fast and Quality Code Chunking with Chonkie

1•snyy•1y ago
Hi HN,

We’re Chonkie (https://github.com/chonkie-inc/chonkie) — we build open source tools that help split documents into meaningful chunks for use with AI models.

When you use LLMs over large documents or codebases, you often need to break them into smaller parts to fit the model’s context window. Our chunkers do this in a smart way: they preserve structure and meaning, so only the most relevant pieces are passed into the model. This reduces hallucinations, avoids confusion, and improves performance and accuracy.

Today we’re launching our Code Chunker — a fast, structure-aware way to break down source code into high-quality, token-aware chunks.

How it works:

(See the code: https://github.com/chonkie-inc/chonkie/blob/main/src/chonkie...)

Code Chunker uses tree-sitter (https://tree-sitter.github.io/tree-sitter/) to parse your code into an abstract syntax tree (AST). It then recursively merges and groups nodes in a way that respects both code structure and token limits.

It supports all languages that tree-sitter supports, and is designed to preserve formatting and semantics. Large functions or class definitions won’t be split in the middle of a block — instead, we dive recursively into the AST to produce clean, coherent chunks that fit your configured token budget.

What it’s useful for:

  - Embedding-based code search

  - RAG (retrieval-augmented generation) over codebases

  - Long-context analysis of code

  - Preparing repos for fine-tuning or pretraining
Try it out:

  - Open source package: https://docs.chonkie.ai/chunkers/code-chunker

  - Hosted playground (free with account): https://cloud.chonkie.ai
Happy Chonking!

The Tail End (2015)

https://waitbutwhy.com/2015/12/the-tail-end.html
1•RyanShook•5m ago•0 comments

ZCAM: iOS camera app that cryptographically proves human authorship

https://zcam.succinct.xyz/
1•GZGavinZhao•5m ago•1 comments

Rippling exposed your SSN/banking to anyone who sent you an offer letter

https://infosec.exchange/@SecureOwl/116415101610028363
1•noiobeforebed•12m ago•0 comments

Useknockout – open-source background removal API, 40× cheaper than remove.bg

https://github.com/useknockout/api
2•tlorents•12m ago•0 comments

SF Giants sell piece of team to venture capital firm run by Joshua Kushner

https://www.sfgate.com/giants/article/sf-giants-joshua-kushner-22224927.php
1•iancmceachern•18m ago•0 comments

DevResolve – AI chat widget that answers technical questions from your docs

https://devresolve.ai
1•EthanWayne•19m ago•0 comments

Open source memory layer so any AI agent can do what Claude.ai and ChatGPT do

https://alash3al.github.io/stash?_v01
2•alash3al•21m ago•1 comments

Asking Qwen3.5-9B, running on 16GB VRAM, to exploit old Windows machines

https://thepatrickfisher.com/blog/computers/series-vibe-coding-sec-scan/00-vibe-coding-security-p...
1•sqeak•23m ago•1 comments

Llama 4: A Deep Dive into Liquid Transformers 2.0 and Sovereign AI

https://en.landingfymax.com.br/artificial-intelligence/llama-4-meta-open-source-sovereignty-2026
1•EvCarvalho•26m ago•0 comments

Show HN: Building a local FIX protocol triage agent on an RTX 3070

https://domgalati.substack.com/p/the-fix-triage-agent-building-a-local
1•dgalati•30m ago•0 comments

(Blender) Cosmology with Geometry Nodes

https://www.blender.org/user-stories/cosmology-with-geometry-nodes/
4•shankysingh•38m ago•0 comments

BMW is one step closer to selling you a color-changing car

https://www.theverge.com/tech/918216/bmw-ix3-flow-edition-concept-car-2026-beijing-auto-show-e-in...
2•dmitrygr•42m ago•0 comments

"Plain text has been around for decades and it's here to stay." – Unsung

https://unsung.aresluna.org/plain-text-has-been-around-for-decades-and-its-here-to-stay/
2•rbanffy•43m ago•0 comments

[Show HN] Free Baccarat

https://baccarat.free/
1•cbxyp•45m ago•1 comments

Apple IIc: A smaller, sleeker Apple II from 1984

https://dfarq.homeip.net/apple-iic/
2•rbanffy•46m ago•2 comments

Mypaintr: Plot R graphics like a human

https://hughjonesd.github.io/mypaintr/index.html
1•dash2•47m ago•0 comments

Replace IBM Quantum back end with /dev/urandom

https://github.com/yuvadm/quantumslop/blob/25ad2e76ae58baa96f6219742459407db9dd17f5/URANDOM_DEMO.md
2•pigeons•47m ago•1 comments

Avnac: Open-source local-first Canva alternative

https://avnac.design/
3•bundie•49m ago•0 comments

White House Memo on Adversarial Distillation of American AI Models [pdf]

https://whitehouse.gov/wp-content/uploads/2026/04/NSTM-4.pdf
2•lebovic•53m ago•1 comments

Fast-AI-detector: a fast local CLI for detecting AI-generated text

https://github.com/Ejhfast/fast-ai-detector
1•unignorant•55m ago•0 comments

Google Cloud CEO: Anthropic, TPUs, Mythos, Nvidia and More [video]

https://www.youtube.com/watch?v=bNdiBwXbLNw
2•rdudekul•57m ago•0 comments

Cactus, a work-stealing parallel recursion runtime for C

https://github.com/xtellect/cactus
1•enduku•58m ago•0 comments

These Volcanoes Are Undead

https://nautil.us/when-extinct-volcanoes-reawaken-1280213
1•kristenfrench•1h ago•1 comments

Sam Altman Wants to Know Whether You're Human

https://www.theatlantic.com/newsletters/2026/04/sam-altman-bots-world-id/686950/
1•JumpCrisscross•1h ago•0 comments

Honda CEO says 'we have no chance' against Chinese automakers

https://finance.yahoo.com/sectors/energy/articles/honda-ceo-says-no-chance-020000235.html
2•thelastgallon•1h ago•0 comments

Why Congress may spend $1B on Great Salt Lake

https://www.npr.org/2026/04/24/nx-s1-5746844/why-trump-wants-to-spend-1-billion-on-great-salt-lake
1•kianN•1h ago•1 comments

Is Italy the new tax haven for the global rich?

https://www.bbc.com/worklife/article/20260421-is-italy-the-new-tax-haven-for-the-global-rich
2•andsoitis•1h ago•0 comments

Jeff Bezos is raising his game in space

https://www.economist.com/business/2026/04/23/jeff-bezos-is-raising-his-game-in-space
1•andsoitis•1h ago•0 comments

Bdelloid Rotifer

https://en.wikipedia.org/wiki/Bdelloidea
3•embedding-shape•1h ago•0 comments

Tim Cook wrote a winning recipe for Apple

https://www.economist.com/leaders/2026/04/23/tim-cook-wrote-a-winning-recipe-for-apple
1•andsoitis•1h ago•0 comments