frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Fast and Quality Code Chunking with Chonkie

1•snyy•1y ago
Hi HN,

We’re Chonkie (https://github.com/chonkie-inc/chonkie) — we build open source tools that help split documents into meaningful chunks for use with AI models.

When you use LLMs over large documents or codebases, you often need to break them into smaller parts to fit the model’s context window. Our chunkers do this in a smart way: they preserve structure and meaning, so only the most relevant pieces are passed into the model. This reduces hallucinations, avoids confusion, and improves performance and accuracy.

Today we’re launching our Code Chunker — a fast, structure-aware way to break down source code into high-quality, token-aware chunks.

How it works:

(See the code: https://github.com/chonkie-inc/chonkie/blob/main/src/chonkie...)

Code Chunker uses tree-sitter (https://tree-sitter.github.io/tree-sitter/) to parse your code into an abstract syntax tree (AST). It then recursively merges and groups nodes in a way that respects both code structure and token limits.

It supports all languages that tree-sitter supports, and is designed to preserve formatting and semantics. Large functions or class definitions won’t be split in the middle of a block — instead, we dive recursively into the AST to produce clean, coherent chunks that fit your configured token budget.

What it’s useful for:

  - Embedding-based code search

  - RAG (retrieval-augmented generation) over codebases

  - Long-context analysis of code

  - Preparing repos for fine-tuning or pretraining
Try it out:

  - Open source package: https://docs.chonkie.ai/chunkers/code-chunker

  - Hosted playground (free with account): https://cloud.chonkie.ai
Happy Chonking!

A way out of US debt?

https://www.warman.life/blog/2026-04-26-the-synthetic-buyer/
1•shaunistyping•1m ago•0 comments

OpenAI Reportedly Working on an AI Smartphone to Rival iPhone

https://www.macrumors.com/2026/04/27/openai-working-on-an-ai-smartphone/
1•mgh2•2m ago•0 comments

Pancreatic Cancer Study Retracted over Undisclosed Conflict of Interest

https://globalportalnews.com/spain-culture-entertainment-news/mariano-barbacid-pancreatic-cancer-...
1•wslh•2m ago•0 comments

I Won a Championship That Doesn't Exist

https://ron.stoner.com/How_I_Won_a_Championship_That_Doesnt_Exist/
1•SEJeff•3m ago•0 comments

Pentagon seeks to codify Department of War title as renaming costs total $50M

https://www.stripes.com/theaters/us/2026-04-28/pentagon-congress-codify-dow-name-21516668.html
3•Bender•6m ago•0 comments

Disaggregated Serving for Hybrid SSM Models in vLLM

https://vllm-website-lx4pji0mz-inferact-inc.vercel.app/blog/hybrid-ssm-disagg
1•matt_d•8m ago•0 comments

Show HN: Effected Keyboard 2 – Effects as You Type

1•vitalipom•9m ago•0 comments

Drone pilot makes US rescind no-fly zones around unmarked, moving ICE vehicles

https://arstechnica.com/gadgets/2026/04/no-fly-zones-around-moving-ice-vehicles-this-drone-pilot-...
8•Bender•10m ago•0 comments

King Charles state visit to US

https://www.bbc.co.uk/news/live/c4g5lly7qg8t
2•FridayoLeary•10m ago•0 comments

Flesh-eating bacteria devour man's arm and leg in just three days

https://arstechnica.com/health/2026/04/flesh-eating-bacteria-devour-mans-arm-and-leg-in-just-thre...
4•Bender•11m ago•0 comments

Mad Bugs: QEMU and UTM Escape

https://blog.calif.io/p/mad-bugs-qemu-and-utm-escape
1•wslh•12m ago•0 comments

Post-trained Qwen3-Coder with a debugger: 70% → 89% solve rate, 59% fewer turns

https://twitter.com/moofeez/status/2049192929739280482
3•moofeez•14m ago•1 comments

Show HN: My friend and his AI homies wrote SGI Indy emulator in Rust

https://github.com/techomancer/iris
2•greg_w•14m ago•0 comments

Release PiClaw v2.0.4 – Chapek 9 · rcarmo/piclaw

https://github.com/rcarmo/piclaw/releases/tag/v2.0.4
1•rcarmo•15m ago•0 comments

Max/MSP external for running neural amplifier captures

https://github.com/apresta/neural_tilde
2•ot•16m ago•0 comments

FCC Orders a Review of ABC's Broadcast Licenses

https://www.nytimes.com/2026/04/28/business/media/fcc-abc-television-kimmel.html
4•standardUser•16m ago•0 comments

The missing macOS web app viewer chromeless, highly opinionated

https://github.com/rcarmo/swift-webapp-viewer
1•rcarmo•16m ago•0 comments

Show HN: An agent that remembers across sessions (no chat history)

https://github.com/umbecanessa/neural-ledger-system
1•wasnaga•18m ago•0 comments

Ask HN: Should the letter B be typed with the left or the right hand?

2•modinfo•20m ago•3 comments

John Carlos Baez: "Learning from Nature with System Dynamics"

https://mathstodon.xyz/@johncarlosbaez/116478639091196587
1•_Microft•22m ago•0 comments

Why China's Affordable AI Is a Worry for Silicon Valley

https://www.bloomberg.com/news/articles/2026-04-27/why-china-s-deepseek-qwen-and-moonshot-are-a-w...
6•wslh•23m ago•0 comments

What Anthropic's Mythos means for the future of cybersecurity

https://www.schneier.com/blog/archives/2026/04/what-anthropics-mythos-means-for-the-future-of-cyb...
2•MattSayar•24m ago•0 comments

BookStack Has Migrated from GitHub to Codeberg

https://www.bookstackapp.com/blog/project-migrated-to-codeberg/
2•IceWreck•24m ago•0 comments

MCP Apps put user data into the model context. That is the point, not the bug

https://mcprunbook.com/posts/interactive-mcp-apps-tredict.html
1•Aldipower•24m ago•0 comments

The Race Is on to Keep AI Agents from Running Wild with Your Credit Cards

https://www.wired.com/story/the-race-is-on-to-keep-ai-agents-from-running-wild-with-your-credit-c...
2•pavel_lishin•25m ago•0 comments

'World models' are AI's latest sensation: what are they and what can they do?

https://www.nature.com/articles/d41586-026-00820-5
2•petethomas•25m ago•0 comments

AI Talent agent making direct intros to 100s of startups

https://www.getclera.com
1•alexanderfarr•26m ago•0 comments

Show HN: Nomie is an open-world self care game to replace doomscrolling

https://apps.apple.com/us/app/nomie-self-care-stress-relief/id6757396354
1•liaai0630•27m ago•0 comments

Private equity dismantled West Suburban Medical Center and other area hospitals

https://chicago.suntimes.com/other-views/2026/04/24/pipeline-health-west-suburban-weiss-memorial-...
2•petethomas•29m ago•0 comments

Can You Reverse Brain Rot?

https://untangled.bearblog.dev/brain-rot/
1•speckx•29m ago•0 comments