frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Fast and Quality Code Chunking with Chonkie

1•snyy•11mo ago
Hi HN,

We’re Chonkie (https://github.com/chonkie-inc/chonkie) — we build open source tools that help split documents into meaningful chunks for use with AI models.

When you use LLMs over large documents or codebases, you often need to break them into smaller parts to fit the model’s context window. Our chunkers do this in a smart way: they preserve structure and meaning, so only the most relevant pieces are passed into the model. This reduces hallucinations, avoids confusion, and improves performance and accuracy.

Today we’re launching our Code Chunker — a fast, structure-aware way to break down source code into high-quality, token-aware chunks.

How it works:

(See the code: https://github.com/chonkie-inc/chonkie/blob/main/src/chonkie...)

Code Chunker uses tree-sitter (https://tree-sitter.github.io/tree-sitter/) to parse your code into an abstract syntax tree (AST). It then recursively merges and groups nodes in a way that respects both code structure and token limits.

It supports all languages that tree-sitter supports, and is designed to preserve formatting and semantics. Large functions or class definitions won’t be split in the middle of a block — instead, we dive recursively into the AST to produce clean, coherent chunks that fit your configured token budget.

What it’s useful for:

  - Embedding-based code search

  - RAG (retrieval-augmented generation) over codebases

  - Long-context analysis of code

  - Preparing repos for fine-tuning or pretraining
Try it out:

  - Open source package: https://docs.chonkie.ai/chunkers/code-chunker

  - Hosted playground (free with account): https://cloud.chonkie.ai
Happy Chonking!

OpenAI: The Next Phase of Enterprise AI

https://openai.com/index/next-phase-of-enterprise-ai/
1•louiereederson•1m ago•0 comments

Show HN: SessionFlow macOS app that auto-schedules sessions around your calendar

https://github.com/kibermaks/SessionFlow
1•kibermaks•1m ago•0 comments

Grading laptop and cell phone companies on the fixability of their products

https://pirg.org/edfund/resources/failing-the-fix-2026/
1•breve•3m ago•0 comments

So My Friend Made Me a Bioreactor

https://chillphysicsenjoyer.substack.com/p/so-my-friend-made-me-a-bioreactor
1•crescit_eundo•4m ago•0 comments

Anthropic Set to Preview Powerful 'Mythos' Model to Ward Off AI Cyberthreats

https://www.wsj.com/tech/ai/anthropic-set-to-preview-powerful-mythos-model-to-ward-off-ai-cyberth...
1•sonabinu•7m ago•0 comments

Show HN: I built a local data lake for AI powered data engineering and analytics

https://stream-sock-3f5.notion.site/Nile-Local-an-AI-Data-IDE-that-runs-on-your-local-machine-33b...
3•vpfaiz•7m ago•0 comments

Fantasshtic – Free Cursor for SSH

https://fantasshtic.vercel.app
1•aureus_cx•8m ago•0 comments

Ask HN: What tools are you using to secure your Claude memory files?

1•taariqlewis•8m ago•0 comments

Something weird is happening on Tinder [video]

https://www.youtube.com/watch?v=rjxAYdUe8uU
1•DavidHaerer•11m ago•0 comments

The $400K Degree Is Broken. Here Is How to Fix It

https://raisinghumanity.substack.com/p/the-400k-degree-is-broken-here-is
1•United857•12m ago•0 comments

You Need a Windows Remote Desktop, Not an OpenClaw

https://nedshed.dev/p/you-need-a-windows-remote-desktop
1•etwigg•12m ago•1 comments

Show HN: CongaLine – Self-hosted isolated AI agent fleet (OpenClaw, Hermes)

https://github.com/cruxdigital-llc/congaline
1•zhendershot•12m ago•0 comments

Show HN: I Used 15 AI Agents to Design a Wearable – Here's Where They Broke

https://chetandesh.substack.com/p/i-used-15-ai-agents-to-design-a-wearable
1•cdesh•13m ago•0 comments

Vera – A language designed for machines to write

https://veralang.dev/
2•joecobb•14m ago•0 comments

An Untold Piece of Fast Food History in Alexandria, Virginia

https://www.thedeletedscenes.com/p/wiener-me-this
1•adelmastro•14m ago•0 comments

My Journey to a Datacenter in a Box

https://merqur.io/2026/04/08/the-journey-to-a-datacenter-in-a-box/
1•merqurio•14m ago•1 comments

The Quality Wall of AI Adoption

https://jitera.com/blog/in-vs-through/
1•everlier•16m ago•2 comments

How Augmented Reality Is Transforming Museums, Public Venues, and Accessibility

https://sawtoothcreative.substack.com/p/how-augmented-reality-is-transforming
1•SteveMburu•17m ago•0 comments

Recursive Moving Polynomial Regression – O(1) Constant Complexity

https://zenodo.org/records/19038620
2•Pierdimi•17m ago•1 comments

AMD AI director says Claude Code is becoming dumber and lazier since update

https://www.theregister.com/2026/04/06/anthropic_claude_code_dumber_lazier_amd_ai_director/
7•Logans_Run•18m ago•0 comments

Swiss Banks Want a Franc Stablecoin

https://www.siliconsnark.com/swiss-banks-finally-want-a-franc-stablecoin/
1•SaaSasaurus•18m ago•0 comments

Strait of Hormuz Live Tracker

https://hormuzstraitmonitor.com
2•elsewhen•19m ago•0 comments

Spec: Generic Methods for Go

https://github.com/golang/go/issues/77273
1•_ikke_•19m ago•0 comments

Show HN: Starla – Unofficial Ripe Atlas Software Probe

https://github.com/ananthb/starla
1•pcpuser•19m ago•0 comments

On TypeScript's Flaws (2024)

https://zanlib.dev/blog/on-typescripts-flaws/
2•aragonite•23m ago•0 comments

MRI machine that's freezing tumors and saving patients from debilitating pain

https://www.9news.com.au/health/liverpool-hospital-mri-machine-sydney/27db0a50-615a-4aa2-a1b2-2b2...
2•rmason•25m ago•1 comments

SNN brain-inspired gen-AI in C/C#, no external AI libs could be promising?

1•adinhitlore•26m ago•0 comments

Meta Muse Spark is darn good

https://www.riteshkhanna.com/blog/muse-spark-arena
1•treadon•30m ago•0 comments

The End of Gangs – Policing and Crime in Los Angeles

https://psmag.com/social-justice/the-end-of-gangs-los-angeles-southern-california-epidemic-crime-...
1•caycep•31m ago•0 comments

John Deere to Pay $99M in Monumental Right-to-Repair Settlement

https://www.thedrive.com/news/john-deere-to-pay-99-million-in-monumental-right-to-repair-settlement
11•CharlesW•33m ago•1 comments