frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Fast and Quality Code Chunking with Chonkie

1•snyy•10mo ago
Hi HN,

We’re Chonkie (https://github.com/chonkie-inc/chonkie) — we build open source tools that help split documents into meaningful chunks for use with AI models.

When you use LLMs over large documents or codebases, you often need to break them into smaller parts to fit the model’s context window. Our chunkers do this in a smart way: they preserve structure and meaning, so only the most relevant pieces are passed into the model. This reduces hallucinations, avoids confusion, and improves performance and accuracy.

Today we’re launching our Code Chunker — a fast, structure-aware way to break down source code into high-quality, token-aware chunks.

How it works:

(See the code: https://github.com/chonkie-inc/chonkie/blob/main/src/chonkie...)

Code Chunker uses tree-sitter (https://tree-sitter.github.io/tree-sitter/) to parse your code into an abstract syntax tree (AST). It then recursively merges and groups nodes in a way that respects both code structure and token limits.

It supports all languages that tree-sitter supports, and is designed to preserve formatting and semantics. Large functions or class definitions won’t be split in the middle of a block — instead, we dive recursively into the AST to produce clean, coherent chunks that fit your configured token budget.

What it’s useful for:

  - Embedding-based code search

  - RAG (retrieval-augmented generation) over codebases

  - Long-context analysis of code

  - Preparing repos for fine-tuning or pretraining
Try it out:

  - Open source package: https://docs.chonkie.ai/chunkers/code-chunker

  - Hosted playground (free with account): https://cloud.chonkie.ai
Happy Chonking!

I'm Not Reading That

https://karldaniel.co.uk/im-not-reading-that/
1•speckx•54s ago•0 comments

The Many Meanings of "Stack": From Data Structures, VMs, to Calling Conventions

https://ezzeriesa.notion.site/The-many-meanings-of-stack-bc768cb186714b579547b7b8681ee32f
1•kurinikku•1m ago•0 comments

Kumo: Cloudflare's UI Component Library

https://kumo-ui.com/
1•mmarian•1m ago•0 comments

Minions: Stripe's one-shot, end-to-end coding agents–Part 2

https://stripe.dev/blog/minions-stripes-one-shot-end-to-end-coding-agents-part-2
1•ains•2m ago•0 comments

Show HN: Inconvo – open-source chat-with-data agent that doesn't generate SQL

https://github.com/inconvoai/inconvo
1•ogham•2m ago•0 comments

Reassessing Spinosaurus: New Fossils and the Aquatic Debate

https://comuniq.xyz/post?t=818
1•01-_-•2m ago•0 comments

Show HN: Ghost OS – Let AI agents use your Mac, not just the terminal

https://github.com/ghostwright/ghost-os
1•mcheemaa•2m ago•0 comments

The Clock Has Run Out on Stablecoin Ambiguity

https://thefutureofmoney.substack.com/p/the-clock-has-run-out-on-stablecoin
1•futureofmoney•2m ago•0 comments

China Robots

https://www.newsweek.com/china-killer-robots-unitree-robotics-1917569
1•aversivet•2m ago•1 comments

40k param model beats Yolo26n (at least for small objects)

https://one-ware.com/docs/one-ai/demos/tennis-ball-demo/
1•lebeier•4m ago•0 comments

How AI is reshaping developer choice (and Octoverse data proves it)

https://github.blog/ai-and-ml/generative-ai/how-ai-is-reshaping-developer-choice-and-octoverse-da...
1•mikece•4m ago•0 comments

Show HN: Git worktree manager for Niri (Wayland compositor)

https://github.com/nskha101/niri-worktree-management
1•nithiiyan25•5m ago•0 comments

Show HN: I created a webapp to track the latest OpenClaw news

https://www.lobstersauce.news/
1•Tjerkienator•5m ago•0 comments

zeptocom.js

https://github.com/tabemann/zeptocomjs
1•tosh•7m ago•0 comments

Show HN: Cogitator – Self-hosted AI agent runtime with native A2A Protocol

https://github.com/cogitator-ai/Cogitator-AI
1•el1fe•8m ago•1 comments

Show HN: Getting Warmer – Daily word game scored by GloVe embedding similarity

https://gettingwarmer.io
1•frostadvisory•9m ago•0 comments

European companies don't have an innovation problem, they have an incentive prob

https://productics.substack.com/p/european-companies-dont-have-an-innovation
1•iggori•9m ago•2 comments

NASA Puts Starliner Mishap in Same Class as Shuttle Tragedies

https://www.bloomberg.com/news/articles/2026-02-19/nasa-puts-starliner-mishap-in-same-class-as-sh...
3•Betelbuddy•10m ago•1 comments

Starliner Crew Flight Test Investigation Letter from Isaacman

https://twitter.com/NASAAdmin/status/2024558806135689354
1•baggy_trough•11m ago•0 comments

Micropayments as a reality check for news sites

https://blog.zgp.org/micropayments-as-a-reality-check-for-news-sites/
2•speckx•11m ago•0 comments

AI Adoption at Sentry

https://twitter.com/jshchnz/status/2024213163483546076/photo/1
1•tosh•12m ago•0 comments

Built a CPI inflation calculator (US and UK) and want feedback on accuracy

https://investment-calculator.net/inflation-calculator/
1•investmentcalc•14m ago•0 comments

AI helps unlock 50-80x improvement in Linux's io_uring

https://lore.kernel.org/qemu-devel/20260213143225.161043-1-axboe@kernel.dk/#t
1•binkHN•14m ago•0 comments

Loon Is a Lisp

https://campedersen.com/loon
1•ecto•16m ago•0 comments

0 A.D. Release 28: Boiorix

https://play0ad.com/new-release-0-a-d-release-28-boiorix/
2•jonbaer•17m ago•0 comments

Palantir partnership is at heart of Anthropic, Pentagon rift

https://www.semafor.com/article/02/17/2026/palantir-partnership-is-at-heart-of-anthropic-pentagon...
3•everybodyknows•17m ago•0 comments

A High Performance Neural Network for Energy-Efficient Copyright Violation [pdf]

https://raw.githubusercontent.com/em-tg/laundercat/refs/heads/master/laundercat.pdf
1•em-tg•20m ago•0 comments

Alpha School's Secret Sauce

https://fivetwelvethirteen.substack.com/p/alpha-schools-secret-sauce
3•yorwba•20m ago•0 comments

Show HN: NationalDex – an open-source Pokédex app

https://www.nationaldex.app
1•linesofcode•20m ago•1 comments

Show HN: Giving Claude Code persistent memory with a self-hosted MCP server

https://github.com/elvismdev/mem0-mcp-selfhosted
1•elvismdev•21m ago•1 comments