Show HN: Fast and Quality Code Chunking with Chonkie

1•snyy•9mo ago
Hi HN,

We’re Chonkie (https://github.com/chonkie-inc/chonkie). We build open source tools that split documents into meaningful chunks for use with AI models.

When you use LLMs over large documents or codebases, you often need to break them into smaller parts to fit the model’s context window. Our chunkers split along structural and semantic boundaries, so each chunk stays coherent and only the most relevant pieces are passed into the model. This helps reduce hallucinations and improves accuracy.

Today we’re launching our Code Chunker, a fast, structure-aware way to break source code into high-quality chunks that respect a token budget.

How it works:

(See the code: https://github.com/chonkie-inc/chonkie/blob/main/src/chonkie...)

Code Chunker uses tree-sitter (https://tree-sitter.github.io/tree-sitter/) to parse your code into an abstract syntax tree (AST). It then recursively merges and groups nodes in a way that respects both code structure and token limits.

It supports all languages that tree-sitter supports, and is designed to preserve formatting and semantics. Large functions or class definitions won’t be split in the middle of a block. Instead, we descend recursively into the AST to produce clean, coherent chunks that fit your configured token budget.
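To make the recursive idea concrete, here is a minimal sketch. It is not Chonkie's implementation. It swaps tree-sitter for Python's standard-library `ast` module so it runs with no dependencies, and it counts tokens by splitting on whitespace instead of using a real tokenizer. The names `chunk_code`, `chunk_span`, `count_tokens`, and `SAMPLE` are hypothetical and exist only for this illustration.

```python
import ast

def count_tokens(text: str) -> int:
    # Assumption: whitespace-separated words stand in for a real tokenizer.
    return len(text.split())

def chunk_span(node, start, end, lines, budget):
    """Split lines[start:end] into (start, end) line ranges under the budget."""
    children = [c for c in ast.iter_child_nodes(node) if isinstance(c, ast.stmt)]
    if count_tokens("".join(lines[start:end])) <= budget or not children:
        # Fits, or cannot be split further (an oversized leaf stays whole).
        return [(start, end)]

    # Recurse into each child statement. Lines before a child (headers,
    # comments, blank lines) are attached to that child, and the last child
    # absorbs any trailing lines, so no source text is dropped.
    pieces, cursor = [], start
    for i, child in enumerate(children):
        child_end = end if i == len(children) - 1 else child.end_lineno
        if child_end > cursor:
            pieces.extend(chunk_span(child, cursor, child_end, lines, budget))
            cursor = child_end

    # Greedily merge adjacent pieces while the merged text still fits.
    merged = []
    for s, e in pieces:
        if merged and count_tokens("".join(lines[merged[-1][0]:e])) <= budget:
            merged[-1] = (merged[-1][0], e)
        else:
            merged.append((s, e))
    return merged

def chunk_code(source: str, budget: int) -> list[str]:
    lines = source.splitlines(keepends=True)
    tree = ast.parse(source)
    spans = chunk_span(tree, 0, len(lines), lines, budget)
    return ["".join(lines[s:e]) for s, e in spans]

SAMPLE = '''import os

def small():
    return 1

class Big:
    def a(self):
        x = 1
        return x

    def b(self):
        y = 2
        return y
'''

if __name__ == "__main__":
    for i, chunk in enumerate(chunk_code(SAMPLE, budget=10)):
        print(f"--- chunk {i} ({count_tokens(chunk)} tokens) ---")
        print(chunk, end="")
```

In this sketch the chunks always concatenate back to the original source, and a node is only opened up when it does not fit the budget on its own. A single statement larger than the budget is left whole rather than cut mid-statement.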

What it’s useful for:

  - Embedding-based code search

  - RAG (retrieval-augmented generation) over codebases

  - Long-context analysis of code

  - Preparing repos for fine-tuning or pretraining

Try it out:

  - Open source package: https://docs.chonkie.ai/chunkers/code-chunker

  - Hosted playground (free with account): https://cloud.chonkie.ai

Happy Chonking!
