frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Fast and Quality Code Chunking with Chonkie

1•snyy•10mo ago
Hi HN,

We’re Chonkie (https://github.com/chonkie-inc/chonkie) — we build open source tools that help split documents into meaningful chunks for use with AI models.

When you use LLMs over large documents or codebases, you often need to break them into smaller parts to fit the model’s context window. Our chunkers do this in a smart way: they preserve structure and meaning, so only the most relevant pieces are passed into the model. This reduces hallucinations, avoids confusion, and improves performance and accuracy.

Today we’re launching our Code Chunker — a fast, structure-aware way to break down source code into high-quality, token-aware chunks.

How it works:

(See the code: https://github.com/chonkie-inc/chonkie/blob/main/src/chonkie...)

Code Chunker uses tree-sitter (https://tree-sitter.github.io/tree-sitter/) to parse your code into an abstract syntax tree (AST). It then recursively merges and groups nodes in a way that respects both code structure and token limits.

It supports all languages that tree-sitter supports, and is designed to preserve formatting and semantics. Large functions or class definitions won’t be split in the middle of a block — instead, we dive recursively into the AST to produce clean, coherent chunks that fit your configured token budget.

What it’s useful for:

  - Embedding-based code search

  - RAG (retrieval-augmented generation) over codebases

  - Long-context analysis of code

  - Preparing repos for fine-tuning or pretraining
Try it out:

  - Open source package: https://docs.chonkie.ai/chunkers/code-chunker

  - Hosted playground (free with account): https://cloud.chonkie.ai
Happy Chonking!

NPM install is stealing your passwords – I built a tool to catch it

https://westbayberry.com/product
1•ComCat•1m ago•1 comments

Lamplight.Cafe

https://lamplight.cafe
1•ryuura•3m ago•0 comments

NASA's Artemis II launch date gets pushed back again

https://qz.com/nasas-artemis-ii-launch-delayed-april
1•bookmtn•6m ago•0 comments

Show HN: MFLScout – Analytics Platform for Metaverse Football League

1•iedayan03•6m ago•0 comments

Claude on Socialization

https://claude.ai/share/486be97b-df4f-4e11-abc9-53021038f141
1•s1gs3gv•8m ago•1 comments

Show HN: CodeAnswr – AI-powered Stack Overflow alternative, free forever

https://codeanswr.com
1•mobinpo•9m ago•1 comments

Show HN: BudgetFast – Upload a bank statement screenshot, AI does the rest

https://budgetfast.co
1•ivanramos•10m ago•0 comments

I made a game about the nihilist penguin

https://store.steampowered.com/app/4343410/ULTRATAP_Demo/
1•luckyape_•14m ago•0 comments

Building DIY Split-Flap Displays (2021)

https://www.partsnotincluded.com/building-diy-split-flap-displays/
2•walterbell•14m ago•0 comments

VibeLattice – A vortex lattice method based on AVL

https://avl.vibefoil.com
1•carabiner•17m ago•0 comments

Underpriced app is now live in app store AI Profit Calculator for Resellers

https://apps.apple.com/us/app/underpriced/id6759008101
1•UnderpricedApp•32m ago•0 comments

They Fought for the CIA in Afghanistan. In America, They're Living in Fear

https://www.nytimes.com/2026/02/23/magazine/zero-units-cia-afghanistan.html
4•jbegley•36m ago•0 comments

It might be time to say goodbye to HTML inputs

https://medium.com/zar-engineering/it-might-be-time-to-say-goodbye-to-html-inputs-f37ccf434cc3
2•obiefernandez•36m ago•1 comments

An online book about how ChatGPT works

https://ericsilberstein1.github.io/how-they-think-book/index.html
1•DenisM•37m ago•0 comments

Blood test boosts Alzheimer's diagnosis accuracy to 94.5%, clinical study shows

https://medicalxpress.com/news/2026-02-blood-boosts-alzheimer-diagnosis-accuracy.html
33•wglb•37m ago•3 comments

Saturated ARC-AGI-2

https://www.ycombinator.com/launches/PWR-confluence-labs-an-ai-research-lab-focused-on-learning-e...
1•eightnoteight•39m ago•0 comments

Google, Apple start testing encrypted RCS on Android and iOS 26.4

https://9to5google.com/2026/02/23/google-messages-encrypted-rcs-iphone/
6•thunderbong•39m ago•0 comments

Show HN: Falcon – Chat-first communities built on Bluesky AT Protocol

3•JohannaWeb•45m ago•0 comments

Some things we've learned about GPU textures at planetary scales

http://richg42.blogspot.com/2026/02/some-things-weve-learned.html
1•vinhnx•45m ago•0 comments

What's in the Housing for the 21st Century Act?

https://bipartisanpolicy.org/explainer/whats-in-the-housing-for-the-21st-century-act/
1•toomuchtodo•47m ago•1 comments

Ask HN: Posthotty.com I kindly ask for feedback to improve my AI vibed website

1•gitprolinux•47m ago•0 comments

Uber launches autonomous vehicles services venture in robotaxi push

https://www.ft.com/content/0c0902f6-f6d8-421d-8767-fe3aaf9a3ce4
2•ryan_j_naughton•50m ago•0 comments

Panasonic, the former plasma king, will no longer make its own TVs

https://arstechnica.com/gadgets/2026/02/panasonic-the-former-plasma-king-will-no-longer-make-its-...
7•mroche•57m ago•3 comments

Show HN: Mouse Tester – visualize raw mouse input in the browser

https://mousetester.net/en
2•greey2026•57m ago•0 comments

The Oral Microbiome and Systemic Health: The Mouth-Body Connection

https://www.mdpi.com/2075-1729/16/2/294
1•PaulHoule•58m ago•0 comments

Say Goodbye to the Undersea Cable That Made the Global Internet Possible

https://www.wired.com/story/say-goodbye-to-the-undersea-cable-that-made-the-global-internet-possi...
3•jonbaer•1h ago•1 comments

The Agentic Data Stack

https://github.com/ClickHouse/agentic-data-stack
1•ryadh•1h ago•0 comments

GitHub Actions Pull_request_target vs. Apache NuttX RTOS

https://lupyuen.org/articles/prtarget
1•lupyuen•1h ago•0 comments

Web page design studio – Part one: User-friendly visuals, and responsive design

https://research.exoticsilicon.com/design_studio1
1•bookstore-romeo•1h ago•0 comments

Director of Safety and Alignment meta gave clawdbot full-access to her computer

https://twitter.com/summeryue0/status/2025774069124399363
3•tamnd•1h ago•0 comments