frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Fast and Quality Code Chunking with Chonkie

1•snyy•10mo ago
Hi HN,

We’re Chonkie (https://github.com/chonkie-inc/chonkie) — we build open source tools that help split documents into meaningful chunks for use with AI models.

When you use LLMs over large documents or codebases, you often need to break them into smaller parts to fit the model’s context window. Our chunkers do this in a smart way: they preserve structure and meaning, so only the most relevant pieces are passed into the model. This reduces hallucinations, avoids confusion, and improves performance and accuracy.

Today we’re launching our Code Chunker — a fast, structure-aware way to break down source code into high-quality, token-aware chunks.

How it works:

(See the code: https://github.com/chonkie-inc/chonkie/blob/main/src/chonkie...)

Code Chunker uses tree-sitter (https://tree-sitter.github.io/tree-sitter/) to parse your code into an abstract syntax tree (AST). It then recursively merges and groups nodes in a way that respects both code structure and token limits.

It supports all languages that tree-sitter supports, and is designed to preserve formatting and semantics. Large functions or class definitions won’t be split in the middle of a block — instead, we dive recursively into the AST to produce clean, coherent chunks that fit your configured token budget.

What it’s useful for:

  - Embedding-based code search

  - RAG (retrieval-augmented generation) over codebases

  - Long-context analysis of code

  - Preparing repos for fine-tuning or pretraining
Try it out:

  - Open source package: https://docs.chonkie.ai/chunkers/code-chunker

  - Hosted playground (free with account): https://cloud.chonkie.ai
Happy Chonking!

How do hardware timers work? [video]

https://www.youtube.com/watch?v=g_koa00MBLg
1•tosh•2m ago•0 comments

Show HN: Built Habits Easily

https://trykaro.xyz/
1•AkshayS96•5m ago•0 comments

Show HN: I was bored so I made a site that aggressively roasts other websites

https://siteroaster.vercel.app
1•macraft•5m ago•1 comments

The Cost of Accepting Without Agreeing: A Warning for Architects

https://architectureintel.com/the-hidden-cost-of-accepting-without-agreeing-a-warning-for-archite...
1•younss•5m ago•0 comments

Redis Patterns for Coding

https://antirez.com/news/161
1•surprisetalk•10m ago•0 comments

YARA rules to detect Palantir Gotham data federation

1•OussamaAfnakkar•15m ago•0 comments

Computer Says No

https://koenvangilst.nl/lab/computer-says-no
1•vnglst•19m ago•0 comments

Show HN: I built an app that screenshots news sites hourly on Raspberry Pi 5

https://github.com/herol3oy/kiosk24
1•herol3oy•20m ago•0 comments

Blogatto – A Gleam framework for building static blogs

https://blogat.to/
1•todsacerdoti•20m ago•0 comments

Show HN: Finger GAMES Crossfingers on the same phone

https://fingree.com/
2•santiss•22m ago•0 comments

MicroGPT Explained Interactively

https://growingswe.com/blog/microgpt
2•growingswe•23m ago•0 comments

Recreating the 1984 Amiga Boing Ball demo as a KDE Plasma 6 live wallpaper

https://old.reddit.com/r/amiga/comments/1rh8oyh/i_tried_recreating_the_1984_amiga_boing_ball_demo/
1•doener•24m ago•0 comments

ChopeMiles – Miles/points tracker built for Singapore credit cards

https://chopemiles.com
1•gladgate8812•24m ago•1 comments

The 64-bit Hurd has landed in Guix

https://guix.gnu.org/en/blog/2026/the-64-bit-hurd/
2•todsacerdoti•24m ago•0 comments

Master and Commander: The Far Side of the World movie review (2003)

https://www.rogerebert.com/reviews/master-and-commander-the-far-side-of-the-world-2003
2•walterbell•26m ago•0 comments

Show HN: Tired of seeing ugly maps – Translate Simple JSON into maps

https://jsonmaps.dev/
3•milind-soni•30m ago•0 comments

OpenAI has exposed and shut down Russian network "Rybar"

https://xcancel.com/CforCD/status/2027379581817147679
2•doener•30m ago•0 comments

What does Utopia Look Like? (why tech bros don't understand the Culture series) [video]

https://www.youtube.com/watch?v=A7wrGM9r2x4
1•zeristor•32m ago•0 comments

WA councils could make laws to stop cats roaming

https://www.abc.net.au/news/2026-02-20/wa-councils-could-soon-have-power-to-stop-cats-roaming/106...
1•Tomte•32m ago•0 comments

Finally: Unrestricted AI Video

https://unbound.video
2•gabrieln•33m ago•1 comments

Ask HN: Can "mental disorders" be induced in LLMs via weight changes alone?

1•Gooblebrai•37m ago•0 comments

Fair, WordPress, and Knowing When to Stop

https://joost.blog/fair-wordpress-and-knowing-when-to-stop/
1•Tomte•38m ago•0 comments

Why is the first C++ (m)allocation always 72 KB?

https://joelsiks.com/posts/cpp-emergency-pool-72kb-allocation/
2•joelsiks•39m ago•0 comments

Show HN: Terminal-Style Portfolio on the Internet

https://kuber.studio/
3•kuberwastaken•40m ago•0 comments

Hans Zimmer Performs the Dune Soundtrack Live [video]

https://www.youtube.com/watch?v=_j5GgGdSwjE
1•doener•40m ago•0 comments

Hackerbot-Claw: AI Bot Exploiting GitHub Actions – Microsoft, Datadog Hit So Far

https://www.stepsecurity.io/blog/hackerbot-claw-github-actions-exploitation
9•varunsharma07•44m ago•3 comments

Show HN: Effective Git

https://github.com/nolasoft/okgit
2•nola-a•45m ago•0 comments

AI pioneer Fei-Fei Li's World Labs raises $1B in funding

https://www.channelnewsasia.com/business/ai-pioneer-fei-fei-lis-world-labs-raises-1-billion-in-fu...
3•doppp•46m ago•0 comments

Show HN: A Pi Harness for Color Correcting Multi-Clip Footage

https://github.com/perbhat/agentic-color-grader
1•perbhat•47m ago•0 comments

Understanding RabbitMQ in Simple Terms

https://sushantdhiman.dev/understanding-rabbitmq/
2•birdculture•48m ago•0 comments