Show HN: Fast and Quality Code Chunking with Chonkie

1•snyy•9mo ago
Hi HN,

We’re Chonkie (https://github.com/chonkie-inc/chonkie) — we build open source tools that help split documents into meaningful chunks for use with AI models.

When you use LLMs over large documents or codebases, you often need to break them into smaller parts to fit the model’s context window. Our chunkers split along the document’s own structure, so each chunk stays coherent and only the most relevant pieces need to be passed to the model. This helps reduce hallucinations and confusion, and improves accuracy.

Today we’re launching our Code Chunker — a fast, structure-aware way to break down source code into high-quality, token-aware chunks.

How it works:

(See the code: https://github.com/chonkie-inc/chonkie/blob/main/src/chonkie...)

Code Chunker uses tree-sitter (https://tree-sitter.github.io/tree-sitter/) to parse your code into an abstract syntax tree (AST). It then recursively merges and groups nodes in a way that respects both code structure and token limits.

It supports all languages that tree-sitter supports, and is designed to preserve formatting and semantics. Large functions or class definitions won’t be split in the middle of a block — instead, we dive recursively into the AST to produce clean, coherent chunks that fit your configured token budget.
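To make the recursive idea concrete, here is a minimal sketch of the same shape of algorithm. It is not Chonkie's implementation: it swaps tree-sitter for Python's standard-library `ast` module (so it only handles Python source), counts tokens by splitting on whitespace instead of using a real tokenizer, and the names `chunk_source`, `_split`, `_first_line`, and `count_tokens` are made up for illustration.

```python
import ast


def count_tokens(text: str) -> int:
    # Assumption: whitespace-separated words stand in for a real tokenizer.
    return len(text.split())


def _first_line(stmt: ast.stmt) -> int:
    # Decorators sit above the `def`/`class` line, so start the span there.
    decorators = getattr(stmt, "decorator_list", [])
    return min([stmt.lineno] + [d.lineno for d in decorators])


def _split(stmts, start, end, lines, budget):
    """Return contiguous line spans [s, e) covering [start, end).

    A statement that exceeds the budget and has a body is split into a
    header span plus the spans of its body, recursively.
    """
    items = []
    for i, stmt in enumerate(stmts):
        begin = start if i == 0 else _first_line(stmt)
        if items and begin <= items[-1][0]:
            continue  # shares a line with the previous statement
        items.append((begin, stmt))
    if not items:
        return [(start, end)]

    spans = []
    for i, (begin, stmt) in enumerate(items):
        stop = items[i + 1][0] if i + 1 < len(items) else end
        text = "".join(lines[begin - 1:stop - 1])
        body = getattr(stmt, "body", None)
        can_descend = (
            isinstance(body, list) and body and _first_line(body[0]) > begin
        )
        if count_tokens(text) > budget and can_descend:
            body_start = _first_line(body[0])
            spans.append((begin, body_start))  # header line(s)
            spans.extend(_split(body, body_start, stop, lines, budget))
        else:
            spans.append((begin, stop))  # fits, or cannot be split further
    return spans


def chunk_source(source: str, budget: int) -> list[str]:
    lines = source.splitlines(keepends=True)
    tree = ast.parse(source)
    spans = _split(tree.body, 1, len(lines) + 1, lines, budget)

    # Greedily merge adjacent spans while the merged text fits the budget.
    chunks, current = [], ""
    for begin, stop in spans:
        piece = "".join(lines[begin - 1:stop - 1])
        if current and count_tokens(current + piece) > budget:
            chunks.append(current)
            current = piece
        else:
            current += piece
    if current:
        chunks.append(current)
    return chunks
```

Because spans are contiguous line ranges, joining the chunks back together reproduces the input exactly, which is one simple way to keep formatting intact. The sketch has known gaps: a single statement with no body that exceeds the budget is emitted as one oversized chunk, and `else`/`except` clauses simply ride along with the preceding body statement.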

What it’s useful for:

  - Embedding-based code search

  - RAG (retrieval-augmented generation) over codebases

  - Long-context analysis of code

  - Preparing repos for fine-tuning or pretraining

Try it out:

  - Open source package: https://docs.chonkie.ai/chunkers/code-chunker

  - Hosted playground (free with account): https://cloud.chonkie.ai

Happy Chonking!
