frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Fast and Quality Code Chunking with Chonkie

1•snyy•10mo ago
Hi HN,

We’re Chonkie (https://github.com/chonkie-inc/chonkie) — we build open source tools that help split documents into meaningful chunks for use with AI models.

When you use LLMs over large documents or codebases, you often need to break them into smaller parts to fit the model’s context window. Our chunkers do this in a smart way: they preserve structure and meaning, so only the most relevant pieces are passed into the model. This reduces hallucinations, avoids confusion, and improves performance and accuracy.

Today we’re launching our Code Chunker — a fast, structure-aware way to break down source code into high-quality, token-aware chunks.

How it works:

(See the code: https://github.com/chonkie-inc/chonkie/blob/main/src/chonkie...)

Code Chunker uses tree-sitter (https://tree-sitter.github.io/tree-sitter/) to parse your code into an abstract syntax tree (AST). It then recursively merges and groups nodes in a way that respects both code structure and token limits.

It supports all languages that tree-sitter supports, and is designed to preserve formatting and semantics. Large functions or class definitions won’t be split in the middle of a block — instead, we dive recursively into the AST to produce clean, coherent chunks that fit your configured token budget.

What it’s useful for:

  - Embedding-based code search

  - RAG (retrieval-augmented generation) over codebases

  - Long-context analysis of code

  - Preparing repos for fine-tuning or pretraining
Try it out:

  - Open source package: https://docs.chonkie.ai/chunkers/code-chunker

  - Hosted playground (free with account): https://cloud.chonkie.ai
Happy Chonking!

Long-term support for Linux releases gets a new lease on life

https://thenewstack.io/long-term-support-for-linux-releases-gets-a-new-lease-on-life/
1•CrankyBear•1m ago•0 comments

Best API Documentation Tools

https://www.jamdesk.com/blog/best-api-documentation-tools
1•gbourne1•1m ago•0 comments

The UAW Is Leading the Push for Green Jobs in California

https://jacobin.com/2026/02/uaw-green-jobs-california-union/
1•PaulHoule•2m ago•0 comments

Utah's online porn tax proposal poses a major threat to civil liberties

https://www.techdirt.com/2026/03/06/utahs-proposal-to-tax-online-pornography-is-a-civil-liberties...
1•speckx•3m ago•0 comments

Show HN: How to Catch Documentation Drift with Claude Code and GitHub Actions

https://dosu.dev/blog/how-to-catch-documentation-drift-claude-code-github-actions
1•onlydole•4m ago•0 comments

Runtime observability and policy enforcement for AI coding agents

https://www.oculisecurity.com/
1•rellaElla•5m ago•0 comments

Wine 11.4 – Run Windows Applications on Linux, BSD, Solaris and macOS

https://www.winehq.org/announce/11.4
1•neustradamus•5m ago•0 comments

BlackRock $26B Private Credit Fund Limits Withdrawals

https://www.bloomberg.com/news/articles/2026-03-06/blackrock-s-26-billion-private-credit-fund-lim...
1•greesil•6m ago•1 comments

Dracarys

https://dracarys.robertborghesi.is/
1•darmensdf•6m ago•0 comments

Project Operational Autonomy: our agent-to-agent-future

https://pebblebed.com/blog/operational-autonomy
1•kmavm•6m ago•0 comments

China's 792M kWh compressed air energy station now operational

https://interestingengineering.com/energy/china-largest-compressed-air-energy-station-world
1•pseudolus•9m ago•1 comments

Are there any companies who are anti-AI?

1•anti-ai-dev•10m ago•1 comments

The Worst Acquisition in History, Again

https://www.profgmedia.com/p/the-worst-acquisition-in-history
1•JumpCrisscross•11m ago•0 comments

OpenPawz Conductor Protocol

https://github.com/OpenPawz/openpawz/blob/main/reference/conductor-protocol.mdx
1•gotham64•13m ago•1 comments

Show HN: Unread, turns your unread newsletters into a daily podcast

https://app.unread.live
1•benfosterdev•13m ago•0 comments

Show HN: See Your Money in 3D

https://moneyvisualiser.com
1•KhazP•13m ago•0 comments

Show HN: Verified n8n community node for Anytype

https://github.com/splch/n8n-nodes-anytype
1•splch•14m ago•0 comments

Claude Code vs. Codex (Nate B Jones) [video]

https://www.youtube.com/watch?v=09sFAO7pklo
1•JnBrymn•16m ago•0 comments

Show HN: Synclippy – Ephemeral rooms for sharing text or files

https://synclippy.ujjwalvivek.com
1•ujjwalvivek•16m ago•0 comments

PKGSmith

https://pkgsmith.app/
1•Fogh•18m ago•0 comments

Plan 9 from User Space

https://9fans.github.io/plan9port/
1•tosh•20m ago•0 comments

GPT-5.4 code-golfs GPT-2

https://twitter.com/hansonwng/status/2030000810894184808
1•tosh•20m ago•0 comments

Re-creating the complex cuisine of prehistoric Europeans

https://arstechnica.com/science/2026/03/recreating-the-complex-cuisine-of-prehistoric-europeans/
1•apollinaire•21m ago•0 comments

Oracle and OpenAI drop Texas data center expansion plan

https://www.reuters.com/business/oracle-openai-end-plans-expand-texas-data-center-site-bloomberg-...
2•speckx•21m ago•0 comments

Palera1n Jailbreak Compiled and Run on a Samsung Galaxy S3 (PostmarketOS, ARMv7)

https://github.com/noxbitx/s3ra1n/tree/main
2•noxbit•21m ago•0 comments

Eval awareness in Claude Opus 4.6's BrowseComp performance

https://www.anthropic.com/engineering/eval-awareness-browsecomp
1•gcampbell•22m ago•0 comments

Show HN: I built an international calling platform/Android App

https://voklit.app
1•ahmgeek•23m ago•1 comments

If flip-phones can make a comeback, can Flash do the same?

https://disassociated.com/flip-phones-comeback-can-flash/
1•speckx•23m ago•0 comments

An AI disaster is getting ever closer

https://www.economist.com/briefing/2026/03/05/an-ai-disaster-is-getting-ever-closer
3•bookofjoe•26m ago•1 comments

Ecological Imperialism

https://kschroeder.substack.com/p/my-library-ecological-imperialism
1•MaysonL•27m ago•0 comments