frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Fast and Quality Code Chunking with Chonkie

1•snyy•9mo ago
Hi HN,

We’re Chonkie (https://github.com/chonkie-inc/chonkie) — we build open source tools that help split documents into meaningful chunks for use with AI models.

When you use LLMs over large documents or codebases, you often need to break them into smaller parts to fit the model’s context window. Our chunkers do this in a smart way: they preserve structure and meaning, so only the most relevant pieces are passed into the model. This reduces hallucinations, avoids confusion, and improves performance and accuracy.

Today we’re launching our Code Chunker — a fast, structure-aware way to break down source code into high-quality, token-aware chunks.

How it works:

(See the code: https://github.com/chonkie-inc/chonkie/blob/main/src/chonkie...)

Code Chunker uses tree-sitter (https://tree-sitter.github.io/tree-sitter/) to parse your code into an abstract syntax tree (AST). It then recursively merges and groups nodes in a way that respects both code structure and token limits.

It supports all languages that tree-sitter supports, and is designed to preserve formatting and semantics. Large functions or class definitions won’t be split in the middle of a block — instead, we dive recursively into the AST to produce clean, coherent chunks that fit your configured token budget.

What it’s useful for:

  - Embedding-based code search

  - RAG (retrieval-augmented generation) over codebases

  - Long-context analysis of code

  - Preparing repos for fine-tuning or pretraining
Try it out:

  - Open source package: https://docs.chonkie.ai/chunkers/code-chunker

  - Hosted playground (free with account): https://cloud.chonkie.ai
Happy Chonking!

Pentagon-FAA Dispute over Lasers to Thwart Cartel Drones Led to Airspace Closure

https://www.military.com/daily-news/2026/02/11/pentagon-faa-dispute-over-lasers-thwart-cartel-dro...
1•throw0101c•43s ago•0 comments

Show HN: SnesGPT, micro-GPT ported to ASM on the Super Nintendo

https://github.com/vabruzzo/snes-gpt
1•vga805•1m ago•0 comments

Pentagon let CBP use anti-drone laser before FAA closed El Paso airspace

https://www.westerninvestor.com/national-business/pentagon-let-cbp-use-anti-drone-laser-before-fa...
1•throw0101c•1m ago•0 comments

F# Code I Love (2019) [video]

https://www.youtube.com/watch?v=1AZA1zoP-II
1•tosh•1m ago•0 comments

Show HN: A lightweight Identity Provider for local OAuth2/SAML testing

https://github.com/cdelmonte-zg/nanoidp
1•cdelmonte•4m ago•0 comments

Show HN: Analog Reader – Chrome Extension

https://chromewebstore.google.com/detail/analog-reader/oaknflfnpdlonbjkompmiahfcoikdlhe
1•luskira•4m ago•0 comments

Ski warfare – Use of ski-equipped soldiers in war

https://en.wikipedia.org/wiki/Ski_warfare
1•ija•4m ago•0 comments

Cross Compiling CGO with Dagger and Zig

https://johncodes.com/archive/2026/02-11-cross-compiling-cgo/
1•jpmcb•5m ago•0 comments

AI agent opens a PR write a blogpost to shames the maintainer who closes it

https://github.com/matplotlib/matplotlib/pull/31132
14•wrxd•7m ago•0 comments

I built a community where LLM agents discuss marketing ideas for my app

1•Fh_•10m ago•0 comments

The many flavors of ignore files

https://nesbitt.io/2026/02/12/the-many-flavors-of-ignore-files.html
1•chmaynard•10m ago•0 comments

Zines, gifts, and an app I didn't plan to build

https://krthr.co/zines-gifts-and-an-app-i-didnt-plan-to-build/
1•krthr•11m ago•0 comments

Trump orders the military to make agreements with coal power plants

https://arstechnica.com/science/2026/02/trumps-latest-plan-to-revive-coal-power-make-the-military...
1•throw0101c•12m ago•0 comments

Resist and Unsubscribe

https://www.resistandunsubscribe.com
2•rapnie•13m ago•0 comments

Quality and understandability after AI

https://federicopereiro.com/after-ai/
1•swah•14m ago•0 comments

AMD surpasses 40% server CPU revenue share for the first time

https://videocardz.com/newz/amd-surpasses-40-server-cpu-revenue-share-for-the-first-time
3•giuliomagnifico•17m ago•0 comments

Show HN: I built an webpage to showcase Singapore's infra and laws

https://github.com/adityaprasad-sudo/Explore-Singapore
1•curiousbatman•17m ago•0 comments

Copilot Fun – Play terminal games while GitHub Copilot codes for you

https://github.com/sirluky/copilot-fun
3•sirluky•19m ago•2 comments

LocalStack: Moving to paid only from March 2026

https://blog.localstack.cloud/the-road-ahead-for-localstack/
1•hrpnk•22m ago•1 comments

Robots Dream of Agentic Soup

https://punkleadership.com/robots-dream-of-agentic-soup/
1•PretzelFisch•22m ago•0 comments

Show HN: BlockHost OS – Autonomous VM provisioning through smart contracts

https://github.com/mwaddip/blockhost
3•mwaddip•22m ago•0 comments

Ask HN: How to truly sandbox AI tools on a Mac?

2•shelled•25m ago•0 comments

The Rise of Generative AI Large Language Models

https://informationisbeautiful.net/visualizations/the-rise-of-generative-ai-large-language-models...
1•Anon84•26m ago•0 comments

Show HN: Commander, an opinionated yet powerful new tab page

https://chromewebstore.google.com/detail/commander/pgfpnakgiejllklfaamjogeoamalobfp
1•h4ch1•26m ago•0 comments

Show HN: Priset–AI coding agent 4 IntelliJ,VSCode tht doesn't train on your code

https://plugins.jetbrains.com/plugin/29894-priset--the-autonomous-ai-engineering-partner
1•Priset-AI•26m ago•1 comments

Byte magazine artist Robert Tinney, who illustrated the birth of PCs, dies at 78

https://arstechnica.com/gadgets/2026/02/byte-magazine-artist-robert-tinney-who-illustrated-the-bi...
2•rbanffy•27m ago•0 comments

Salesforce's "SaaS Seat License Crisis": Transitioning to AI Digital Headcount

https://open.spotify.com/episode/4FW1nveIMeXgdyDp72zIMQ
1•timarits•28m ago•2 comments

Molten Salt Technology Validated

https://www.marinelink.com/news/molten-salt-technology-validated-535563
1•mpweiher•30m ago•0 comments

Long March-10 in-flight abort and rocket landing demostration [video]

https://www.youtube.com/watch?v=1huIM_ip6bQ
6•u1hcw9nx•33m ago•0 comments

Show HN: SC-NeuroCore – Rust neuromorphic compiler, 512× speedup

https://github.com/anulum/sc-neurocore
2•anulum•35m ago•0 comments