Show HN: Fast and Quality Code Chunking with Chonkie

1•snyy•10mo ago
Hi HN,

We’re Chonkie (https://github.com/chonkie-inc/chonkie) — we build open source tools that help split documents into meaningful chunks for use with AI models.

When you use LLMs over large documents or codebases, you often need to break them into smaller parts to fit the model’s context window. Our chunkers split along the document’s own structure so each chunk keeps its meaning, and only the most relevant pieces need to be passed to the model. In our experience this reduces hallucinations and confusion and improves accuracy.

Today we’re launching our Code Chunker — a fast, structure-aware way to break down source code into high-quality, token-aware chunks.

How it works:

(See the code: https://github.com/chonkie-inc/chonkie/blob/main/src/chonkie...)

Code Chunker uses tree-sitter (https://tree-sitter.github.io/tree-sitter/) to parse your code into an abstract syntax tree (AST). It then recursively merges and groups nodes in a way that respects both code structure and token limits.

It supports all languages that tree-sitter supports, and is designed to preserve formatting and semantics. Large functions or class definitions won’t be split in the middle of a block — instead, we dive recursively into the AST to produce clean, coherent chunks that fit your configured token budget.
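
To make the recursive idea concrete, here is a minimal sketch in Python. It is not Chonkie's implementation: it assumes Python's standard `ast` module as a stand-in for tree-sitter (so it only handles Python source), and a whitespace word count as a stand-in for a real tokenizer. The names `count_tokens`, `_leaf_spans`, and `chunk_code` are hypothetical.

```python
from __future__ import annotations

import ast


def count_tokens(text: str) -> int:
    # Assumption: whitespace-separated words stand in for a real tokenizer.
    return len(text.split())


def _leaf_spans(nodes, start, end, lines, max_tokens):
    """Tile lines[start:end] with (start, stop) spans, one per AST node.

    A node whose span exceeds the budget is replaced by the spans of its
    children, recursively. Lines between nodes (blank lines, comments,
    headers such as "class B:") are attached to the node that follows them.
    """
    if not nodes:
        return [(start, end)] if start < end else []
    spans, cursor = [], start
    for i, node in enumerate(nodes):
        # The last node absorbs any trailing lines so nothing is dropped.
        stop = end if i == len(nodes) - 1 else min(node.end_lineno, end)
        if stop <= cursor:
            continue  # e.g. several statements on one line
        children = getattr(node, "body", None)
        too_big = count_tokens("".join(lines[cursor:stop])) > max_tokens
        if too_big and isinstance(children, list) and children:
            spans.extend(_leaf_spans(children, cursor, stop, lines, max_tokens))
        else:
            # Fits the budget, or cannot be split further at the statement level.
            spans.append((cursor, stop))
        cursor = stop
    return spans


def chunk_code(source: str, max_tokens: int) -> list[str]:
    """Split Python source into chunks that follow statement boundaries."""
    lines = source.splitlines(keepends=True)
    spans = _leaf_spans(ast.parse(source).body, 0, len(lines), lines, max_tokens)

    # Greedily merge adjacent spans while the merged text stays within budget.
    chunks, current = [], ""
    for s, e in spans:
        piece = "".join(lines[s:e])
        if current and count_tokens(current + piece) > max_tokens:
            chunks.append(current)
            current = ""
        current += piece
    if current:
        chunks.append(current)
    return chunks
```

Because the spans tile the file line by line, joining the chunks reproduces the original source exactly, which is one simple way to preserve formatting. A single statement that is larger than the budget is kept whole in this sketch rather than cut mid-statement.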

What it’s useful for:

  - Embedding-based code search

  - RAG (retrieval-augmented generation) over codebases

  - Long-context analysis of code

  - Preparing repos for fine-tuning or pretraining

Try it out:

  - Open source package: https://docs.chonkie.ai/chunkers/code-chunker

  - Hosted playground (free with account): https://cloud.chonkie.ai

Happy Chonking!
