frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Fast and Quality Code Chunking with Chonkie

1•snyy•10mo ago
Hi HN,

We’re Chonkie (https://github.com/chonkie-inc/chonkie) — we build open source tools that help split documents into meaningful chunks for use with AI models.

When you use LLMs over large documents or codebases, you often need to break them into smaller parts to fit the model’s context window. Our chunkers do this in a smart way: they preserve structure and meaning, so only the most relevant pieces are passed into the model. This reduces hallucinations, avoids confusion, and improves performance and accuracy.

Today we’re launching our Code Chunker — a fast, structure-aware way to break down source code into high-quality, token-aware chunks.

How it works:

(See the code: https://github.com/chonkie-inc/chonkie/blob/main/src/chonkie...)

Code Chunker uses tree-sitter (https://tree-sitter.github.io/tree-sitter/) to parse your code into an abstract syntax tree (AST). It then recursively merges and groups nodes in a way that respects both code structure and token limits.

It supports all languages that tree-sitter supports, and is designed to preserve formatting and semantics. Large functions or class definitions won’t be split in the middle of a block — instead, we dive recursively into the AST to produce clean, coherent chunks that fit your configured token budget.

What it’s useful for:

  - Embedding-based code search

  - RAG (retrieval-augmented generation) over codebases

  - Long-context analysis of code

  - Preparing repos for fine-tuning or pretraining
Try it out:

  - Open source package: https://docs.chonkie.ai/chunkers/code-chunker

  - Hosted playground (free with account): https://cloud.chonkie.ai
Happy Chonking!

Ask HN: Current role feels unsustainable, but I'm not excited by any alternative

1•pella_may•9s ago•0 comments

Show HN: Caaspp Explorer

https://tools.encona.com/caaspp-explorer
1•rahimnathwani•1m ago•0 comments

Show HN: Salary Converter – Compare real purchasing power across 182 cities

https://salary-converter.com/
1•jay7gr•2m ago•0 comments

Sam Altman: "We see a future where intelligence is a utility"

https://old.reddit.com/r/ObscurePatentDangers/comments/1rryogu/sam_altman_we_see_a_future_where_i...
1•armcat•3m ago•0 comments

Operationalizing trust in the age of autonomous agents

https://www.kamiwaza.ai/the-inference-firewall-why-enterprise-ai-demands-relationship-based-acces...
1•mooreds•4m ago•0 comments

Rust Project Perspectives on AI

https://nikomatsakis.github.io/rust-project-perspectives-on-ai/feb27-summary.html
1•tcbrah•4m ago•0 comments

Show HN: PIAF – a Rust EDID parser with deep CEA-861 support and no_std support

1•dracowhitefire•4m ago•1 comments

Open-source platform for running and tracking quantum experiments

https://github.com/mareksuchodolski12-hash/kwantowy
1•ProEloElo•4m ago•1 comments

Life is hard, have a token. (2025)

https://getvouchsafe.org/blog/2025-10-08.html
1•mooreds•7m ago•0 comments

Show HN: Bash Theft Auto – a GTA-inspired open-world crime game in pure Bash

1•stuffbymax•7m ago•0 comments

Show HN: Sway, a board game benchmark for quantum computing

https://shukla.io/blog/2026-03/sway.html
2•BinRoo•8m ago•0 comments

Sandboxing AI-Authored Code in GitHub Actions

https://haulos.com/blog/sandboxing-github-actions/
1•s4i•9m ago•0 comments

We saw how 30 AI agent projects handle authorization-93% use unscoped API keys

1•mishrasanjeev•9m ago•0 comments

Native H2 pathways enable biocompatible hydrogenation of alkenes in bacteria

https://www.nature.com/articles/s41557-025-02052-y
2•PaulHoule•10m ago•0 comments

Palantir defends its role in the kill chain: "We are proud of that"

https://www.heise.de/en/news/Palantir-defends-its-role-in-the-kill-chain-We-are-very-very-proud-o...
5•botanical•13m ago•0 comments

'Bit of treachery': US attack on IRIS Dena undermines Indian security ties

https://www.theguardian.com/world/2026/mar/15/us-attack-iris-dena-undermines-indian-security-ties...
3•prmph•13m ago•0 comments

I built a brag doc app to track my impact

https://getexceeds.com/
2•ogeng•13m ago•1 comments

One Hundred Curl Graphs

https://daniel.haxx.se/blog/2026/03/15/one-hundred-curl-graphs/
2•dhruv3006•16m ago•0 comments

Reverse Engineering Apple's GPU Energy Model on the M4 Max

https://www.youtube.com/watch?v=HKxIGgyeISM
4•ricebunny•17m ago•1 comments

K programming: idiom by idiom [pdf]

https://nsl.com/papers/idioms_K3.pdf
2•tosh•18m ago•0 comments

Why are some stars always visible while others come and go with the seasons?

https://theconversation.com/why-are-some-stars-always-visible-while-others-come-and-go-with-the-s...
2•Brajeshwar•20m ago•0 comments

LPT100: A PIC32MZ Emulator for the Iomega ZIP100 Parallel Port Drive

https://www.toughdev.com/content/2026/03/pic32mz-iomega-zip100-parallel-port-emulator-part-2-hard...
2•mdanh2002•20m ago•1 comments

LangGraph human-in-the-loop has a double execution problem

https://blog.raed.dev/posts/langgraph-hitl/
1•Raed667•23m ago•0 comments

I let Claude Code configure my Arch install

https://www.willmorrison.com/blog/03-15-2026-llm-dotfiles
2•willmorrison•23m ago•0 comments

Bulwark – zero-dependency supply chain security gateway

https://github.com/Bluewaves54/Bulwark
1•Bluewaves54•25m ago•1 comments

Nile fisherman earning more from collecting plastic than fish

https://www.theguardian.com/world/2026/mar/15/cairo-fishers-catching-plastic-bottles
2•saikatsg•26m ago•0 comments

Show HN: OpenClaw plugin – hard budget limits for agent tool calls

https://github.com/runcycles/cycles-openclaw-budget-guard
1•amavashev•26m ago•1 comments

EU's Ursula von Der Leyen Calls Europe's Nuclear Exit a "Strategic Mistake" [video]

https://www.youtube.com/watch?v=Q-Pa6_CICjM
4•sorokod•27m ago•1 comments

The Iran War: How America, Israel and Iran Got Here [video]

https://www.youtube.com/watch?v=IKWMrpQOh7Y
2•kklisura•27m ago•1 comments

LLM Architecture Gallery

https://sebastianraschka.com/llm-architecture-gallery/
2•tzury•28m ago•0 comments