frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Fast and Quality Code Chunking with Chonkie

1•snyy•8mo ago
Hi HN,

We’re Chonkie (https://github.com/chonkie-inc/chonkie) — we build open source tools that help split documents into meaningful chunks for use with AI models.

When you use LLMs over large documents or codebases, you often need to break them into smaller parts to fit the model’s context window. Our chunkers do this in a smart way: they preserve structure and meaning, so only the most relevant pieces are passed into the model. This reduces hallucinations, avoids confusion, and improves performance and accuracy.

Today we’re launching our Code Chunker — a fast, structure-aware way to break down source code into high-quality, token-aware chunks.

How it works:

(See the code: https://github.com/chonkie-inc/chonkie/blob/main/src/chonkie...)

Code Chunker uses tree-sitter (https://tree-sitter.github.io/tree-sitter/) to parse your code into an abstract syntax tree (AST). It then recursively merges and groups nodes in a way that respects both code structure and token limits.

It supports all languages that tree-sitter supports, and is designed to preserve formatting and semantics. Large functions or class definitions won’t be split in the middle of a block — instead, we dive recursively into the AST to produce clean, coherent chunks that fit your configured token budget.

What it’s useful for:

  - Embedding-based code search

  - RAG (retrieval-augmented generation) over codebases

  - Long-context analysis of code

  - Preparing repos for fine-tuning or pretraining
Try it out:

  - Open source package: https://docs.chonkie.ai/chunkers/code-chunker

  - Hosted playground (free with account): https://cloud.chonkie.ai
Happy Chonking!

What came first: the CNAME or the A record?

https://blog.cloudflare.com/cname-a-record-order-dns-standards/
1•sorcix•30s ago•0 comments

XSS in Meta Conversion API Gateway Leading to Zero-Click Account Takeover

https://ysamm.com/uncategorized/2025/01/13/capig-xss.html
1•phwd•1m ago•0 comments

The biggest obstacle for engineer productivity in 2026

https://strategizeyourcareer.com/p/this-ai-problem-is-the-biggest-risk-for-software-engineers-in-...
1•emreb•3m ago•0 comments

Sakana AI Agent Wins AtCoder Heuristic Contest (First AI to Place First)

https://sakana.ai/ahc058/
1•simonpure•3m ago•0 comments

The Power Law, the Grind and the Ugly

https://silvestreperret.com/posts/power-laws/
1•silverret•3m ago•0 comments

The Risk of Too Much Air Safety Regulation (2020) [pdf]

https://www.cato.org/sites/cato.org/files/2020-03/regv43n1-1.pdf
1•JumpinJack_Cash•3m ago•0 comments

Confidence in Tech > Talent in Tech

https://www.thetrueengineer.com/p/confidence-in-tech-talent-in-tech
1•andrewstetsenko•4m ago•0 comments

Amateur sleuth earns £2M reward for exposing research fraud

https://www.thetimes.com/uk/science/article/amateur-sleuth-2m-exposing-research-fraud-jhhb8wfnn
1•bookofjoe•5m ago•1 comments

Moving Beyond Agent-Centric Design: World-Centric Orchestration for AI

https://dev.to/eggp/the-mind-protocol-why-your-ai-agent-needs-a-world-before-it-can-think-2m8p
1•eggplantiny•6m ago•0 comments

OpenAI to acquire the team behind executive coaching AI tool Convogo

https://techcrunch.com/2026/01/08/openai-to-acquire-the-team-behind-executive-coaching-ai-tool-co...
1•gmays•6m ago•0 comments

Repairing a Bose SoundDock iPod Speaker

https://thomashunter.name/posts/2026-01-12-repairing-bose-sounddock-ipod-speaker
1•speckx•7m ago•0 comments

US withdrawing troops from key Middle East bases as precaution

https://www.reuters.com/world/middle-east/us-withdrawing-troops-key-middle-east-bases-precaution-...
1•zerosizedweasle•7m ago•1 comments

Show HN: Convert Go to Rust

1•KingOfCoders•9m ago•0 comments

Open Source AI May Reduce Energy Demands

https://www.cmu.edu/work-that-matters/energy-innovation/open-source-ai-may-reduce-energy-demands
1•atlasunshrugged•10m ago•0 comments

How Machines Shape the Way We Write

https://worldhistory.substack.com/p/how-machines-shape-the-way-we-write
1•crescit_eundo•14m ago•0 comments

Show HN: Beam – A desktop-style browser for iPad built by a solo developer

https://apps.apple.com/us/app/beam-browser/id6756218494
3•henrikdev•14m ago•1 comments

Alan Rickman remembered, 10 years after his death

https://www.theguardian.com/film/2026/jan/14/i-fell-in-love-with-him-on-the-spot-alan-rickman-rem...
2•sohkamyung•15m ago•0 comments

FBI Searches Home of Washington Post Journalist for Classified Documents

https://www.nytimes.com/2026/01/14/us/politics/fbi-washington-post-journalist.html
4•perihelions•15m ago•0 comments

Umami: You never say its name, yet you taste it every day

https://bigthink.com/strange-maps/umami-fifth-taste/
1•Brajeshwar•15m ago•0 comments

How to Become a Tree

https://aeon.co/essays/dying-to-be-green-are-new-eco-funerals-a-false-promise
1•Brajeshwar•15m ago•0 comments

[REVIEW] Very Important People – Ashley Mears

https://www.thepsmiths.com/p/guest-joint-review-very-important
1•barry-cotter•15m ago•0 comments

Show HN: A Serverless Neuro-Symbolic Logic Engine (Interactive Whitepaper)

https://petzi2311.github.io/
1•CausaNova•16m ago•1 comments

Just Get a Better Job

https://idiallo.com/blog/just-get-another-job
1•Brajeshwar•16m ago•0 comments

May your tokens be blessed

https://mfelix.org/stories/may-your-tokens-be-blessed/
1•threekindwords•17m ago•0 comments

The fake bomb detectors (2014)

https://www.bbc.com/news/uk-29459896
1•retSava•17m ago•0 comments

Apple Struggling with Key Material Shortage as AI Chips Drain Supply

https://asia.nikkei.com/business/technology/tech-asia/apple-and-qualcomm-fret-over-strained-suppl...
4•7777777phil•20m ago•0 comments

(Brain) Topological turning points across the human lifespan

https://www.nature.com/articles/s41467-025-65974-8
1•smartmic•21m ago•0 comments

The long-term health impacts from the LA wildfires are just becoming clear

https://www.npr.org/2026/01/14/nx-s1-5630989/la-fires-health-impact-smoke
1•andsoitis•22m ago•0 comments

The grab list: how museums decide what to save in a disaster

https://www.economist.com/1843/2025/11/21/the-grab-list-how-museums-decide-what-to-save-in-a-disa...
2•surprisetalk•22m ago•0 comments

The Slop Was Never the Failure

https://substack.com/home/post/p-183630680
1•gpi•23m ago•0 comments