frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Fast and Quality Code Chunking with Chonkie

1•snyy•10mo ago
Hi HN,

We’re Chonkie (https://github.com/chonkie-inc/chonkie) — we build open source tools that help split documents into meaningful chunks for use with AI models.

When you use LLMs over large documents or codebases, you often need to break them into smaller parts to fit the model’s context window. Our chunkers do this in a smart way: they preserve structure and meaning, so only the most relevant pieces are passed into the model. This reduces hallucinations, avoids confusion, and improves performance and accuracy.

Today we’re launching our Code Chunker — a fast, structure-aware way to break down source code into high-quality, token-aware chunks.

How it works:

(See the code: https://github.com/chonkie-inc/chonkie/blob/main/src/chonkie...)

Code Chunker uses tree-sitter (https://tree-sitter.github.io/tree-sitter/) to parse your code into an abstract syntax tree (AST). It then recursively merges and groups nodes in a way that respects both code structure and token limits.

It supports all languages that tree-sitter supports, and is designed to preserve formatting and semantics. Large functions or class definitions won’t be split in the middle of a block — instead, we dive recursively into the AST to produce clean, coherent chunks that fit your configured token budget.

What it’s useful for:

  - Embedding-based code search

  - RAG (retrieval-augmented generation) over codebases

  - Long-context analysis of code

  - Preparing repos for fine-tuning or pretraining
Try it out:

  - Open source package: https://docs.chonkie.ai/chunkers/code-chunker

  - Hosted playground (free with account): https://cloud.chonkie.ai
Happy Chonking!

Show HN: Graph-Based Firebase Alternative with Real-Time Sync

https://github.com/wolfoo2931/linkedrecords
1•WolfOliver•55s ago•0 comments

How an inference provider can prove they're not serving a quantized model

https://tinfoil.sh/blog/2026-02-03-proving-model-identity
1•FrasiertheLion•5m ago•0 comments

I'm with Stupid →

https://unsung.aresluna.org/im-with-stupid-/
2•tobr•6m ago•0 comments

National Parent Teacher Association Breaks Ties with Meta

https://www.cnbc.com/2026/02/20/national-pta-meta-child-safety-trials-zuckerberg.html
2•1vuio0pswjnm7•15m ago•0 comments

The Subject Supposed to Know Nothing: Lacan and the Large Language Model

http://thecombedthunderclap.blogspot.com/2025/05/the-subject-supposed-to-know-nothing.html
1•sb057•15m ago•0 comments

LipoVive: The Stimulant-Free Way to Melt Stubborn Fat (Official 2026 Batch)

https://www.morningstar.com/news/accesswire/1138075msn/lipovive-reviews-shocking-2026-report-what...
1•japxnaty•16m ago•1 comments

Perfect Ottawa Hummus

https://middleeasternstreet.com/blog-detail.html?id=blog-1
1•swengcrunch•20m ago•0 comments

Show HN: Snake and Foes – The classic snake game but with enemies and power-ups

https://ivanca.github.io/snakeandfoes/
1•AmbroseBierce•21m ago•0 comments

There Isn't a Hacker Community for Fundamental Physics (and What It Tells Us)

https://old.reddit.com/r/prequantumcomputing/comments/1ragunw/why_there_isnt_a_hacker_community_f...
1•bkaminsky•22m ago•0 comments

Typed Assembly Language

https://www.cs.cornell.edu/talc/
1•luu•31m ago•0 comments

The UK tourist with a valid visa detained by ICE for six weeks

https://www.theguardian.com/us-news/2026/feb/21/karen-newton-valid-visa-detained-ice
1•n1b0m•31m ago•0 comments

Ajail: A basic jail for programs you don't trust

https://github.com/jtolio/ajail
2•todsacerdoti•31m ago•0 comments

Show HN: How much has the ad industry spent targeting you?

https://attentionworth.com/
1•withshakespeare•38m ago•0 comments

Show HN: OffKit – an iOS app blocker that adds friction

https://apps.apple.com/us/app/offkit-app-blocker/id6758268708
1•nickfthedev•43m ago•0 comments

LDOS: Toward a Learning-Directed Operating System

https://www.sigops.org/2026/ldos-toward-a-learning-directed-operating-system/
2•matt_d•44m ago•0 comments

PromptSpy ushers in the era of Android threats using GenAI

https://www.welivesecurity.com/en/eset-research/promptspy-ushers-in-era-android-threats-using-genai/
1•Cyphase•49m ago•1 comments

Colorado moves age checks from websites to operating systems

https://www.biometricupdate.com/202602/colorado-moves-age-checks-from-websites-to-operating-systems
10•iamnothere•58m ago•8 comments

Pb-ext: Enhanced PocketBase server with monitoring, logging and API docs

https://github.com/magooney-loon/pb-ext
1•thunderbong•1h ago•0 comments

Ruby Is the Best Language for Building AI Apps

https://paolino.me/ruby-is-the-best-language-for-ai-apps/
3•thunderbong•1h ago•0 comments

Back to textbooks: Denmark rolls back digital learning

https://www.france24.com/en/tv-shows/focus/20260106-back-to-textbooks-denmark-rolls-back-digital-...
1•talonx•1h ago•0 comments

Show HN: Free tool to migrate OpenAI Assistants

https://migratetoresponses.com
1•adkfusion•1h ago•0 comments

A Galaxy Composed Almost of Dark Matter Has Been Confirmed

https://www.wired.com/story/a-galaxy-composed-almost-entirely-of-dark-matter-has-been-confirmed/
1•taubek•1h ago•0 comments

Python creator Guido van Rossum asks Elon Musk what SpaceX uses for coding

https://twitter.com/elonmusk/status/2024388903869043061
1•MilnerRoute•1h ago•0 comments

Infographic of the Navy and Air Force build up nearby Iran

https://twitter.com/sentdefender/status/2024929210867839399
1•nomilk•1h ago•0 comments

Acme Weather, from the Creators of Dark Sky

https://apps.apple.com/us/app/acme-weather/id6742032583
1•gammarator•1h ago•0 comments

Free Shadcn/UI patterns for faster UI delivery

https://reui.io
2•shuxer0205•1h ago•1 comments

Show HN: Fix-my-mic – stop macOS from switching to AirPods mic every connection

https://github.com/yigitkonur/cli-disablemic
2•yigitkonur35•1h ago•0 comments

Formula: A VST for coding custom DSP inside your DAW

https://github.com/soundspear/formula
1•peteforde•1h ago•0 comments

A 3000W Water-Cooled Power Supply (With GAN and Sic) [video]

https://www.youtube.com/watch?v=da9GwXX-0Zs
1•dmmalam•1h ago•0 comments

OpenClaw Partners with VirusTotal for Skill Security

https://openclaw.ai/blog/virustotal-partnership
1•iskiifanhaaw•1h ago•0 comments