frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Fast and Quality Code Chunking with Chonkie

1•snyy•10mo ago
Hi HN,

We’re Chonkie (https://github.com/chonkie-inc/chonkie) — we build open source tools that help split documents into meaningful chunks for use with AI models.

When you use LLMs over large documents or codebases, you often need to break them into smaller parts to fit the model’s context window. Our chunkers do this in a smart way: they preserve structure and meaning, so only the most relevant pieces are passed into the model. This reduces hallucinations, avoids confusion, and improves performance and accuracy.

Today we’re launching our Code Chunker — a fast, structure-aware way to break down source code into high-quality, token-aware chunks.

How it works:

(See the code: https://github.com/chonkie-inc/chonkie/blob/main/src/chonkie...)

Code Chunker uses tree-sitter (https://tree-sitter.github.io/tree-sitter/) to parse your code into an abstract syntax tree (AST). It then recursively merges and groups nodes in a way that respects both code structure and token limits.

It supports all languages that tree-sitter supports, and is designed to preserve formatting and semantics. Large functions or class definitions won’t be split in the middle of a block — instead, we dive recursively into the AST to produce clean, coherent chunks that fit your configured token budget.

What it’s useful for:

  - Embedding-based code search

  - RAG (retrieval-augmented generation) over codebases

  - Long-context analysis of code

  - Preparing repos for fine-tuning or pretraining
Try it out:

  - Open source package: https://docs.chonkie.ai/chunkers/code-chunker

  - Hosted playground (free with account): https://cloud.chonkie.ai
Happy Chonking!

Meta acquires AI agent social network Moltbook

https://www.theguardian.com/technology/2026/mar/10/meta-acquires-moltbook-ai-agent-social-network
1•bookofjoe•1m ago•0 comments

Time Travel Has Been Done

1•johnnyVincento•3m ago•0 comments

Qt SQL Under LGPL Despite MariaDB Under GPL

https://www.smallstepsystems.com/qt-sql-under-lgpl-despite-mariadb-under-gpl/
1•jandeboevrie•3m ago•0 comments

Thinking Deeply About Theming and Color Naming

https://css-tricks.com/thinking-deeply-about-theming-and-color-naming/
1•PaulHoule•4m ago•0 comments

Codex Monitor

https://github.com/Dimillian/CodexMonitor
1•tosh•5m ago•0 comments

Show HN: VeilPhantom – Open-source on-device PII detection for AI pipelines

https://helloveil.com/sdk/
1•nakaiwilliams•6m ago•0 comments

10 years of AlphaGo: The turning point for AI [video]

https://www.youtube.com/watch?v=qoinGjj60Fo
1•simonpure•6m ago•0 comments

NASA and SpaceX disagree about manual controls for lunar lander

https://arstechnica.com/space/2026/03/nasa-and-spacex-disagree-about-manual-controls-for-lunar-la...
2•voxadam•7m ago•0 comments

Ask HN: Does automatic multilingual support make sense for a launch platform?

1•LeanVibe•7m ago•0 comments

Planning a multi-city trip without the spreadsheet nightmare

https://explorinder.com
1•pabloceg•7m ago•1 comments

Designing Interfaces for AI Agents Instead of Humans

https://twitter.com/rot13maxi/status/2031429866109710649
1•rot13maxi•8m ago•0 comments

The Clarity Act, Stablecoin Yield and the Chinese Crypto Bogeyman

https://toddhbaker.substack.com/p/the-clarity-act-stablecoin-yield
1•petethomas•9m ago•0 comments

Show HN: OpenMolt – A programmatic AI agent framework for Node.js

https://openmolt.dev
1•ybouane•10m ago•0 comments

AI, Human Cognition and Knowledge Collapse (NBER Working Paper)

https://www.nber.org/papers/w34910
1•imakwana•12m ago•0 comments

Agent API Spec Design: When API Callers Change from Application to AI Agent

https://github.com/tomsun28/agent-api-spec
1•tomsun28•15m ago•1 comments

Prioritizing energy intelligence for sustainable growth

https://www.technologyreview.com/2026/03/10/1133972/prioritizing-energy-intelligence-for-sustaina...
1•joozio•16m ago•0 comments

US added more solar than any other technology in 2025, but is down 14% from 2024

https://www.nytimes.com/2026/03/10/business/energy-environment/donald-trump-solar-energy-batterie...
6•epistasis•17m ago•0 comments

Cardiac side effects of RNA-based SARS-CoV-2 vaccines

https://bpspubs.onlinelibrary.wiley.com/doi/10.1111/bph.16262
3•blumomo•17m ago•1 comments

Epstein Fallout Tracker

https://epstein.observer
1•oldfuture•17m ago•0 comments

Why AI is both a curse and a blessing to open-source developers

https://www.zdnet.com/article/ai-curse-and-blessing-to-open-source-software-developers/
1•CrankyBear•17m ago•1 comments

Astro 6.0

https://astro.build/blog/astro-6/
2•todotask2•18m ago•0 comments

Fixing request smuggling vulnerabilities in Pingora OSS deployments

https://blog.cloudflare.com/pingora-oss-smuggling-vulnerabilities/
1•Tiberium•18m ago•0 comments

Using cookies to hack into a tech college's admission system

https://eaton-works.com/2026/03/09/skcet-hack/
1•nfriedly•19m ago•0 comments

StackOverflow Site Redesign Thread (2026)

https://meta.stackoverflow.com/questions/438177/new-site-design-and-philosophy-for-stack-overflow...
1•all2•20m ago•0 comments

Stay in the Loop: How I Use Claude Code

https://jola.dev/posts/stay-in-the-loop
1•shintoist•21m ago•0 comments

Flock Flocked up: How a license plate camera misread unraveled one man's life

https://www.businessinsider.com/flock-safety-alpr-cameras-misreads-2026-3
9•text0404•21m ago•0 comments

Rcarmo/PhotosExport: Export All Your Data from Apple Photos

https://github.com/rcarmo/PhotosExport
1•rcarmo•21m ago•0 comments

Paying without Google: New consortium wants to remove custom ROM hurdles

https://www.heise.de/en/news/Paying-without-Google-New-consortium-wants-to-remove-custom-ROM-hurd...
3•derbOac•23m ago•0 comments

Show HN: autoautoresearch – Karpathy's autoresearch on steroids

https://github.com/ArmanJR/autoautoresearch
1•armanj•23m ago•0 comments

Block Cut 4k Jobs and Blamed AI. The Truth Is More Complicated

https://newsletter.chainofthought.show/p/block-cut-4000-jobs-and-blamed-ai
3•cliffclimber•24m ago•0 comments