frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Fast and Quality Code Chunking with Chonkie

1•snyy•10mo ago
Hi HN,

We’re Chonkie (https://github.com/chonkie-inc/chonkie) — we build open source tools that help split documents into meaningful chunks for use with AI models.

When you use LLMs over large documents or codebases, you often need to break them into smaller parts to fit the model’s context window. Our chunkers do this in a smart way: they preserve structure and meaning, so only the most relevant pieces are passed into the model. This reduces hallucinations, avoids confusion, and improves performance and accuracy.

Today we’re launching our Code Chunker — a fast, structure-aware way to break down source code into high-quality, token-aware chunks.

How it works:

(See the code: https://github.com/chonkie-inc/chonkie/blob/main/src/chonkie...)

Code Chunker uses tree-sitter (https://tree-sitter.github.io/tree-sitter/) to parse your code into an abstract syntax tree (AST). It then recursively merges and groups nodes in a way that respects both code structure and token limits.

It supports all languages that tree-sitter supports, and is designed to preserve formatting and semantics. Large functions or class definitions won’t be split in the middle of a block — instead, we dive recursively into the AST to produce clean, coherent chunks that fit your configured token budget.

What it’s useful for:

  - Embedding-based code search

  - RAG (retrieval-augmented generation) over codebases

  - Long-context analysis of code

  - Preparing repos for fine-tuning or pretraining
Try it out:

  - Open source package: https://docs.chonkie.ai/chunkers/code-chunker

  - Hosted playground (free with account): https://cloud.chonkie.ai
Happy Chonking!

Longshot – Built Minecraft in one shot, burned $5500 running 100 coding agents

https://devpost.com/software/longshot
1•talboren•3m ago•0 comments

Show HN: My dream came true: I released a mobile game

https://apps.apple.com/ua/app/color-blocks-sort-huefold/id6757859307
1•skreep•6m ago•0 comments

I hacked ChatGPT and Google's AI – and it only took 20 minutes

https://www.bbc.com/future/article/20260218-i-hacked-chatgpt-and-googles-ai-and-it-only-took-20-m...
1•Stevvo•7m ago•0 comments

Why does resizing a JPG require uploading it?

https://creatoryn.com/
1•Maaz-Sohail•7m ago•1 comments

Jupyter Kernel for Mojo

https://github.com/AnswerDotAI/mojokernel
1•tosh•8m ago•0 comments

Accenture combats AI refuseniks by linking promotions to log-ins

https://www.ft.com/content/ac672f97-a603-4c56-afa3-4a5273d45674
1•TrackerFF•8m ago•1 comments

What Do A.I. Chatbots Discuss Among Themselves? We Sent One to Find Out

https://www.nytimes.com/2026/02/18/upshot/moltbook-artificial-intelligence-ai.html
1•Anon84•10m ago•0 comments

Greece throws support behind social media bans for kids

https://www.euractiv.com/news/greece-throws-support-behind-social-media-bans-for-kids/
1•doener•12m ago•0 comments

Electrobun

https://blackboard.sh/electrobun/docs/
2•handfuloflight•20m ago•0 comments

After 3 yrs of forcing myself to love VC, I quit and became a DJ in Bali

https://www.businessinsider.com/left-venture-capital-career-to-become-silent-disco-dj-bali-2026-2
1•rafaepta•24m ago•0 comments

Ganttdown: Turn Markdown task lists into Gantt charts instantly

https://ganttdown.vercel.app/
2•sssecasiu•25m ago•1 comments

Pthinc/BCE-Prettybird-Micro-Standard-v0.0.1

https://huggingface.co/datasets/pthinc/BCE-Prettybird-Micro-Standard-v0.0.1
1•pthuser•25m ago•0 comments

Without America to rely on, EU gearing up to be a global power in its own right

https://www.theatlantic.com/international/2026/02/european-union-defense-spending/685983/
1•saubeidl•27m ago•0 comments

What Does AI Think about the Kung Fu Robot Show in the Chinese New Year Gala?

https://poe.com/s/PSuM9UlCSlNtWTmnHQEh
1•seekdeep•27m ago•1 comments

os: An operating system for the IBM PC written in machine code

https://github.com/jpcregan/os
1•hexer292•27m ago•1 comments

Show HN: Axon – Open-source agentic AI with approval gates (Apache 2.0)

https://github.com/NeuroVexon/axon-community
1•NeuroVexon•28m ago•1 comments

Freedom Is Coming

https://freedom.gov
3•seanweng•29m ago•1 comments

ShannonMax: A Library to Optimize Emacs Keybindings with Information Theory

https://github.com/sstraust/shannonmax
2•sammy0910•30m ago•0 comments

Stock Slide and Slow Sales: What's Happening in China's E.V. Market?

https://www.nytimes.com/2026/02/19/business/china-electric-vehicle-troubles.html
1•ilamont•30m ago•0 comments

Show HN: Clipthesis – free, local app to tag and search video across your drives

https://clipthesis.com/
1•hugorut•31m ago•0 comments

Former Prince Andrew Arrested over Epstein Probe

https://www.wsj.com/world/uk/former-prince-andrew-arrested-over-epstein-probe-bbc-reports-7779cc1e
5•KoftaBob•33m ago•0 comments

Show HN: Aegis.rs, the first open source Rust-based LLM security proxy

https://github.com/ParzivalHack/Aegis.rs
1•ParzivalHack•34m ago•0 comments

Show HN: I built a compliance scanner that flags WCAG GDPR and FTC risks in mins

https://www.rataify.com/
1•CraftyGuru•35m ago•0 comments

A Theoretical View on 'Something Big Is Happening'

https://telemetryagent.dev/blog/theoretical-view-something-big
1•martvdjagt•36m ago•0 comments

Bridging Elixir and Python with Oban

https://oban.pro/articles/bridging-with-oban
3•sorentwo•39m ago•0 comments

Show HN: What We Learned: a 3 question meeting closure tool

https://www.cognu.app/what-we-learned
1•anticlickwise•40m ago•0 comments

A Technical Intro to the Fediverse

https://www.krisdigital.com/en/blog/2026/02/18/technical-intro-fediverse/
1•krisdigital•40m ago•1 comments

Show HN: Schema Sentry – Type-Safe JSON-LD for Next.js with CI-Grade Validation

https://github.com/arindamdawn/schema-sentry
1•arindamdawn•40m ago•0 comments

Show HN: Elecxzy – A lightweight, Lisp-free Emacs-like editor in Electron

https://github.com/kurouna/elecxzy
1•kurouna•41m ago•0 comments

Europe Worries About Another Trump Blowup, This One on Tech

https://www.nytimes.com/2026/02/19/world/europe/europe-united-states-trump-digital-services-act.html
2•Doches•43m ago•1 comments