Show HN: Fast and Quality Code Chunking with Chonkie

1•snyy•9mo ago
Hi HN,

We’re Chonkie (https://github.com/chonkie-inc/chonkie) — we build open source tools that help split documents into meaningful chunks for use with AI models.

When you use LLMs over large documents or codebases, you often need to break them into smaller parts to fit the model’s context window. Our chunkers split along structural boundaries, so each chunk keeps its meaning and only the most relevant pieces need to be passed into the model. This reduces hallucinations, avoids confusion, and improves performance and accuracy.

Today we’re launching our Code Chunker — a fast, structure-aware way to break down source code into high-quality, token-aware chunks.

How it works:

(See the code: https://github.com/chonkie-inc/chonkie/blob/main/src/chonkie...)

Code Chunker uses tree-sitter (https://tree-sitter.github.io/tree-sitter/) to parse your code into an abstract syntax tree (AST). It then recursively merges and groups nodes in a way that respects both code structure and token limits.

It supports all languages that tree-sitter supports, and is designed to preserve formatting and semantics. Large functions or class definitions won’t be split in the middle of a block — instead, we dive recursively into the AST to produce clean, coherent chunks that fit your configured token budget.

What it’s useful for:

  - Embedding-based code search

  - RAG (retrieval-augmented generation) over codebases

  - Long-context analysis of code

  - Preparing repos for fine-tuning or pretraining

Try it out:

  - Open source package: https://docs.chonkie.ai/chunkers/code-chunker

  - Hosted playground (free with account): https://cloud.chonkie.ai

Happy Chonking!
