frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Show HN: Fenced – Markdown to self-contained HTML with decorated code blocks

https://fenced.dev/
1•lars-dev•3s ago•0 comments

Under the Trump crypto playbook, the family always wins

https://www.reuters.com/investigations/under-trump-crypto-playbook-family-always-wins-investors-d...
1•puzzlingcaptcha•14s ago•0 comments

Ford's AI rally recalls the dot com bubble

https://www.bloomberg.com/opinion/articles/2026-06-08/ford-s-ai-rally-recalls-the-dot-com-bubble
1•xrd•1m ago•0 comments

Show HN: Welter's game, a little Nim-like combinatorial game

https://welter.fuglede.dk
1•fuglede_•1m ago•0 comments

Generate synthetic FHIR patient records for EHR testing

https://apify.com/johnvc/synthea-medical-record-generator-api/examples/synthetic-fhir-patient-rec...
1•johncole•2m ago•0 comments

Show HN: Learn from 30 historical figures, open source, nonprofit, self-hosted

https://github.com/chipmates/agoracosmica
1•micstradev•3m ago•1 comments

A new database category – the predictive database

https://aito.ai/blog/introducing-a-new-database-category-the-predictive-database/
1•arauhala•4m ago•0 comments

Show HN: Sheaft – simulate service failures before deploying

https://github.com/MB3R-Lab/Sheaft
1•a-a-k•7m ago•0 comments

Pg_plan_advice: Plan Stability and User Planner Control for PostgreSQL?

http://rhaas.blogspot.com/2026/03/pgplanadvice-plan-stability-and-user.html
1•ksec•10m ago•0 comments

Anbernic RG Rotate handheld game console coming soon for $88 and up

https://liliputing.com/anbernic-rg-rotate-handheld-game-console-with-flip-out-display-coming-soon...
1•PaulHoule•10m ago•0 comments

FlashMemory-DeepSeek-V4

https://arxiv.org/abs/2606.09079
1•GaggiX•11m ago•1 comments

Most "Chat with Your Data" Products Will Fail

https://medium.com/codex/most-chat-with-your-data-products-will-fail-5956f1aff212
1•theanonymousone•11m ago•0 comments

ChiWhoBike Interview #93

https://chiwho.bike/interviews/93
1•mooreds•12m ago•0 comments

Kaku: A fast, out-of-the-box macOS terminal built for AI coding

https://kaku.fun/
1•emschwartz•12m ago•0 comments

AI won't replace your mentor, but it might be the only one available

https://irowell.io/blog/AI_won%27t_replace_your_mentor,_but_it_might_be_the_only_one_available
1•irowell•13m ago•0 comments

Show HN: Gravity – interactive solar-system simulator, from Newton to Einstein

https://qunabu.github.io/Gravity/
1•qunabu•13m ago•0 comments

Canada to ban social media for kids under 16

https://cultmtl.com/2026/06/canada-to-ban-social-media-for-kids-under-16/
1•mooreds•13m ago•0 comments

Performance Improvements in JDK 26

https://inside.java/2026/06/09/jdk-26-performance-improvements/
1•rowbin•13m ago•0 comments

AI and Identity Survey Findings

https://fusionauth.io/blog/2026-ai-identity-report
1•mooreds•14m ago•0 comments

Apple Outlines AI and Developer Tool Updates at Platforms State of the Union

https://www.macrumors.com/2026/06/09/apple-outlines-major-ai-and-developer-tool-updates/
1•ksec•14m ago•0 comments

Jaydebeapiarrow – A high-performance JDBC bridge for Python

https://github.com/HenryNebula/jaydebeapiarrow
1•HenryNebula•16m ago•0 comments

A living journal of your pet's life that happens to be smart

https://tamadoggo.com
1•sorinvasiliu•18m ago•0 comments

I Saw the Future of Windows at Microsoft Build, and It's Unrecognizable

https://www.pcmag.com/opinions/i-saw-the-future-of-windows-at-microsoft-build-2026-and-its-unreco...
4•redbell•21m ago•1 comments

'Picks and shovels' companies profit from data centre scramble

https://www.ft.com/content/5ede9d4d-3989-49b5-a282-4722c8d8fc59
1•1vuio0pswjnm7•24m ago•0 comments

Kan Extensions in Haskell

https://bartoszmilewski.com/2026/06/08/kan-extensions-in-haskell/
1•ibobev•25m ago•0 comments

SmithDB

https://buttondown.com/jaffray/archive/smithdb/
1•ibobev•26m ago•0 comments

Siri AI? I'll believe it when I see it

https://www.macworld.com/article/3158943/siri-ai-ill-believe-it-when-i-see-it.html
1•thm•26m ago•0 comments

A cute little trick to running classic IIR filters on the GPU

https://themaister.net/blog/2026/06/08/a-cute-little-trick-to-running-classic-iir-filters-on-the-...
2•ibobev•26m ago•0 comments

Rust goal: cargo script (like uv does for Python)

https://rust-lang.github.io/rfcs/3935-Project-Goals-2026.html#running-rust-scripts-will-get-more-...
1•IdiotSavage•26m ago•1 comments

Visual Information Theory

https://colah.github.io/posts/2015-09-Visual-Information/
2•montyanderson•27m ago•0 comments
Open in hackernews

Show HN: Fast and Quality Code Chunking with Chonkie

1•snyy•1y ago
Hi HN,

We’re Chonkie (https://github.com/chonkie-inc/chonkie) — we build open source tools that help split documents into meaningful chunks for use with AI models.

When you use LLMs over large documents or codebases, you often need to break them into smaller parts to fit the model’s context window. Our chunkers do this in a smart way: they preserve structure and meaning, so only the most relevant pieces are passed into the model. This reduces hallucinations, avoids confusion, and improves performance and accuracy.

Today we’re launching our Code Chunker — a fast, structure-aware way to break down source code into high-quality, token-aware chunks.

How it works:

(See the code: https://github.com/chonkie-inc/chonkie/blob/main/src/chonkie...)

Code Chunker uses tree-sitter (https://tree-sitter.github.io/tree-sitter/) to parse your code into an abstract syntax tree (AST). It then recursively merges and groups nodes in a way that respects both code structure and token limits.

It supports all languages that tree-sitter supports, and is designed to preserve formatting and semantics. Large functions or class definitions won’t be split in the middle of a block — instead, we dive recursively into the AST to produce clean, coherent chunks that fit your configured token budget.

What it’s useful for:

  - Embedding-based code search

  - RAG (retrieval-augmented generation) over codebases

  - Long-context analysis of code

  - Preparing repos for fine-tuning or pretraining
Try it out:

  - Open source package: https://docs.chonkie.ai/chunkers/code-chunker

  - Hosted playground (free with account): https://cloud.chonkie.ai
Happy Chonking!