frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

U.S. plans to ask visitors to disclose 5 years of social media history

https://www.washingtonpost.com/immigration/2025/12/10/esta-social-media-united-states/
1•cm2187•1m ago•0 comments

Health premiums rose nearly 3x rate of worker earnings over the past 25 years

https://theconversation.com/health-insurance-premiums-rose-nearly-3x-the-rate-of-worker-earnings-...
1•pseudolus•4m ago•0 comments

Show HN: Vibecc – LLM compiler that turns natural language specs into C binaries

https://github.com/Jacques2Marais/vibecc
1•Jacques2Marais•6m ago•0 comments

"I Wasted 8 Years in Crypto": A Builder's Exit Note Goes Viral Across Asia

https://beincrypto.com/i-wasted-8-years-in-crypto/
1•decimalenough•7m ago•0 comments

E-petition debate relating to digital ID – Monday 8 December 2025

https://news.ycombinator.com/
1•hhdave•8m ago•1 comments

Recent Travel News

https://wowfare.com/en-us/blog/pegasus-airlines-launches-direct-istanbul-bilbao-flights/
1•belatwing•8m ago•0 comments

NPM.watch: Track NPM Downloads, Package Safety and Live Stats

https://www.npm.watch
1•Next-Icons•10m ago•1 comments

Show HN: Deploy Kubernetes apps with RunOS, free to use

https://runos.com/blog/runos-open-to-everyone.html
1•didierbreedt•11m ago•0 comments

OVH Public Cloud Database Outage "resolved"

https://public-cloud.status-ovhcloud.com/incidents/4gd0bgz7zm2j
1•voodooEntity•12m ago•1 comments

Nanoparticles that enhance mRNA delivery could reduce vaccine dosage and costs

https://phys.org/news/2025-11-nanoparticles-mrna-delivery-vaccine-dosage.html
1•PaulHoule•12m ago•0 comments

Letting Nvidia sell H200s to China is closing the door after horse has bolted

https://www.theregister.com/2025/12/09/nvidia_h200s_china_ai/
1•pseudolus•13m ago•0 comments

I built an AI that reads your Git history and writes status reports

1•slmslm•13m ago•1 comments

AI will make formal verification go mainstream

https://martin.kleppmann.com/2025/12/08/ai-formal-verification.html
2•mau•20m ago•0 comments

Webb identifies earliest supernova to date, shows host galaxy

https://esawebb.org/news/weic2523/
1•doener•21m ago•0 comments

Local news organizations discover the value of their own archives

https://www.niemanlab.org/2025/12/local-news-organizations-discover-the-value-of-their-own-archives/
2•giuliomagnifico•22m ago•1 comments

Show HN: Sift – Turning chore into a hyper-personalized, immersive journey

https://sift-11a.pages.dev/
1•paperplaneflyr•22m ago•0 comments

Human art in a post-AI world should be strange

https://www.owlposting.com/p/art-in-a-post-ai-world-should-be
2•sebg•24m ago•0 comments

Next Generation Agentic Proxy for AI Agents and MCP Servers

https://github.com/agentgateway/agentgateway
1•mooreds•24m ago•0 comments

Paramount Pictures X Account Hacked to Read 'Proud Arm of the Fascist Regime'

https://variety.com/2025/film/news/paramount-x-account-hacked-proud-arm-of-the-fascist-regime-123...
3•robtherobber•25m ago•0 comments

Meta promises to reduce data sharing for EU users by 2026 to avoid EU GDPR fines

https://www.techradar.com/pro/meta-promises-to-reduce-data-sharing-for-eu-users-by-2026-to-avoid-...
2•robtherobber•27m ago•0 comments

Factory Tours

https://www.scopeofwork.net/on-factory-tours/
1•hermitcrab•29m ago•1 comments

Securing VMware workloads in regulated industries

https://www.technologyreview.com/2025/12/10/1128475/securing-vmware-workloads-in-regulated-indust...
1•fleahunter•30m ago•0 comments

Glide

https://glide.ai
1•bellamoon544•30m ago•2 comments

Ask HN: Does your company spend time on system and API design?

1•AJRF•31m ago•0 comments

Join the on-call roster, it'll change your life

https://serce.me/posts/2025-12-09-join-oncall-it-will-change-your-life
1•furkansahin•31m ago•0 comments

A tiny entropy experiment to push LLMs into unexpected paths

6•seedwtfff•34m ago•2 comments

Truecaller Empowers "The CTO of the Family"

https://techcrunch.com/2025/12/09/truecaller-now-lets-users-protect-households-from-scam-calls/
1•cece2011•34m ago•1 comments

Spec-driven development: Unpacking one key new AI-assisted engineering practices

https://www.thoughtworks.com/en-us/insights/blog/agile-engineering-practices/spec-driven-developm...
1•zeld4•37m ago•0 comments

A European plan to escape American technology

https://ecfr.eu/publication/get-over-your-x-a-european-plan-to-escape-american-technology/
5•padjo•38m ago•2 comments

BazelCon 2025 Recap

https://blog.bazel.build/2025/12/08/bazelcon-recap.html
2•mesto1•44m ago•0 comments
Open in hackernews

Show HN: Fast and Quality Code Chunking with Chonkie

1•snyy•7mo ago
Hi HN,

We’re Chonkie (https://github.com/chonkie-inc/chonkie) — we build open source tools that help split documents into meaningful chunks for use with AI models.

When you use LLMs over large documents or codebases, you often need to break them into smaller parts to fit the model’s context window. Our chunkers do this in a smart way: they preserve structure and meaning, so only the most relevant pieces are passed into the model. This reduces hallucinations, avoids confusion, and improves performance and accuracy.

Today we’re launching our Code Chunker — a fast, structure-aware way to break down source code into high-quality, token-aware chunks.

How it works:

(See the code: https://github.com/chonkie-inc/chonkie/blob/main/src/chonkie...)

Code Chunker uses tree-sitter (https://tree-sitter.github.io/tree-sitter/) to parse your code into an abstract syntax tree (AST). It then recursively merges and groups nodes in a way that respects both code structure and token limits.

It supports all languages that tree-sitter supports, and is designed to preserve formatting and semantics. Large functions or class definitions won’t be split in the middle of a block — instead, we dive recursively into the AST to produce clean, coherent chunks that fit your configured token budget.

What it’s useful for:

  - Embedding-based code search

  - RAG (retrieval-augmented generation) over codebases

  - Long-context analysis of code

  - Preparing repos for fine-tuning or pretraining
Try it out:

  - Open source package: https://docs.chonkie.ai/chunkers/code-chunker

  - Hosted playground (free with account): https://cloud.chonkie.ai
Happy Chonking!