Show HN: RAG chunk size "best practices" failed on legal text – I benchmarked it

https://medium.com/@TheWake/i-built-a-rag-tuning-tool-and-discovered-intuition-fails-on-legal-text-9744be9a4bc5
2•metawake•1h ago

Comments

metawake•1h ago
Author here. Built RagTune to stop guessing at RAG configs.

Surprising findings:

1. On legal text (CaseHOLD), 1024 chunks scored WORST (0.618). The "small" 256 chunks won (0.664). That's a ~7% relative swing.

2. On Wikipedia text? All chunk sizes hit ~99%. No difference.

3. Plot twist: At 5K docs, optimal chunk size FLIPPED from 256→1024. Scale changes everything.

Code is MIT: github.com/metawake/ragtune

Happy to discuss methodology.

patrakov•1h ago
Now that you have 5K docs, can you try estimating the statistical uncertainty of the Recall@5 and MRR metrics measured on the smaller datasets? Just draw several different 400-document subsets of the whole 5K HotpotQA dataset and recalculate the metrics on each.
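
The subset check suggested above could be sketched roughly like this. It is a minimal sketch under stated assumptions, not RagTune's code: `subset_spread`, `toy_evaluate`, and the fake doc ids are hypothetical names, and `toy_evaluate` only stands in for a real run that would index the 400-document subset, query it, and score the results. The standard deviation across subsets gives a rough noise floor to compare against differences between chunk sizes.

```python
import random
import statistics
from typing import Callable, Dict, List, Sequence, Set


def recall_at_k(ranked: Sequence[str], relevant: Set[str], k: int = 5) -> float:
    """Fraction of relevant doc ids that appear in the top-k results."""
    if not relevant:
        return 0.0
    return len(set(ranked[:k]) & relevant) / len(relevant)


def reciprocal_rank(ranked: Sequence[str], relevant: Set[str]) -> float:
    """1 / rank of the first relevant result, or 0.0 if none was retrieved."""
    for rank, doc_id in enumerate(ranked, start=1):
        if doc_id in relevant:
            return 1.0 / rank
    return 0.0


def subset_spread(
    doc_ids: List[str],
    evaluate: Callable[[List[str]], Dict[str, float]],
    subset_size: int = 400,
    n_subsets: int = 20,
    seed: int = 0,
) -> Dict[str, Dict[str, float]]:
    """Re-run `evaluate` on random subsets and report mean/stdev per metric."""
    rng = random.Random(seed)
    runs = [evaluate(rng.sample(doc_ids, subset_size)) for _ in range(n_subsets)]
    summary = {}
    for name in runs[0]:
        values = [run[name] for run in runs]
        summary[name] = {
            "mean": statistics.mean(values),
            "stdev": statistics.stdev(values),
        }
    return summary


def toy_evaluate(subset: List[str]) -> Dict[str, float]:
    """Hypothetical stand-in for a real retrieval run over `subset`.

    Each doc is treated as the gold answer to one query. Docs whose numeric
    id is divisible by 3 are "hard": the fake retriever ranks them 7th,
    outside the top 5. All other docs are ranked 1st.
    """
    recalls, rrs = [], []
    fillers = [f"filler{j}" for j in range(6)]
    for doc_id in subset:
        hard = int(doc_id[3:]) % 3 == 0
        ranked = fillers + [doc_id] if hard else [doc_id] + fillers
        recalls.append(recall_at_k(ranked, {doc_id}))
        rrs.append(reciprocal_rank(ranked, {doc_id}))
    return {"recall@5": statistics.mean(recalls), "mrr": statistics.mean(rrs)}


if __name__ == "__main__":
    docs = [f"doc{i}" for i in range(5000)]  # made-up ids, not HotpotQA
    for metric, stats in subset_spread(docs, toy_evaluate).items():
        print(f"{metric}: mean={stats['mean']:.3f} stdev={stats['stdev']:.3f}")
```

With a real pipeline, `evaluate` would be replaced by a function that rebuilds the index for the sampled documents and keeps only the queries whose gold documents are in the subset.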
