frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Tilth v0.4.1 – 29% cheaper Sonnet, 22% on Opus (benchmark: 114 runs)

2•jahala•1h ago
Smart code reading for humans and AI agents. Tilth is what happens when you give ripgrep, tree-sitter, and cat a shared brain.

--

v0.4.0 added search ranking, sibling surfacing, transitive callees, cognitive load stripping, smart truncation, and bloom filters. Got -17% on Sonnet, -20% on Opus.

v0.4.1 was pure instruction tuning — zero code changes that alone jumped Sonnet adoption from 89% to 98% and $ cost/correct answer from -17% to -29%.

The instruction tuning result surprised me. The model already knew tilth tools existed — it just wasn’t choosing them consistently. Making the replacement relationship explicit in the tool description was worth more than all the search ranking work in v0.4.0.

Haiku remains the outlier — only 42% tilth adoption despite instruction tuning.

--

https://github.com/jahala/tilth/

Full results: https://github.com/jahala/tilth/blob/main/benchmark/README.m...

-- PS: I dont have the budget to run the benchmark a lot (especially with Opus), so if any token whales has capacity to run some benchmarks, please feel free to PR results.

I graded 234 stocks on free cash flow (not earnings)

https://aureus-swart.vercel.app
1•babylonprince•57s ago•0 comments

Watching an elderly relative trying to use the modern web

1•ColinWright•1m ago•0 comments

Ask HN: What is something someone else did that made your day better?

1•blahaj•1m ago•0 comments

Show HN: OpenEntropy – 47 hardware entropy sources from your computer's physics

https://github.com/amenti-labs/openentropy
1•amentiflow•2m ago•0 comments

Shard – A Distributed P2P AI Network for Shared Inference

https://github.com/TrentPierce/Shard
1•tpierce89•3m ago•1 comments

Earn $50 PER REFERRAL and $2 PER CLICK

https://hunnypack.com/
1•bokeke1•5m ago•0 comments

A fluid can store solar energy and then release it as heat months later

https://arstechnica.com/science/2026/02/dna-inspired-molecule-breaks-records-for-storing-solar-heat/
1•pseudolus•7m ago•0 comments

Could an Electronic Real-Time Coach Help Ski Jumpers Leap Farther?

https://www.nytimes.com/2026/02/15/science/olympics-technology-ski-jump.html
1•bookofjoe•12m ago•1 comments

The End of the Office

https://blog.andrewyang.com/p/the-end-of-the-office
2•cebert•16m ago•0 comments

Meta patented an AI that lets you keep posting from beyond the grave

https://www.businessinsider.com/meta-granted-patent-for-ai-llm-bot-dead-paused-accounts-2026-2
1•johnhamlin•17m ago•0 comments

Show HN: Task Automation Analysis of Labor Statistics and O*Net Jobs Data

1•Falimonda•18m ago•0 comments

I don't think AI performance will plateau

https://honnibal.dev/blog/ai-bubble
1•syllogism•21m ago•2 comments

Racket Syntax: The Great, the Good and the Back-to-the-Drawing-Board (2024) [video]

https://www.youtube.com/watch?v=ZtTqRH1uwu4
1•so-cal-schemer•22m ago•1 comments

MacKenzie Scott's $26B Sugar Pile

https://garryslist.org/posts/mackenzie-scott-s-26-billion-sugar-pile
2•gmays•22m ago•0 comments

Game developers and pixel artists are losing their jobs

https://www.sprite-ai.art
1•tjco•23m ago•2 comments

What would a "permissions-first ORM" look like? Looking for spec feedback

https://typescript-superapp.bunnytech.app/docs
1•iosifnicolae2•24m ago•2 comments

Instagram boss says 16 hours of daily use is 'problematic' not addiction

https://www.bbc.com/news/articles/cn71mgmzljlo
1•pseudolus•25m ago•0 comments

Earthquake Magnitude Scale

https://www.mtu.edu/geo/community/seismology/learn/earthquake-measure/magnitude/
1•teleforce•29m ago•0 comments

India's 'AI Impact Summit' Promises Little More Than Spectacle

https://internetfreedom.in/indias-ai-impact-summit-promises-little-more-than-spectacle/
1•akbarnama•29m ago•0 comments

AI Writes Code in Seconds. Why Do Your Tests Take Minutes?

http://stumpy.ai/blog/your-ai-writes-code-in-seconds
1•bluesnowmonkey•30m ago•2 comments

Robert Duvall, Oscar-winning actor and 'Godfather' mainstay, dead at 95

https://www.cnbc.com/2026/02/16/robert-duvall-dies-at-95.html
2•pseudolus•33m ago•1 comments

Rare Pokemon card sets record with $16.5M sale

https://www.japantimes.co.jp/life/2026/02/16/digital/pokemon-card-sale-most-expensive-pikachu-ill...
1•anigbrowl•35m ago•0 comments

Beating GPT-2 for less than $100 – Andrej Karpathy

https://github.com/karpathy/nanochat/discussions/481
2•logicprog•39m ago•0 comments

Show HN: Bulwark – Open-source governance layer for AI agents (Rust, MCP-native)

https://github.com/bpolania/bulwark
1•bpolania•42m ago•2 comments

Ask HN: Best roles in tech where I can be in meetings mostly?

2•general_reveal•43m ago•2 comments

Vulnerabilities in cloud-based password managers [pdf]

https://eprint.iacr.org/2026/058.pdf
3•leobdkr•45m ago•2 comments

Ask HN: Which password manager do you use / would you recommend?

3•unodonut•47m ago•10 comments

Linux CVE Assignment Process

http://www.kroah.com/log/blog/2026/02/16/linux-cve-assignment-process/
2•LorenDB•49m ago•1 comments

Lack of measurement invariance in mental health across intelligence levels

https://www.sciencedirect.com/science/article/abs/pii/S0160289625000662
1•i7l•50m ago•0 comments

Show HN: Krea iPad – real-time editing model with Apple Pencil input

https://twitter.com/venturetwins/status/2023107207500566675
1•dvrp•51m ago•0 comments