frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Evaluations in the Real World?

1•stichers•1h ago
What do you think about measuring agentic AI in practice. A few weeks ago I read something on Anthropic’s blog on evals for AI agents https://www.anthropic.com/engineering/demystifying-evals-for-ai-agents, and then yesterday saw this on Medium https://medium.com/quantumblack/evaluations-for-the-agentic-world-c3c150f0dd5a Feels like this is becoming a thing.

Anthropic talk about how to structure agent evals and what they’ve learned from running these internally. The QuantumBlack post clooks more programmatic or lifecyle focussed. How evals need to change once agents are combined and using tools. What to do whenthey're deployed, and how to factor them in early.

Curious what peeple are doing in the real world. Are you rolling your own task suites? Using offline + online evals? Or mostly vibes and logs for now?

The Hidden Costs of Additions to a System

https://leomax.fyi/blog/the-hidden-costs-of-additions-to-a-system/
1•MaxMussio•1m ago•0 comments

FOSDEM 2026

https://fosdem.org/2026/schedule/
1•torvald•2m ago•0 comments

PicoDAV – Single-file WebDAV server with file manager/text editor

https://github.com/kd2org/picodav
2•indigodaddy•6m ago•0 comments

China's Q-Ship Containerized Weapon System

https://www.hisutton.com/Chinese-Q-Ship.html
1•u1hcw9nx•6m ago•0 comments

Board Games in Ancient Fiction: Egypt, Iran, Greece

https://reference-global.com/article/10.2478/bgs-2022-0016
1•bryanrasmussen•6m ago•0 comments

The Charge of the Rohirrim Is the Most Epic Scene Ever Filmed

https://www.extended-cut.com/p/the-charge-of-the-rohirrim-is-the
1•ewpierce•8m ago•0 comments

The Advantage2 is approaching end-of-life

https://t.e2ma.net/webview/dv1nen/5d9af704c152d22050f438e1a95703e2
1•ndrake•8m ago•0 comments

Pussh: A simple parallel SSH tool for batch command execution by Bearstech

https://github.com/bearstech/pussh
1•kadrek•10m ago•0 comments

How "95%" escaped into the world – and why so many believed it

https://www.exponentialview.co/p/how-95-escaped-into-the-world
1•Jimmc414•12m ago•0 comments

Wisconsin communities signed secrecy deals for billion-dollar data centers

https://www.wpr.org/news/4-wisconsin-communities-signed-secrecy-deals-billion-dollar-data-centers
7•sseagull•12m ago•1 comments

Daily-Driving an IBM Mainframe () [video]

https://www.youtube.com/watch?v=Dn1E2On_sok
1•rbanffy•16m ago•0 comments

Adversarial Wordle

https://www.savageevan.com/adversarial-wordle/
1•candu•16m ago•1 comments

Trump Nominates Kevin Warsh for Federal Reserve Chair to Succeed Jerome Powell

https://www.cnbc.com/2026/01/30/trump-nominates-kevin-warsh-for-federal-reserve-chair-to-succeed-...
1•throw0101c•17m ago•2 comments

Show HN: A tiny React lib for avatar fallbacks

https://www.facehash.dev/
2•frenchriera•17m ago•0 comments

Zo-Topia: My Computer in the Cloud

https://www.jplhomer.org/posts/zo-topia-my-zo-computer-experience/
1•erhuve•22m ago•1 comments

Norway EV Push Nears 100 Percent: What's Next?

https://spectrum.ieee.org/norway-ev-policy-electric-vehicles
3•rbanffy•23m ago•0 comments

How to avoid common AI pitfalls in the workplace

https://www.economist.com/briefing/2026/01/29/how-to-avoid-common-ai-pitfalls-in-the-workplace
1•Anon84•24m ago•0 comments

Apple-1 Computer Prototype Board #0 Sells for $2.7M

https://www.rrauction.com/auctions/lot-detail/350902407346003-apple-1-computer-prototype-board-0-...
2•oldnetguy•24m ago•0 comments

Nvidia's 10-year effort to make the Shield TV the most updated Android

https://arstechnica.com/gadgets/2026/01/inside-nvidias-10-year-effort-to-make-the-shield-tv-the-m...
2•vanburen•25m ago•0 comments

From bad omen to national treasure:rare bone-swallower stork saved byfemale army

https://www.bbc.com/future/article/20260128-the-protectors-of-indias-greater-adjutant-storks
1•koolhead17•25m ago•0 comments

I've been compiling my Sass wrong, for years

https://www.alwaystwisted.com/articles/ive-been-compiling-my-sass-wrong-for-years
1•speckx•27m ago•0 comments

Nailz: Input device using touch-enabled nails (2021)

https://www.youtube.com/watch?v=O_uiL49IA2I
1•downboots•27m ago•0 comments

Why Wide Top Surfaces Are Essential for Shared Spaces

https://dreamhomestore.co.uk/products/3-over-4-chest-of-drawers
1•dreamhomestore•28m ago•1 comments

Best Gas Masks

https://www.theverge.com/policy/868571/best-gas-masks
1•dtj1123•29m ago•0 comments

How to Run Self-Hosted LLMs on Kubernetes

https://oneuptime.com/blog/post/2026-01-29-self-hosted-llms-on-kubernetes/view
1•ndhandala•29m ago•0 comments

Running Out of Claude? How to Use Self-Hosted LLMs with Moltbot

https://oneuptime.com/blog/post/2026-01-30-self-hosted-llm-with-moltbot/view
2•ndhandala•30m ago•0 comments

ISS SIM

https://iss-sim.spacex.com/
2•belter•31m ago•0 comments

Why Are You Still Dizzy?

https://dizzypt.substack.com/p/why-you-are-still-dizzy-the-survival
1•DIZZYPT•32m ago•0 comments

Show HN: Unrar5j – Pure Java RAR5 Extractor

https://github.com/RealBurst/unrar5j
1•RealBurst•32m ago•1 comments

Monitiser – Automated Social Media content generation and posting

https://monitiser.com/
1•jackegerton•32m ago•1 comments