frontpage.
Made with ♥ by @iamnishanth

Evaluating AI agents: Real-world lessons from building agentic systems at Amazon

https://aws.amazon.com/blogs/machine-learning/evaluating-ai-agents-real-world-lessons-from-building-agentic-systems-at-amazon/
2•bpedro•1h ago

Comments

lumpilumpi•1h ago
I get the justification but I found it hard to understand how the actual evaluation at each step is carried out. For example, is there any calibration to some human gold standard involved or is the AI evaluating the AI without calibration/oversight?

Codeberg as an OIDC Provider for Tailscale (2023)

https://kennyqin.com/posts/codeberg-as-an-oidc-provider-for-tailscale/
1•arm•34s ago•0 comments

They're Made Out of Meat

https://www.mit.edu/people/dpolicar/writing/prose/text/thinkingMeat.html
1•tornikeo•1m ago•0 comments

The digital death of collecting (2021)

https://kylechayka.substack.com/p/essay-the-digital-death-of-collecting
1•robtherobber•2m ago•0 comments

Socket brings supply chain security to skills.sh

https://socket.dev/blog/socket-brings-supply-chain-security-to-skills
1•ryoidong•3m ago•0 comments

DitchingDiscord Wiki

https://wiki.alopex.li/DitchingDiscord
1•keyle•4m ago•0 comments

Andrew Mountbatten-Windsor arrested on suspicion of misconduct in public office

https://www.bbc.com/news/live/c70kjr9wjw0t
19•asdefghyk•8m ago•4 comments

12-hour days, no weekends: AI's brutal work culture is a warning for all of us

https://www.theguardian.com/technology/ng-interactive/2026/feb/17/ai-startups-work-culture-san-fr...
2•Stratoscope•8m ago•0 comments

The Programming Language Doesn't Matter So You Should Use Rust

https://tavakyan.substack.com/p/the-programming-language-doesnt-matter
2•tavakyan•11m ago•0 comments

Drizz.dev

1•drizz_dev•13m ago•0 comments

Berkshire Hathaway's website today resembles its 1997 design

https://web.archive.org/web/19970530212007/http://www.berkshirehathaway.com/
1•thewavelength•13m ago•2 comments

Drizz.dev

1•drizz_dev•15m ago•0 comments

Open Sesame – I Now Have to Ask My Internet Router to Give Me Internet

https://kryptokommun.ist/tech/2026/02/19/llm-gatekeeper-router.html
1•kryptokommunist•15m ago•1 comment

Ask HN: Why Science and philosophy are together?

2•modinfo•15m ago•0 comments

Advent of Compiler Optimisations 2025

https://www.youtube.com/playlist?list=PL2HVqYf7If8cY4wLk7JUQ2f0JXY_xMQm2
1•tosh•18m ago•0 comments

Show HN: Heroku/Fly.io-like app deployments to Cloudflare Containers

https://github.com/michaloo/flarepilot
1•michaloo•19m ago•0 comments

Zuna: A 380M-parameter foundation model for EEG signals

https://huggingface.co/Zyphra/ZUNA
1•victormustar•19m ago•1 comment

I reverse-engineered Zomato's Food Rescue real-time notification system

https://medium.com/@jatin.b.rx3/i-reverse-engineered-zomatos-food-rescue-feature-here-s-what-i-fo...
1•jatin-dot-py•19m ago•0 comments

On-the-fly code generation with OpenClaw won't fly

https://medium.com/versanova/on-the-fly-code-generation-wont-fly-0f7b02e69195
1•gauravsc•20m ago•0 comments

State of Clojure 2025 Results

https://clojure.org/news/2026/02/18/state-of-clojure-2025
1•adityaathalye•22m ago•0 comments

Permissive, then restrictive: concrete solutions and examples in Haskell (2020)

https://www.williamyaoh.com/posts/2020-05-03-permissiveness-solutions.html
1•todsacerdoti•23m ago•0 comments

AI, Entropy, and the Illusion of Convergence in Modern Software

https://www.abelenekes.com/p/when-change-becomes-cheaper-than-commitment
2•enekesabel•24m ago•1 comment

Baking the Context Cake

https://theelderscripts.com/baking-the-context-cake/
1•haarlemist•26m ago•0 comments

Signal launches version 8.0 with Signal Secure Backups

https://aboutsignal.com/news/signal-launches-version-8-0-with-signal-secure-backups/
2•mikae1•27m ago•1 comment

UK Names Antonia Romeo as First Woman to Head Civil Service

https://www.bloomberg.com/news/articles/2026-02-19/uk-names-antonia-romeo-as-first-woman-to-head-...
1•JustSkyfall•27m ago•0 comments

We don't need AI to cure cancer

https://outspeaker.com/post/12
1•onesandofgrain•30m ago•8 comments

/Deslop

https://tahigichigi.substack.com/p/12-red-flags-of-ai-writing-and-how
2•yayitswei•33m ago•0 comments

Ask HN: Since of humanity do we have made any difference in the universe?

1•modinfo•35m ago•0 comments

Oral history of Robert P. Colwell, Intel Pentium / IA32 lead architect [pdf]

https://www.sigmicro.org/media/oralhistories/colwell.pdf
1•fanf2•37m ago•0 comments

Wellington rages as litres of raw sewage pour into ocean

https://www.theguardian.com/world/2026/feb/19/wellington-raw-sewage-leak-spill-water-new-zealand
3•rguiscard•37m ago•1 comment

Bitwarden ignored serious CVEs reported 4 years ago

https://www.reddit.com/r/Bitwarden/s/LsJWCaQ6YD
1•cromka•38m ago•1 comment