Evaluating AI agents: Real-world lessons from building agentic systems at Amazon
https://aws.amazon.com/blogs/machine-learning/evaluating-ai-agents-real-world-lessons-from-building-agentic-systems-at-amazon/
2
•
bpedro
•
1h ago
Comments
lumpilumpi
•
1h ago
I get the justification but I found it hard to understand how the actual evaluation at each step is carried out. For example, is there any calibration to some human gold standard involved or is the AI evaluating the AI without calibration/oversight?
Codeberg as an OIDC Provider for Tailscale (2023)
https://kennyqin.com/posts/codeberg-as-an-oidc-provider-for-tailscale/
1
•
arm
•
34s ago
•
0 comments
They're Made Out of Meat
https://www.mit.edu/people/dpolicar/writing/prose/text/thinkingMeat.html
1
•
tornikeo
•
1m ago
•
0 comments
The digital death of collecting (2021)
https://kylechayka.substack.com/p/essay-the-digital-death-of-collecting
1
•
robtherobber
•
2m ago
•
0 comments
Socket brings supply chain security to skills.sh
https://socket.dev/blog/socket-brings-supply-chain-security-to-skills
1
•
ryoidong
•
3m ago
•
0 comments
DitchingDiscord Wiki
https://wiki.alopex.li/DitchingDiscord
1
•
keyle
•
4m ago
•
0 comments
Andrew Mountbatten-Windsor arrested on suspicion of misconduct in public office
https://www.bbc.com/news/live/c70kjr9wjw0t
19
•
asdefghyk
•
8m ago
•
4 comments
12-hour days, no weekends: AI's brutal work culture is a warning for all of us
https://www.theguardian.com/technology/ng-interactive/2026/feb/17/ai-startups-work-culture-san-fr...
2
•
Stratoscope
•
8m ago
•
0 comments
The Programming Language Doesn't Matter So You Should Use Rust
https://tavakyan.substack.com/p/the-programming-language-doesnt-matter
2
•
tavakyan
•
11m ago
•
0 comments
Drizz.dev
1
•
drizz_dev
•
13m ago
•
0 comments
Berkshire Hathaway's website today resembles its 1997 design
https://web.archive.org/web/19970530212007/http://www.berkshirehathaway.com/
1
•
thewavelength
•
13m ago
•
2 comments
Drizz.dev
1
•
drizz_dev
•
15m ago
•
0 comments
Open Sesame – I Now Have to Ask My Internet Router to Give Me Internet
https://kryptokommun.ist/tech/2026/02/19/llm-gatekeeper-router.html
1
•
kryptokommunist
•
15m ago
•
1 comment
Ask HN: Why Science and philosophy are together?
2
•
modinfo
•
15m ago
•
0 comments
Advent of Compiler Optimisations 2025
https://www.youtube.com/playlist?list=PL2HVqYf7If8cY4wLk7JUQ2f0JXY_xMQm2
1
•
tosh
•
18m ago
•
0 comments
Show HN: Heroku/Fly.io-like app deployments to Cloudflare Containers
https://github.com/michaloo/flarepilot
1
•
michaloo
•
19m ago
•
0 comments
Zuna: A 380M-parameter foundation model for EEG signals
https://huggingface.co/Zyphra/ZUNA
1
•
victormustar
•
19m ago
•
1 comment
I reverse-engineered Zomato's Food Rescue real-time notification system
https://medium.com/@jatin.b.rx3/i-reverse-engineered-zomatos-food-rescue-feature-here-s-what-i-fo...
1
•
jatin-dot-py
•
19m ago
•
0 comments
On-the-fly code generation with OpenClaw won't fly
https://medium.com/versanova/on-the-fly-code-generation-wont-fly-0f7b02e69195
1
•
gauravsc
•
20m ago
•
0 comments
State of Clojure 2025 Results
https://clojure.org/news/2026/02/18/state-of-clojure-2025
1
•
adityaathalye
•
22m ago
•
0 comments
Permissive, then restrictive: concrete solutions and examples in Haskell (2020)
https://www.williamyaoh.com/posts/2020-05-03-permissiveness-solutions.html
1
•
todsacerdoti
•
23m ago
•
0 comments
AI, Entropy, and the Illusion of Convergence in Modern Software
https://www.abelenekes.com/p/when-change-becomes-cheaper-than-commitment
2
•
enekesabel
•
24m ago
•
1 comment
Baking the Context Cake
https://theelderscripts.com/baking-the-context-cake/
1
•
haarlemist
•
26m ago
•
0 comments
Signal launches version 8.0 with Signal Secure Backups
https://aboutsignal.com/news/signal-launches-version-8-0-with-signal-secure-backups/
2
•
mikae1
•
27m ago
•
1 comment
UK Names Antonia Romeo as First Woman to Head Civil Service
https://www.bloomberg.com/news/articles/2026-02-19/uk-names-antonia-romeo-as-first-woman-to-head-...
1
•
JustSkyfall
•
27m ago
•
0 comments
We don't need AI to cure cancer
https://outspeaker.com/post/12
1
•
onesandofgrain
•
30m ago
•
8 comments
/Deslop
https://tahigichigi.substack.com/p/12-red-flags-of-ai-writing-and-how
2
•
yayitswei
•
33m ago
•
0 comments
Ask HN: Since of humanity do we have made any difference in the universe?
1
•
modinfo
•
35m ago
•
0 comments
Oral history of Robert P. Colwell, Intel Pentium / IA32 lead architect [pdf]
https://www.sigmicro.org/media/oralhistories/colwell.pdf
1
•
fanf2
•
37m ago
•
0 comments
Wellington rages as litres of raw sewage pour into ocean
https://www.theguardian.com/world/2026/feb/19/wellington-raw-sewage-leak-spill-water-new-zealand
3
•
rguiscard
•
37m ago
•
1 comment
Bitwarden ignored serious CVEs reported 4 years ago
https://www.reddit.com/r/Bitwarden/s/LsJWCaQ6YD
1
•
cromka
•
38m ago
•
1 comment