Anthropic: Demystifying Evals for AI Agents
https://www.anthropic.com/engineering/demystifying-evals-for-ai-agents
4
•
Bayram
•
8h ago
Comments
dangelosaurus
•
7h ago
I work on Promptfoo (an open-source eval framework). Appreciate the mention here. This post captures a lot of the hard lessons around agent evals. In particular, task ambiguity and brittle graders are things we run into constantly.
What I learned building an options portfolio tracker for retail traders
1
•
optioneer
•
43s ago
•
0 comments
Show HN: A word puzzle where you rearrange words to form a semantic loop
https://puzzles.madebynathan.com/chains?date=2026-01-11
1
•
nathan_f77
•
2m ago
•
0 comments
Show HN: Llama 4 Maverick explores Japantown in Watch Dogs 2
https://www.youtube.com/watch?v=2NUUNVvzuD0
1
•
dandelionv1bes
•
3m ago
•
0 comments
Show HN: A WebSite Space – A collection of 100 fun and weird links
https://www.awebsite.space/
1
•
AliGalip1545
•
3m ago
•
0 comments
Below the Surface: Archeological Finds from the Amsterdam Noord/Zuid Metro Line
https://belowthesurface.amsterdam/en/vondsten
1
•
stefanvdw1
•
6m ago
•
0 comments
A City on Mars
https://en.wikipedia.org/wiki/A_City_on_Mars
2
•
ColinWright
•
6m ago
•
0 comments
Show HN: Instagram Saved Posts Downloader
https://chromewebstore.google.com/detail/instagram-saved-posts-dow/alkgglonfjgmdolnjafmbmldmmalhdoi
1
•
qwikhost
•
9m ago
•
0 comments
Show HN: I made a livekit-powered video chat game with AI in 5 hours
https://president.alephz.com/
1
•
puppion
•
15m ago
•
0 comments
Ask HN: Are you having existential questions about being a software engineer?
2
•
ronbenton
•
15m ago
•
0 comments
Stan Tames Autoregressive Nonsense
https://twitter.com/karmaniverous/status/2010351714055123069
1
•
karmaniverous
•
16m ago
•
0 comments
GitHub PR Challenge
https://memu.pro/hackathon/rules
1
•
k_kiki
•
18m ago
•
1 comment
Show HN: Atom – The Open Source AI Workforce and Multi-Agent Orchestrator
https://github.com/rush86999/atom
2
•
rush86999
•
20m ago
•
0 comments
AI Psychosis, AI Apotheosis
https://www.oblomovka.com/wp/2026/01/07/ai-psychosis-ai-apotheosis/
2
•
baxtr
•
23m ago
•
0 comments
Why Finding Motivation Is Often Such a Struggle
https://nautil.us/why-finding-motivation-is-often-such-a-struggle-1260605/
2
•
Brajeshwar
•
23m ago
•
0 comments
Covid lockdowns changed the beak shape of city birds
https://newatlas.com/biology/lockdowns-beak-shape-birds/
2
•
Brajeshwar
•
23m ago
•
0 comments
Free Printable Coloring Pages
https://coloringbook.im/
1
•
Evan233
•
23m ago
•
1 comment
Non-Traditional Profiling
https://www.mgaudet.ca/technical/2026/1/8/non-traditional-profiling
1
•
lumpa
•
24m ago
•
0 comments
Ask HN: One-Shot or Iterate?
1
•
indigodaddy
•
24m ago
•
0 comments
Why does AI suck at making clocks?
https://www.popsci.com/technology/ai-making-clocks/
1
•
Brajeshwar
•
25m ago
•
0 comments
Praxis (Proposed City)
https://en.wikipedia.org/wiki/Praxis_(proposed_city)
1
•
gehwartzen
•
28m ago
•
0 comments
Implementing a Tiny CPU Rasterizer
https://lisyarus.github.io/blog/posts/implementing-a-tiny-cpu-rasterizer-part-1.html
2
•
todsacerdoti
•
30m ago
•
0 comments
China applies to put 200K satellites in space after calling Starlink crash risk
https://www.scmp.com/news/china/science/article/3339493/china-applies-put-200000-satellites-space...
2
•
nkurz
•
33m ago
•
1 comment
U.S. releases new dietary guidelines [video]
https://www.youtube.com/watch?v=dlQOpR7CAIU
1
•
mgh2
•
33m ago
•
0 comments
A History of Disbelief in Large Language Models
https://shadowcodebase.substack.com/p/the-shifting-skepticisms-in-ai
1
•
kevin42
•
34m ago
•
1 comment
Show HN: A mini paged-KV and prefix-cache scheduler (learning inference engine)
https://github.com/tyfeng1997/tailor
1
•
bofeng1997
•
42m ago
•
0 comments
AI Skills Marketplace: A New Digital Economy?
https://vibeandscribe.xyz/posts/2026-01-11-skills-marketplace.html
1
•
ryanthedev
•
43m ago
•
0 comments
The Largest Protest in Human History: The Baltic Way
https://en.wikipedia.org/wiki/Baltic_Way
3
•
ViktorRay
•
43m ago
•
0 comments
Writing a Work Log
https://fredrikmeyer.net/2026/01/11/work-log.html
1
•
FredrikMeyer
•
44m ago
•
0 comments
One Thousand Words
https://drewmayo.com/1000-words/
1
•
pabs3
•
45m ago
•
0 comments
Happy 50th Birthday KIM-1
https://github.com/netzherpes/KIM1-Demo
6
•
JKCalhoun
•
46m ago
•
0 comments