news newest ask show jobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

My LLM optimization loop reward-hacked its own benchmark (and other lessons) [pdf]

https://github.com/CodeReclaimers/bishop-loop-experiment-3/blob/main/paper/paper.pdf

1•CodeReclaimers•53m ago

Comments

cold_harbor•49m ago

reward hacking = the model finding the fastest path to a high score, not the behavior you wanted. same reason RLHF reward models degrade with too many optimization steps.

CodeReclaimers•39m ago

Agreed. The wrinkle I thought was worth writing up is: there's no learned reward model here and no training at all. The "reward" is wall-clock executiion time and the model is frozen; the search is happening at inference time, not in an RL loop. So the usual "the proxy is a fuzzy approximation that degrades under optimization pressure" story doesn't apply.

This was on a ~200-line surface I thought I'd locked down, and it still got gamed in a way I might not have caught right away if it wasn't a nearly impossible run time (~45usec). So anyways...you apparently don't need a soft proxy or a lot of steps for this kind of thing to show up.

Comparisons as Predictable as the Sunrise

https://pudding.cool/2026/05/similes/

1•surprisetalk•1m ago•0 comments

Pixar's 22 Rules of Storytelling (2013)

https://www.aerogrammestudio.com/2013/03/07/pixars-22-rules-of-storytelling/

1•downbad_•2m ago•0 comments

Less is exponentially more (2012)

https://commandcenter.blogspot.com/2012/06/less-is-exponentially-more.html

1•downbad_•3m ago•0 comments

Launch HN: Chert (YC P26) – Twilio for iMessage

https://www.trychert.com

2•garygao•4m ago•0 comments

Artemis II Reference Guide [pdf]

https://www.nasa.gov/wp-content/uploads/2025/12/sls-5558-artemis-ii-sls-reference-guide.pdf

1•fortran77•5m ago•0 comments

New OpenAI Phone

https://www.msn.com/en-us/news/technology/openai-is-reportedly-making-its-own-phone-here-s-what-t...

1•gibbsrich•7m ago•0 comments

Seed Oils as a Hypothesized Contributor to Heart Disease: A Narrative Synthesis

https://pmc.ncbi.nlm.nih.gov/articles/PMC12923254/

1•bilsbie•9m ago•0 comments

Pope Apologizes for Slavery

https://apnews.com/article/pope-apologizes-slavery-role-holy-see-vatican-78df993c5604eb098b19f255...

1•dzonga•9m ago•0 comments

PaintFE: Open-source raster image editor built in Rust. Single portable binary

https://github.com/kylejckson/PaintFE

2•ksec•9m ago•0 comments

Robotaxis need to be tested in real traffic

https://www.ft.com/content/75af4d22-03ba-4fcc-b269-0ffee042d8cc

2•1vuio0pswjnm7•11m ago•0 comments

Where Is the JVM Tax?

https://semyonsinchenko.github.io/ssinchenko/post/jvm-tax/

2•Malp•11m ago•0 comments

Jaeger hit 8.6× compression on 10M spans with ClickHouse

https://thenewstack.io/jaeger-clickhouse-storage-backend/

1•Brajeshwar•14m ago•0 comments

10 Years of Spiffe

https://joe.dev/posts/10-years-of-spiffe/

1•mooreds•14m ago•0 comments

Zero-Player Games

https://alexanderbjoy.com/on-zero-player-games/

1•surprisetalk•16m ago•0 comments

Distributing LLM Inference in DwarfStar

https://antirez.com/news/167

1•surprisetalk•16m ago•0 comments

Building a Host-Tuned GCC to Make GCC Compile Faster

https://peter0x44.github.io/posts/super-gcc/

1•signa11•16m ago•0 comments

Open-Source Crypto Mining GUI for M3 / M4 / M5 Macs

https://github.com/shiftingeden/kawpow-mac

1•shiftingeden•16m ago•0 comments

A Free AI SEO Tool That Audits Any Website in Seconds

https://github.com/kian9375/seoclaw-by-kb-software

1•kb5•17m ago•2 comments

Companies pay bots 63B per year, while they try to reach humans

https://www.nexertise.com/founding

2•tzofit•17m ago•3 comments

Credential Brokering for AI Agents Explained

https://infisical.com/blog/credential-brokering-for-ai-agents

2•FinnLobsien•17m ago•0 comments

Dark proteome yields 1,785 new microproteins that could reshape disease research

https://phys.org/news/2026-05-dark-proteome-yields-microproteins-reshape.html

1•janandonly•17m ago•0 comments

Linus Torvalds Is Unhappy About the AI Influence in Linux Kernel Development

https://ostechnix.com/linus-torvalds-ai-influence-linux-kernel-development/

3•chungy•18m ago•0 comments

LLM-Insights, local demo for people comments and ideas

https://github.com/yuvhaim-gif/LLM_InSight

1•yuvalhaim•18m ago•0 comments

New Shai-Hulud malware wave compromises 600 NPM packages

https://itnerd.blog/2026/05/19/new-shai-hulud-malware-wave-compromises-600-npm-packages/

3•mooreds•19m ago•1 comments

The Stunt Pilot Hunting Russian Drones

https://www.newyorker.com/magazine/2026/06/01/the-stunt-pilot-hunting-russian-drones

1•fortran77•21m ago•0 comments

A trial user told us our ERP workflow felt disconnected. They were right

https://www.paxerp.com/blog/user-feedback

1•robeym•21m ago•0 comments

Windows and Linux: A Tale of Two Kernels (2004) [video]

https://www.youtube.com/watch?v=HdV9QuvgS_w

1•aragonite•22m ago•0 comments

PlainMarkdown v1.6: Clean conversion on heavy sites, AI actions and Exports

https://plainmarkdown.com

1•p_bits•25m ago•1 comments

The Ask

https://randsinrepose.com/archives/the-ask/

1•digitallogic•25m ago•0 comments

Build with Modern Web Guidance

https://developer.chrome.com/docs/modern-web-guidance

1•spking•25m ago•0 comments