frontpage.

Made with ♥ by @iamnishanth

Framework selling first GPU-upgradable laptop, with Nvidia's blessing

https://www.theverge.com/laptops/765528/framework-is-now-selling-the-first-gaming-laptop-that-let...
1•bsimpson•14s ago•0 comments

15-Fold increase in solar thermoelectric generator performance

https://idp.nature.com/authorize?response_type=cookie&client_id=grover&redirect_uri=https%3A%2F%2...
1•bookofjoe•15s ago•0 comments

Ask HN: Has anyone else used online communities of this specific archetype?

1•Use•50s ago•0 comments

Make and SQL: An old new way for Data Science workloads

https://vasvir.wordpress.com/2025/08/26/make-sql-an-old-new-way-for-data-science-workloads/
1•vasvir•1m ago•0 comments

Why America Still Needs Punk Rock

https://www.currentaffairs.org/news/why-america-still-needs-punk-rock
1•XzetaU8•1m ago•0 comments

More on Seed Phrase Words

https://www.johndcook.com/blog/2025/08/26/seed-phrase-words-2/
1•ibobev•1m ago•0 comments

Apple Event on September 9: 'Awe Dropping'

https://www.macrumors.com/2025/08/26/apple-september-2025-event/
1•Bogdanp•2m ago•0 comments

Novelty Is the Secret Ingredient to Product Success, Thriving Teams, Happiness

https://spin.atomicobject.com/novelty-secret-ingredient/
1•philk10•5m ago•0 comments

Show HN: Enterprise MCP Bridge – Solving the MCP Chaos for IT

https://blog.inxm.ai/p/enterprise-it-cant-afford-mcp-chaosheres
2•raelmiu•7m ago•0 comments

Principles of great DX for data infrastructure

https://clickhouse.com/blog/eight-principles-of-great-developer-experience-for-data-infrastructure
1•craneca0•10m ago•0 comments

Delta Lake: Transform Pandas Prototypes into Production

https://codecut.ai/from-pandas-to-production-delta-rs/
1•Ben5554•10m ago•0 comments

Google says China-linked cyber operations targeted Southeast Asia diplomats

https://www.cnn.com/2025/08/26/business/google-china-linked-hacking-southeast-asia-diplomats-intl...
1•mooreds•11m ago•0 comments

Intel and the Foundry State of Play

https://d2d.substack.com/p/d2d-contd-intel-and-the-foundry-state
1•mooreds•12m ago•0 comments

Titles Matter

https://joshcollinsworth.com/blog/titles-matter
2•speckx•13m ago•0 comments

What It Means to Choose Life

https://www.nytimes.com/2025/08/24/opinion/assisted-suicide-canada-orchid-embryos.html
1•whack•15m ago•0 comments

Tomorrow's Growth Starts with Today's

https://adia.substack.com/p/tomorrows-growth-starts-with-todays
1•jemiluv8•15m ago•1 comment

Anthropic Settles Copyright Lawsuit

https://www.courtlistener.com/docket/70991505/26/bartz-et-al-v-anthropic-pbc/
1•miohtama•15m ago•0 comments

Type Inference for Plain Data

https://www.haskellforall.com/2025/08/type-inference-for-plain-data.html
1•fanf2•16m ago•0 comments

Show HN: My Financial Pal – Free AI-Powered Personal Financial Planner

https://my-financial-pal-baf4b5e07c1c.herokuapp.com/
1•shormigo•17m ago•1 comment

Michigan Supreme Court: Unrestricted Phone Searches Violate Fourth Amendment

https://reclaimthenet.org/michigan-supreme-court-rules-phone-search-warrants-must-be-specific
15•mikece•21m ago•3 comments

Understanding Neural Networks, Visually

https://visualrambling.space/neural-network/
1•LordNibbler•22m ago•0 comments

SuperNICs Explained and Compared to DPUs

https://www.technetbooks.com/2025/08/supernics-network-accelerator-for.html
1•tanelpoder•23m ago•0 comments

Britain's datacentre boom promises growth, but Ireland's grid crisis shows the costs

https://nearlyright.com/britains-data-centre-boom-promises-growth-but-irelands-grid-crisis-shows-...
3•indigodaddy•23m ago•0 comments

Squarespace Is Down

https://status.squarespace.com
2•gkolli•24m ago•0 comments

Detecting colorectal cancer with gut bacteria and AI

https://www.rts.ch/info/sciences-tech/2025/article/une-ia-detecte-90-des-cas-de-cancer-colorectal...
2•speckx•24m ago•0 comments

Serviz: Command Object Interface for Ruby

https://github.com/markets/serviz
1•thunderbong•25m ago•0 comments

Google Gemini's AI image model gets a 'bananas' upgrade

https://techcrunch.com/2025/08/26/google-geminis-ai-image-model-gets-a-bananas-upgrade/
1•breadwinner•26m ago•1 comment

Unstract: Open-source platform to ship document extraction APIs in minutes

https://github.com/Zipstack/unstract
1•naren87•26m ago•0 comments

ElixirForum Problem

2•rixilexhp•26m ago•2 comments

Agentic RAG and Context Engineering for Agents

https://www.vincirufus.com/posts/agentic-rag-context-engineering/
2•vincirufus•29m ago•0 comments

I reverse-engineered a bug in my PPO agent that gave it a 9x performance boost

https://theprincipledagent.com/2025/08/26/forensic-rl-investigating-a-surprisingly-successful-bug-breakout-baseline-5/
1•wmaxlees•2h ago

Comments

wmaxlees•2h ago
Hi HN, author here.

In my last post, I found a critical bug in my PPO agent. Fixing it was the "right" thing to do, but it tanked my agent's performance from a score of 84 all the way down to 9.

This post is the forensic investigation into why that bug was so helpful. I started with a simple hypothesis that it was just adding random noise for exploration, which turned out to be partially correct but didn't explain the whole story.

The real "secret sauce" was that the bug was adding correlated noise, creating a consistent optimistic or pessimistic bias for an entire trajectory. I managed to reverse-engineer this effect into a new, principled technique that successfully reproduced the 84 score.

The post is the full deep dive, from visualizing the original bug's signal to designing a new form of state-dependent exploration from scratch. Happy to answer any questions about the process or the JAX/Flax implementation.
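
For readers who want a concrete picture of the distinction drawn above between plain random noise and correlated noise, here is a minimal sketch in plain Python (not the JAX/Flax code from the post). It contrasts one independent Gaussian draw per step with a single draw shared by every step of a trajectory. The function names, the noise scale, the 128-step length, and the choice to add the noise to per-step advantage estimates are all assumptions made for illustration; the comment does not say where the bug's noise entered the PPO update, and this is not the state-dependent technique the post develops.

```python
import random


def independent_noise(num_steps, scale, rng):
    """One fresh Gaussian draw per step, so signs vary within a trajectory."""
    return [rng.gauss(0.0, scale) for _ in range(num_steps)]


def trajectory_noise(num_steps, scale, rng):
    """One Gaussian draw per trajectory, shared by every step (correlated)."""
    offset = rng.gauss(0.0, scale)
    return [offset] * num_steps


def perturb(values, noise):
    """Add a noise sequence element-wise to a sequence of estimates."""
    return [v + n for v, n in zip(values, noise)]


rng = random.Random(0)
num_steps = 128                    # hypothetical trajectory length
advantages = [0.0] * num_steps     # placeholder estimates, assumed for the demo

iid = perturb(advantages, independent_noise(num_steps, 1.0, rng))
shared = perturb(advantages, trajectory_noise(num_steps, 1.0, rng))

# Independent noise mixes positive and negative offsets within one trajectory.
print("independent: distinct offsets =", len(set(iid)))

# Shared noise shifts every step the same way: the whole trajectory is
# consistently optimistic (offset > 0) or pessimistic (offset < 0).
print("shared: distinct offsets =", len(set(shared)))
print("shared: trajectory is", "optimistic" if shared[0] > 0 else "pessimistic")
```

With independent draws the per-step errors tend to cancel over a trajectory, while the shared draw tilts every step in the same direction, which is the "consistent optimistic or pessimistic bias" described above.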