frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Business Insider Deletes Dozens of Articles After AI Scam Probe

https://www.thedailybeast.com/business-insider-deletes-dozens-of-articles-after-ai-scam-probe/
1•c420•1m ago•0 comments

Neovim as a Terminal Multiplexer and Neovide as a Terminal Emulator

https://loosh.ch/blog/neovidenal
1•looshch•2m ago•1 comments

Trump to Rename Department of Defense to Department of War

https://apnews.com/article/trump-department-of-war-8cb6fe42fee4128f5a8fcab3f49e6e41
1•ratdoctor•2m ago•0 comments

AI Induced Psychosis: A shallow investigation

https://www.lesswrong.com/posts/iGF7YcnQkEbwvYLPA/ai-induced-psychosis-a-shallow-investigation
1•giuliomagnifico•4m ago•0 comments

Show HN: PlanEat AI, weekly meal planner with smart shopping lists

https://apps.apple.com/ua/app/planeat-ai-meal-planner/id6745792604
2•torianyk•6m ago•0 comments

Ryan Holiday: There's No Such Thing as 'Quality' Time

https://ryanholiday.net/quality-time/
1•skadamat•7m ago•0 comments

The Math of Shuffling Cards

https://www.scientificamerican.com/article/how-the-math-of-shuffling-cards-almost-brought-down-an...
2•indigodaddy•11m ago•0 comments

Let us Git rid of it, angry GitHub users say of forced Copilot features

https://www.theregister.com/2025/09/05/github_copilot_complaints/
2•sohkamyung•13m ago•0 comments

Google Weather Lab: Cyclones

https://deepmind.google.com/science/weatherlab
1•sylvainkalache•13m ago•0 comments

"Democrat" is not an adjective

https://lee-phillips.org/democrat/
2•leephillips•14m ago•1 comments

Prompt injection as a solution to make agents reliable

https://www.controlpla.in/blog/prompt-injection-to-make-agents-reliable
1•HardikVala•15m ago•0 comments

So you want to control flow in PyTorch 2

https://blog.ezyang.com/2025/09/so-you-want-to-control-flow-in-pt2/
1•matt_d•15m ago•0 comments

Dave Richards – City of Largo Work Blog (2016)

http://davelargo.blogspot.com/
1•indigodaddy•18m ago•0 comments

Some French publishers are giving AI revenue directly to journalists

https://www.niemanlab.org/2025/09/in-france-ai-revenue-is-going-directly-to-journalists-could-tha...
2•giuliomagnifico•21m ago•0 comments

The Benefit of the Doubt

https://whatwelost.substack.com/p/the-benefit-of-the-doubt
2•nativeit•23m ago•0 comments

Show HN: Demo of My Hyprland Setup

https://tube.jeena.net/w/2EpbXJnMrDokc3362oXSTQ
1•jeena•27m ago•0 comments

How to Build an Agent (Or: The Emperor Has No Clothes)

https://ampcode.com/how-to-build-an-agent
1•sqs•30m ago•0 comments

Ask HN: Freelancers, what features matter most in invoicing software?

1•ahmadhamza19•33m ago•0 comments

Supercharger for Business – Tesla

https://www.tesla.com/supercharger-for-business
2•bilsbie•34m ago•0 comments

Ben-Hur on a Computer Screen

https://daviramos.com/ben-hur-on-a-computer-screen/
1•bananamerica•35m ago•2 comments

The Universe Within 12.5 Light Years

http://www.atlasoftheuniverse.com/12lys.html
13•algorithmista•41m ago•2 comments

Federal Trade Commission Files to Accede to Vacatur of Non-Compete Clause Rule

https://www.ftc.gov/news-events/news/press-releases/2025/09/federal-trade-commission-files-accede...
3•moonka•42m ago•2 comments

Coordinated Thermal and Electrical Balancing for Lithium-Ion Cells

https://www.mdpi.com/1996-1073/18/16/4231
1•PaulHoule•42m ago•0 comments

Logic Theorist

https://en.wikipedia.org/wiki/Logic_Theorist
5•geox•42m ago•0 comments

What to do about "mirror life"?

https://www.foreignaffairs.com/world/what-do-about-mirror-life
3•hhs•43m ago•0 comments

Blockbench: A low-poly 3D model editor

https://www.blockbench.net/
1•marcodiego•44m ago•0 comments

Show HN: Desk clock with rotating Earth showing real-time day/night cycles

https://www.atelierludo.com/
1•lludo•44m ago•0 comments

Hosting SQLite databases on GitHub Pages (2021)

https://phiresky.github.io/blog/2021/hosting-sqlite-databases-on-github-pages/
1•whatisabcdefgh•45m ago•0 comments

Quantum Mechanics, Concise Book

https://github.com/basketballguy999/Quantum-Mechanics-Concise-Book
4•pykello•53m ago•0 comments

Money for Nothing, Chips for Free

https://phrack.org/issues/72/14_md#article
2•rmason•53m ago•2 comments
Open in hackernews

When LLMs Grow Hands and Feet, How to Design Our Agentic RL Systems?

https://amberljc.github.io/blog/2025-09-05-agentic-rl-systems.html
3•amberjcjj•2h ago

Comments

amberjcjj•2h ago
Lately I’ve been building AI agents for scientific research. In addition to build better agent scaffold, to make AI agents truly useful, LLMs need to do more than just think—they need to use tools, run code, and interact with complex environments. That’s why we need Agentic RL.

While working on this, I notice the underlying RL systems must evolve to support these new capabilities. So, I wrote a blog post to capture my thoughts and lessons learned.

“When LLMs Grow Hands and Feet, How to Design our Agentic RL Systems?”

TL;DR: The frontier of AI is moving from simple-response generation to solving complex, multi-step problems through agents. Previous RL frameworks for LLMs aren’t built for this—they struggle with the heavy, heterogeneous resource demands that agents need, like isolated environments or tool interactions.

In the blog, I cover:

How RL for LLM-based agents differs from traditional RL for LLM.

The critical system challenges when scaling agentic RL.

Emerging solutions top labs and companies are using

If you’re interested in agentic intelligence—LLMs that don’t just think but act—I go into the nuts and bolts of what it takes to make this work in practice.