frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Environments Hub: Your Language Model needs better (open) environments to learn

https://huggingface.co/blog/anakin87/environments-hub
1•anakin87•2h ago

Comments

anakin87•2h ago
LLMs improve when they can practice and reason in interactive environments.

Recent work (DeepSeek-R1, GRPO) shows RL can teach models to prefer better outputs by giving rewards.

But most RL environments for LLMs are fragmented or closed. That makes it hard for the community to experiment or reproduce results.

Environments Hub is a new open platform by Prime Intellect where anyone can share RL environments for training or evaluating LLMs. Think of them as software packages: data, harness and scoring rules.

Agents today incorporate models and tools (from APIs to a terminal), so environments need to capture that complexity.

I wrote a hands-on walkthrough covering:

  - RL + LLM basics

  - Navigating the Environments Hub

  - Evaluating models and agents

  - GRPO-style training of a tiny model on an alphabetical sort task
If you want to experiment with RL for LLMs or just see how open environments can accelerate learning, this walkthrough is a practical starting point.

Scientists develop 'glue gun' that 3D prints bone grafts directly onto fractures

https://www.livescience.com/health/surgery/scientists-develop-glue-gun-that-3d-prints-bone-grafts...
1•geox•8m ago•0 comments

Start Experimenting with Neural Super Sampling for Mobile Graphics

https://community.arm.com/arm-community-blogs/b/mobile-graphics-and-gaming-blog/posts/how-to-acce...
1•PaulHoule•9m ago•0 comments

A/B testing and LLM personalization on Cloudflare Workers using Visual Editor

1•puzanov•9m ago•0 comments

Slice Merchant Services – Salesforce Admin – Onsite

https://startslice.com/careers
1•sswanson•10m ago•1 comments

Ask HN: Why hasn't Google monetized ReCAPTCHA with ads?

3•ATechGuy•10m ago•2 comments

ReActionView: A New ActionView-Compatible ERB Engine

https://reactionview.dev
1•ksec•11m ago•0 comments

Exploring Interlisp-10 and Twenex

https://journal.paoloamoroso.com/exploring-interlisp-10-and-twenex
1•naves•12m ago•0 comments

Ask HN: What Does a Social Media Analytic Want and Do?

1•bpavuk•12m ago•0 comments

USA Cycling bans transgender athletes from female categories beginning Sep. 15

https://www.cyclingweekly.com/news/usa-cycling-bans-transgender-athletes-from-female-categories-b...
1•nradov•12m ago•0 comments

Ice gRPC toolkit and Protobuf alternative

https://github.com/zeroc-ice/ice
1•just_human•14m ago•0 comments

Show HN: What to Do with an Old iPad

http://odb.ar/blog/2025/09/05/hosting-my-blog-on-an-iPad-2.html
1•owenmakes•15m ago•0 comments

Rails World 2025 Opening Keynote – David Heinemeier Hansson [video]

https://www.youtube.com/watch?v=gcwzWzC7gUA
1•foofoo4u•19m ago•0 comments

Show HN: We wrote an open-source Text to CAD app

https://github.com/Adam-CAD/CADAM
2•zachdive•19m ago•0 comments

Apple Arcade, Six Years IN

https://sixcolors.com/link/2025/09/apple-arcade-six-years-in/
1•ksec•20m ago•2 comments

Ask HN: What are the best biographies of the firm?

1•miletus•20m ago•0 comments

SQLite can handle most of it

https://binaryigor.com/sqlite-db-simple-in-process-reliable-fast.html
1•BinaryIgor•22m ago•0 comments

Value type and bits pattern in MoonBit, 30% faster than Rust

https://www.moonbitlang.com/blog/moonbit-value-type
1•dlib•22m ago•0 comments

Understanding Bazel Remote Caching

https://blogsystem5.substack.com/p/bazel-remote-caching
2•oftenwrong•24m ago•0 comments

Just in Time for the Most Overengineered Calculator (2023)

https://l-m.dev/cs/jitcalc/
2•hggh•24m ago•0 comments

GPT-5 negotiates harder and better than Opus 4.1

https://mdahardy.substack.com/p/gpt-5-negotiates-harder-and-better
1•mdahardy•25m ago•0 comments

Why Language Models Hallucinate

https://openai.com/index/why-language-models-hallucinate
1•meetpateltech•27m ago•0 comments

If AI agents take the jobs, who buys the stuff?

3•babua•28m ago•0 comments

PKM apps need to get better at resurfacing information

https://ankursethi.com/blog/pkm-apps-need-to-get-better-at-resurfacing-information/
1•abhin4v•28m ago•0 comments

Freeway guardrails are now a favorite target of thieves

https://laist.com/news/transportation/guardrails-aluminum-theft
5•jaredwiener•31m ago•0 comments

Elon Musk Could Become First Trillionaire Under New Tesla Pay Plan

https://www.nytimes.com/2025/09/05/business/elon-musk-tesla-pay-trillionaire.html
5•dctoedt•32m ago•1 comments

Cagent – customizable multi-agent runtime

https://github.com/docker/cagent
1•fossa1•32m ago•0 comments

Why Everybody Is Losing Money on AI

https://www.wheresyoured.at/why-everybody-is-losing-money-on-ai/
39•speckx•33m ago•13 comments

We No Longer Lock Premium Features

https://neon.com/blog/why-we-no-longer-lock-premium-features
1•superchink•33m ago•0 comments

Show HN: BenchWrk – Get Logs in VSCode

https://marketplace.visualstudio.com/items?itemName=Benchwrk.benchwrk
1•aliatwa•35m ago•0 comments

Show HN: I want to create a TradingView alternative, can you try it?

https://www.aulico.com
2•feedbackcolle•35m ago•0 comments