frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Show HN: Benchmax, a new open-source RL environment framework for LLM finetuning

https://github.com/cgftinc/benchmax
1•kumama•13h ago
Hello HN!

I’ve been working on `benchmax`, a open-source framework for building, running, and parallelizing environments, to fine-tune LLMs with reinforcement learning.

What I wanted to solve for:

- Environments are tightly coupled with RL trainers, leading to fragmentation and limited compatibility.

- These coupled environments are tend to be mostly competitive math and coding → for OSS RL + LLMs to scale, we need more complex, real-world environments.

- Scaling these environments in parallel is still not easily possible

What I'm excited about:

- benchmax is training framework agnostic with adapters already built out for verl and verifiers. we’re gonna build more adapters for other frameworks (e.g. SkyRL, etc.), instead of forcing others to adopt our standard (though ofc they’re welcome to )

- benchmax comes with a few interesting environments out of the box: spreadsheet processing, CRM, etc. → more coming soon!

- benchmax supports MCP as a first class citizen. there has been an explosion of MCP servers/tools built out for usecases ranging from browser use to excel to game creation.`benchmax` allow folks to leverage and compose these existing MCP servers to build environments integrated with real world systems

- Multi-node environment parallelization coming soon!

If you like what you see, feel free to *star* the *repo* to support the project!! Our hope’s to really let anyone benchmax on their tasks, with benchmax

https://github.com/cgftinc/benchmax

It’s still very early! And I expect to be shipping a lot more things → more environments, more trainer integrations. Would love y’all’s thoughts what environments and trainer integrations could be interesting!

How Do You Handle Branching for Database GitOps?

https://harness.io/blog/gitops-branching-for-database-devops
1•sonichigo•45s ago•1 comments

Analysis Shows Competitive LCOE Target for Small Modular Reactors

https://www.nucnet.org/news/analysis-shows-competitive-lcoe-target-for-small-modular-reactors-7-3-2025
1•mpweiher•5m ago•0 comments

Show HN: I built a free backlink exchange marketplace

https://launchigniter.com/link-exchange
1•maulikdhameliya•6m ago•0 comments

Bitchat Mesh

https://apps.apple.com/us/app/bitchat-mesh/id6748219622
2•doener•8m ago•0 comments

Apisix Integration with AI/ML API

https://apisix.apache.org/blog/2025/07/29/announcing-integration-of-apisix-and-ai-ml-api/
1•Yilialinn•9m ago•0 comments

Automatic A2A Service Discovery in Kubernetes with Inference Gateway

https://github.com/inference-gateway/inference-gateway/tree/main/examples/kubernetes/a2a
1•edenr•9m ago•1 comments

The Online Safety Act for forum and blog owners

https://successfulsoftware.net/2025/07/29/the-online-safety-act-for-forum-owners/
1•hermitcrab•10m ago•1 comments

Most Watched Software Engineering Talks Of 2025 (so far)

https://www.techtalksweekly.io/p/50-most-watched-software-engineering
3•hal918•10m ago•0 comments

Parity of Zero

https://en.wikipedia.org/wiki/Parity_of_zero
1•derdi•16m ago•2 comments

Hypercube 3d ultimate tic tac toe

https://dhkts1.github.io/ultimate-nd-tictactoe-3d/
1•dhkts1•18m ago•0 comments

Tell HN: NISAR Satellite to Launch Today

1•_448•18m ago•0 comments

New battery manufacturer with European software: GAZ Energy

https://www.ess-news.com/2025/07/28/new-battery-manufacturer-with-european-software-gaz-energy-builds-factory-in-czech-republic/
1•doener•19m ago•0 comments

Nostr Auth Provider · clerk · Discussion #6435

https://github.com/orgs/clerk/discussions/6435
2•kehiy•24m ago•0 comments

Show HN: Deno is amazing. I built a toy TUI text editor to make sure of that

https://github.com/eu-ge-ne/toy
1•eu-ge-ne•25m ago•0 comments

Happy 20th Birthday MDN

https://web.dev/blog/mdn-birthday
2•feross•27m ago•0 comments

Do LLMs Identify Fonts?

https://maxhalford.github.io/blog/llm-font-identification/
3•Lemaxoxo•27m ago•1 comments

The Torch of Terrorism (1994)

https://time.com/archive/6726261/the-torch-of-terrorism/
2•thomassmith65•29m ago•0 comments

Decoding the Chinese Computer

https://www.sixthtone.com/news/1017405
2•sohkamyung•29m ago•0 comments

YouTube to be included in Australia's teen social media ban

https://www.bbc.com/news/articles/cpv0zkxx0njo
2•nojs•29m ago•0 comments

The chaos and confusion of itch.io and Steam's abrupt adult game ban

https://www.theverge.com/games/715299/itchio-games-delisting-payment-processor-paypal
2•isaacfrond•32m ago•0 comments

Intra-procedural lifetime and borrowing analysis in Clang

https://discourse.llvm.org/t/rfc-intra-procedural-lifetime-analysis-in-clang/86291
2•fanf2•32m ago•0 comments

Dead Internet Theory becomes more real – Now anyone can start botting easily

https://twitter.com/ArtusVranken/status/1950476396033175721
2•reeeeee•34m ago•1 comments

Seriously, Why Do Some AI Chatbot Subscriptions Cost More Than $200?

https://www.wired.com/story/seriously-why-do-some-ai-chatbot-subscriptions-cost-more-than-200/
9•isaacfrond•37m ago•1 comments

Show HN: I built a local AI assistant as a browser extension (zero cloud)

https://github.com/NativeMindBrowser/NativeMindExtension
3•kaylakay•38m ago•0 comments

Sleep all comes down to the mitochondria

https://www.science.org/content/blog-post/it-all-comes-down-mitochondria
2•A_D_E_P_T•40m ago•1 comments

Nvidia CEO Jensen Huang Sells $27.6M in Stock over Five Days

https://techgraph.co/stock-market/nvidia-ceo-jensen-huang-sells-27-6-million-in-stock-over-five-days/
3•visitednews•48m ago•1 comments

Show HN: Bear.Share – Turn any webpage into beautiful sharing cards

https://chromewebstore.google.com/detail/bearshare-web-sharing-car/njgbcdlfpmkgbdkiagganmdgmkfegidh
1•BearBest•51m ago•0 comments

Oscar-Winning 'No Other Land' Awdah Hathaleen Killed by Israeli Settler

https://www.latimes.com/entertainment-arts/story/2025-07-29/awdah-hathaleen-killed-no-other-land-palestinian-activist-israeli-settler
36•_shadi•55m ago•6 comments

AWS Introduces Vector Capabilities on Amazon S3

https://www.infoq.com/news/2025/07/aws-s3-vectors/
2•NomDePlum•57m ago•0 comments

Show HN: RentUp – I built a rent manager for my parents

https://play.google.com/store/apps/details?id=ai.sach.rentup&hl=en_US
1•Sachinrao•58m ago•0 comments