frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

Open in hackernews

Show HN: I replicated GRPO and made it one-click runnable on HPC-AI.com

https://hpc-ai.com
20•cheerGPU•3h ago
Hi HN,

I’m excited to share RUNRL JOB, our new one-click service for running Reinforcement Learning Fine-Tuning (RFT) workloads—think GRPO, PPO, or any custom reward-based tuning—directly on HPC-AI.com.

What It Is Pre-wired RFT pipeline: dual-network configs, memory optimizations, logging, and reward modules are all set up for you.

Model support: demos with Qwen-3B and Qwen-1.5 out of the box; drop in your own model if you like.

Cost & performance transparency: real-hardware benchmarks on 8× H100/H200, with live metrics in TensorBoard and built-in cost tracking.

Why It Matters Memory-efficient GRPO: up to 40% memory savings vs PPO—no separate value network or double backward pass.

Zero setup: no Dockerfiles, no dependency hell—just click “Start” and your training job spins up.

Accessible RLHF: lowers the barrier for researchers, students, and indie hackers to experiment at scale.

How to Try Visit the blog post: https://hpc-ai.com/blog/RUNRL_JOB_is_live_on_hpc-ai

Click “Launch GPU Instances”, choose H100 or H200.

Select the RUNRL JOB template and hit “Start Job”.

Monitor progress live in JupyterLab or via TensorBoard—zero extra setup.

Comments

icemount•3h ago
Is it free?
cheerGPU•3h ago
Sign up today to claim $6 credit!
adamfly•3h ago
You are the GOAT GPU Cloud
cheerGPU•3h ago
If you're training big models or running GRPO at scale, we’re here to make it fast, affordable, and hassle-free. Let me know if you ever need a trial code or want to spin something up — HPC-AI.COM's got you covered!
thisisaacc•3h ago
It's interesting, most clouds can only provide SFT, not the latest RFT.
cheerGPU•3h ago
Would love any feedback if you give it a try!

Show HN: I made a HARO (Help A Reporter Out) matcher using vector similarity

https://haro.today/
1•erol444•1m ago•0 comments

Can Playwright MCP generate reliable tests? [video]

https://www.youtube.com/watch?v=MIlcVo1x3Is
2•tnolet•2m ago•0 comments

Management Fundamentals for the Modern Leader

https://maven.com/kellyvaughn/engineering-management
2•mooreds•4m ago•0 comments

Shop cleared of discrimination over €68 payment in coins

https://www.rte.ie/news/business/2025/0620/1519549-shop-cleared-of-discrimination-over-68-payment-in-coins/
1•austinallegro•4m ago•0 comments

Historical Tech Tree

https://www.historicaltechtree.com/
1•Luc•4m ago•0 comments

Gov. Greg Abbott vetoes THC ban

https://www.texastribune.org/2025/06/22/texas-thc-ban-bill-greg-abbott-veto-senate-bill-3/
4•DocFeind•11m ago•0 comments

3min Quick Survey on Your Thoughts on Virtual Pet

https://wss.pollfish.com/link/6fdbe13c-264f-4a70-9126-622b5f861ad3
1•Klwy•11m ago•0 comments

Microsoft Build 2025 – agents, models, GitHub, and beast mode Windows

https://redmonk.com/jgovernor/2025/06/20/microsoft-build-2025-agents-models-github-and-beast-mode-windows/
1•mooreds•11m ago•0 comments

IPFire – The Open Source Firewall – Adds Support for WireGuard

https://www.ipfire.org/blog/ipfire-2-29-core-update-195-released-wireguard-inside
2•mstremer•12m ago•0 comments

The first CNN for playing Scrabble – a game of imperfect information

https://www.cesardelsolar.com/posts/2025-06-21-nn-scrabble/
1•cdelsolar•12m ago•0 comments

Vultr Raises over $300M in Debt

https://www.cnbc.com/2025/06/23/vultr-raises-300-million-in-debt-from-bank-of-america-citi-goldman.html
1•mfiguiere•14m ago•0 comments

Xunit.v3, Testcontainers, and .NET

https://azan-n.com/projects/2025-01-25t112215212z/
1•azan-n•14m ago•1 comments

Cannabis use disorder may increase risk for certain psychiatric illnesses

https://medicalxpress.com/news/2025-06-cannabis-disorder-psychiatric-illnesses.html
2•PaulHoule•14m ago•0 comments

A CNN from scratch in C++/Vulkan (no ML/math libs) – A detailed guide

https://deadbeef.io/cnn_from_scratch
1•rjinman•14m ago•1 comments

A CX Leaders Guide to Embracing AI

https://cba-gbl.com/cx-leaders-guide-to-embracing-ai/
1•athousandsteps•14m ago•0 comments

Astrid: Personal Shopping Agent for Fashion

https://www.astridstyle.com
1•kylerush•17m ago•0 comments

Balancing Security and Fair Competition

https://open-web-advocacy.org/blog/balancing-security-and-fair-competition/
1•pentagrama•18m ago•0 comments

AI at the Edge: How Red Hat Is Powering Smarter Factories

https://gazeon.site/ai-at-the-edge-how-red-hat-is-powering-smarter-factories/
1•eligrid•18m ago•0 comments

Claude Code Best Practices

https://tylerburnam.medium.com/how-i-use-claude-code-c73e5bfcc309
2•tylerburnam•21m ago•0 comments

Run.sh – Task organisation for dev projects, based on a pure shell script

https://run.jotaen.net/
1•HunOL•22m ago•0 comments

Billiard Fractals from floor(k·√2) mod 2 – visualizing symbolic sequences

https://github.com/xcontcom/billiard-fractals
1•xcontcom•23m ago•1 comments

Ask HN: What is the scrappiest thing you've heard of that drove startup success?

1•randerson001•23m ago•0 comments

Modeling the World in 280 Characters

https://www.xordev.com/
1•lovegrenoble•24m ago•0 comments

What do professional so ware developers need to know to succeed in an age of AI?

https://arxiv.org/abs/2506.00202
1•azhenley•24m ago•0 comments

iPadOS 26 Local Capture Feature Solves iPad's Podcasting Problem

https://www.macrumors.com/2025/06/23/ipados-26s-local-capture-feature-tested/
1•tosh•25m ago•0 comments

Balikbayan Box

https://en.wikipedia.org/wiki/Balikbayan_box
1•speckx•25m ago•0 comments

Model Context Protocol, Without the Hype

https://petabridge.com/blog/mcp-without-the-hype/
2•Aaronontheweb•26m ago•0 comments

The Ways Long Contexts Fail

https://www.dbreunig.com/2025/06/22/how-contexts-fail-and-how-to-fix-them.html
2•dbreunig•28m ago•0 comments

An Extensible Iteration Facility

https://reindeereffect.com/0003
1•kmstout•28m ago•0 comments

Ask HN: Who's Building Agents?

1•ddl•30m ago•0 comments