I’m excited to share RUNRL JOB, our new one-click service for running Reinforcement Learning Fine-Tuning (RFT) workloads—think GRPO, PPO, or any custom reward-based tuning—directly on HPC-AI.com.
What It Is
Pre-wired RFT pipeline: dual-network configs, memory optimizations, logging, and reward modules are all set up for you (a reward-function sketch follows this list).
Model support: demos with Qwen-3B and Qwen-1.5 out of the box; drop in your own model if you like.
Cost & performance transparency: real-hardware benchmarks on 8× H100/H200, with live metrics in TensorBoard and built-in cost tracking.
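To give a sense of what plugging in your own reward looks like, here is a minimal illustrative sketch of a reward function as a plain callable that scores completions. This is not RUNRL JOB's actual interface; the function name, signature, and scoring logic are all made up for illustration:

```python
from typing import List

def length_penalty_reward(prompts: List[str], completions: List[str]) -> List[float]:
    """Toy reward: prefer non-empty, concise completions.
    Purely illustrative -- swap in your own scoring logic or a learned reward model.
    """
    rewards = []
    for completion in completions:
        tokens = completion.split()
        base = 1.0 if tokens else 0.0              # reward any non-empty answer
        penalty = max(0, len(tokens) - 128) * 0.01  # penalize rambling past ~128 words
        rewards.append(base - penalty)
    return rewards
```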
Why It Matters
Memory-efficient GRPO: up to 40% memory savings vs PPO, since there is no separate value network and no second backward pass for a critic (see the sketch after this list).
Zero setup: no Dockerfiles, no dependency hell—just click “Start” and your training job spins up.
Accessible RLHF: lowers the barrier for researchers, students, and indie hackers to experiment at scale.
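For context on where those savings come from: GRPO replaces PPO's learned value baseline with a group-relative one computed directly from sampled rewards. Below is a minimal sketch of that advantage computation (illustrative only, not RUNRL JOB's actual code; names and shapes are assumptions):

```python
import torch

def grpo_advantages(rewards: torch.Tensor, eps: float = 1e-6) -> torch.Tensor:
    """Group-relative advantages: each completion's reward is normalized
    against the mean/std of its sampling group, so no value network
    (and no extra critic backward pass) is needed.

    rewards: shape (num_prompts, group_size), one scalar reward per
    sampled completion for each prompt.
    """
    mean = rewards.mean(dim=1, keepdim=True)
    std = rewards.std(dim=1, keepdim=True)
    return (rewards - mean) / (std + eps)

# Example: 2 prompts, 4 sampled completions each
rewards = torch.tensor([[1.0, 0.5, 0.0, 2.0],
                        [0.2, 0.8, 0.8, 0.2]])
print(grpo_advantages(rewards))
```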
How to Try
Visit the blog post: https://hpc-ai.com/blog/RUNRL_JOB_is_live_on_hpc-ai
Click “Launch GPU Instances”, choose H100 or H200.
Select the RUNRL JOB template and hit “Start Job”.
Monitor progress live in JupyterLab or via TensorBoard, with zero extra setup (a sketch for logging your own metrics follows below).
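If you extend the training loop and want custom metrics to appear alongside the built-in ones, the standard PyTorch SummaryWriter works; a minimal sketch follows (the log directory and metric name are assumptions, not the template's actual paths):

```python
from torch.utils.tensorboard import SummaryWriter

# Hypothetical log directory; point it at whatever the RUNRL JOB template actually uses.
writer = SummaryWriter(log_dir="./runs/grpo_demo")

for step in range(100):
    fake_reward = 0.01 * step  # stand-in for your rollout's mean reward
    writer.add_scalar("reward/mean", fake_reward, step)

writer.close()
```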