We built an RL environment for credit card reward optimization and trained Qwen 32B with GRPO against it. The trained model scores ~0.51 on held-out tasks, vs. ~0.41 for Opus 4 and ~0.36 for GPT-4o. The environment is open source (Apache 2.0). The blog post explains the reward design, what broke during training, how we fixed it, and what we'd do differently.
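
For readers unfamiliar with GRPO: its core idea is to score each sampled rollout against the other rollouts for the same prompt, using the group mean as the baseline instead of a learned value function. A minimal sketch of that group-relative advantage step (illustrative only, not our training code; the function name and epsilon are assumptions):

```python
from statistics import mean, pstdev

def grpo_advantages(rewards, eps=1e-8):
    """Group-relative advantages: A_i = (r_i - mean(r)) / (std(r) + eps).

    `rewards` holds the environment's scores for G completions
    sampled from the same prompt; each completion is then weighted
    by its advantage in the policy-gradient update.
    """
    mu = mean(rewards)
    sigma = pstdev(rewards)
    return [(r - mu) / (sigma + eps) for r in rewards]

# e.g. four rollouts on one reward-optimization task
advs = grpo_advantages([0.2, 0.8, 0.5, 0.5])
```

Because the baseline is computed per group, advantages sum to (approximately) zero within each prompt's rollouts, which is what makes a dense, well-shaped environment reward so important for training signal.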