frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•8mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

SiteBuilder: Edit Web Content with Natural Language

https://manuel.kiessling.net/2026/01/25/introducing-sitebuilder/
1•speckx•1m ago•0 comments

We've got Cloudflare at home (building my own Cloudflare)

https://dylans.link/blog/2026-01-26-weve-got-cloudflare-at-home
1•dyl000•2m ago•0 comments

Show HN: Monetize your Chrome extensions with "App Pass"

https://joinapppass.com/partner
1•hao1300•4m ago•0 comments

Consuming an unprocessed diet reduces energy intake

https://www.sciencedirect.com/science/article/pii/S0002916525007750
1•PaulHoule•4m ago•0 comments

Claude's Constitutional Structure

https://thezvi.substack.com/p/claudes-constitutional-structure
1•7777777phil•5m ago•0 comments

Technology in 1776

https://www.a16z.news/p/technology-in-1776
1•jbredeche•5m ago•0 comments

Gold price tops $5k an ounce for first time

https://www.theguardian.com/business/2026/jan/26/gold-prices-record-5000-ounce-trump-turmoil
1•teleforce•5m ago•0 comments

SQUR beats humans in Capture The Flag

https://squr.ai/blog/squr-beats-humans-ctf/
1•adamlundqvist•5m ago•1 comments

Science of Habit Building

https://invertedpassion.com/science-of-habit-building/
1•twapi•6m ago•0 comments

Crossbars 2

https://www.shadertoy.com/view/mdKXWh
1•keepamovin•6m ago•0 comments

Spack: A flexible package manager for HPC software

https://computing.llnl.gov/projects/spack-hpc-package-manager
1•teleforce•10m ago•0 comments

NASA is sending people to the moon in spacecraft some experts think is not safe

https://www.cnn.com/2026/01/23/science/artemis-2-orion-capsule-heat-shield
2•ck2•11m ago•1 comments

New Patches Aim to Lower Linux Memory Use for Swap, Slightly Improve Performance

https://www.phoronix.com/news/Linux-Better-Swap-Tencent
1•XzetaU8•11m ago•0 comments

Welcome to the American Winter

https://www.theatlantic.com/politics/2026/01/minneapolis-uprising/685755/
3•empath75•11m ago•1 comments

The Danger of a Single Capital Letter: How I Almost Ruined a Redmine Instance

https://blog.devbert.de/the-danger-of-a-single-capital-letter/
1•preezer•12m ago•0 comments

Brex and the Pros and Cons of Hubristic Fundraising

https://www.saastr.com/brex-and-the-pros-and-cons-of-hubristic-fundraising/
1•wslh•13m ago•0 comments

Qwen3-Max-Thinking

https://qwen.ai/blog?id=qwen3-max-thinking
7•vinhnx•14m ago•0 comments

Building Brains on a Computer

https://www.asimov.press/p/brains
1•mailyk•14m ago•0 comments

Is the US Supreme Court Biased Towards the Rich?

https://www.nominalnews.com/p/is-the-us-supreme-court-bias-wealthy
4•MasPL•15m ago•1 comments

Notes on German Exit Tax from Paid Tax Advisor Calls

https://wegzugsteuer.info/en
1•olieidel•15m ago•0 comments

My vibe engineering process and stack

https://aimode.substack.com/p/my-vibe-engineering-process-and-stack
1•warthog•17m ago•0 comments

What We Can't Control (2016)

https://solomon.io/what-we-cant-control/
1•samsolomon•18m ago•0 comments

Competitive Pure Functional Languages

https://blog.samibadawi.com/2026/01/competitive-pure-functional-languages.html
3•type-lambda•18m ago•0 comments

Technology is changing how we write – and how we think about writing

https://www.nature.com/articles/d41586-026-00245-0
1•geox•20m ago•1 comments

Mods, when will you get on top of the constant AI slop posts?

https://old.reddit.com/r/programming
2•birdculture•21m ago•0 comments

Show HN: I built a tool for automated failure analysis in GitHub Actions

https://github.com/marketplace/actions/github-actions-failure-analysis
1•calebevans•23m ago•0 comments

Linear-Term: A TUI for Linear Project Management

https://github.com/tjburch/linear-term
1•tjburch•24m ago•0 comments

The Age of Impoliteness: Galateo: Or, a Treatise on Politeness (1774)

https://publicdomainreview.org/collection/galateo/
2•Anon84•24m ago•0 comments

Novel biosensor enables real-time tracking of iron (II) in living cells

https://pubs.acs.org/doi/10.1021/acssensors.5c02481
1•bookofjoe•25m ago•0 comments

Go tests probably don't need a mocking library

https://rednafi.com/go/mocking-libraries-bleh/
1•AlexeyBelov•25m ago•1 comments