frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•9mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

So you don't want to be a manager

https://another.rodeo/straight-talk-ic-path/
1•mooreds•29s ago•0 comments

Show HN: I ditched frameworks to build a <512kb personal site

https://jotalea.com.ar/read.php?entry=KBojOqSWHyZPMYCf
1•Jotalea•1m ago•0 comments

What invariants would you enforce for broker import pipelines?

2•julien_devv•2m ago•0 comments

A college student's perspective on using AI in class

https://www.npr.org/2026/03/06/nx-s1-5732793/college-student-perspective-using-ai-in-class
2•cainxinth•2m ago•0 comments

Disassemble Z80 instructions by changing the font

https://blog.adafruit.com/2026/03/09/disassemble-z80-instructions-by-changing-the-font/
2•SergeAx•2m ago•0 comments

React library to add mentions in textarea

https://github.com/getnao/prompt-mentions
2•blef•3m ago•0 comments

How Do You Choose a Managed Security Service Provider (MSSP)?

https://msspproviders.io/
2•datacorp•4m ago•1 comments

XAI: Explainable Artificial Intelligence

https://www.darpa.mil/research/programs/explainable-artificial-intelligence
2•andsoitis•4m ago•0 comments

Show HN: ColumnLens – Query millions of rows in milliseconds on your Mac

2•hilti•6m ago•0 comments

The Found Report: A Look Inside America's Lost Luggage

https://www.unclaimedbaggage.com/pages/2026-found-report
2•SeanLang•7m ago•0 comments

Load-Bearing Walls

https://brianschrader.com/archive/load-bearing-walls/
2•sonicrocketman•8m ago•0 comments

Maybe All You Need Is the Friends You Made Along the Way

https://oooooooo.my/ayn.html
3•hpwfranks•8m ago•1 comments

Formalizing Data Structures and Algorithms with Agents

https://risemsr.github.io/blog/2026-03-06-autoclrs/
2•matt_d•8m ago•0 comments

Civic.games – Browser playable governance simulations

https://abeer1904.github.io/ai-race/
2•jimsojim•8m ago•0 comments

Ask HN: What Happened to xAI?

3•zof3•9m ago•0 comments

An OpenClaw skill for think-tank style analysis of crises like the Iran war

https://github.com/vassiliylakhonin/global-think-tank-analyst
2•vassilbek•10m ago•1 comments

Longest snake ever measured is over 23.5 feet long

https://www.popsci.com/environment/longest-snake-in-the-world/
2•bookofjoe•10m ago•0 comments

Six Selfish Reasons to Have Kids

https://kevinkelly.substack.com/p/six-selfish-reasons-to-have-kids
2•surprisetalk•13m ago•0 comments

The MIT Flea Swapfest 2026 Dates

https://web.mit.edu/w1mx/www/swapfest/flyer-2026.txt
2•oldnetguy•13m ago•0 comments

Ask HN: Optimizing Claude Code Workflow: Subscription or API Billing?

3•robeym•13m ago•0 comments

N times faster than C, Arm edition (2023)

https://blog.xoria.org/n-times-faster/
2•surprisetalk•13m ago•0 comments

Trapped in MS Office

https://ia.net/topics/trapped-in-ms-office
2•surprisetalk•13m ago•0 comments

Why haven't π±e, 2^e, π^e, or π^(√2) been shown to be irrational?

https://old.reddit.com/r/math/comments/d6xvk/why_havent_%CF%80e_2e_%CF%80e_or_%CF%802_been_shown_...
2•surprisetalk•13m ago•0 comments

We Tried Sitting on the Floor for 5 Years and Here's What Happened [video]

https://www.youtube.com/watch?v=L_9oU88UH_I
2•airstrike•14m ago•0 comments

Show HN: Gui.new – The Visual Layer for AI

https://gui.new
4•mynameisntbob•15m ago•1 comments

Ask HN: What is your current Agentic and/or Vibe coding setup?

2•sschueller•15m ago•0 comments

Black vs. White Gloss Chest: Pick a Tone to Match Your Style

https://dreamhomestore.co.uk/collections/chest-of-drawers
2•kevinwalton•16m ago•1 comments

Click-to-Release Strategy to Control Bond Cleavage and Prodrug Activation

https://onlinelibrary.wiley.com/doi/10.1002/anie.202515594
2•PaulHoule•17m ago•0 comments

Forge – OpenClaw for Enterprise

https://github.com/initializ/forge
2•ibreakthecloud•17m ago•0 comments

Sutton and Barto, Ch. 11: Off-Policy Methods with Approximation (Personal Notes)

https://chizkidd.github.io//2026/03/09/rl-sutton-barto-notes-ch011/
1•ibobev•19m ago•0 comments