Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•11mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extensible across domains (math, retrieval, reasoning, tool use, etc.).

What's in Zeno for now?
- Auditable, stateless reward functions for Python code: docstrings, ruff linting, type hints, recursion, and more
- Works directly with Hugging Face's TRL or any RL loop: plug reward functions in as needed
- MIT licensed and minimal
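
To give a rough idea of what a stateless, deterministic reward for Python code can look like, here is a minimal sketch that scores each completion by the fraction of its functions that have a docstring. This is not Zeno's actual API: the name `docstring_reward` is hypothetical, and the signature (a list of completion strings plus extra keyword arguments, returning one float per completion) is an assumption modeled on the custom reward function convention used by TRL's GRPO trainer.

```python
import ast


def docstring_reward(completions, **kwargs):
    """Score each completion by the share of its functions that have a docstring.

    Hypothetical example, not part of Zeno. Assumes each completion is a plain
    string of Python source. Returns one float in [0.0, 1.0] per completion.
    """
    rewards = []
    for code in completions:
        try:
            tree = ast.parse(code)
        except (SyntaxError, ValueError):
            # Unparseable code gets no reward.
            rewards.append(0.0)
            continue

        # Collect every function definition, including nested and async ones.
        funcs = [
            node
            for node in ast.walk(tree)
            if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
        ]
        if not funcs:
            # Nothing to document, so nothing to reward.
            rewards.append(0.0)
            continue

        documented = sum(1 for f in funcs if ast.get_docstring(f))
        rewards.append(documented / len(funcs))
    return rewards
```

Because the function depends only on its input and uses no model or network call, the same completion always gets the same score, which is what makes it auditable. With TRL, a function of this shape could be passed in the `reward_funcs` list of `GRPOTrainer`, assuming completions arrive as plain strings rather than chat messages.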

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning, and agentic behaviors are on the to-do list.

Repo: https://github.com/think-a-tron/zeno

Docs and more details are in the README.

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.
