frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•11mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Proposed New Test of AI Capabilities:)

1•VikRubenfeld•40s ago•0 comments

Seemingly Conscious AI Risks

https://papers.ssrn.com/sol3/papers.cfm?abstract_id=6588659&trk=comments_comments-list_comment-text
1•andy99•1m ago•0 comments

Dockframe, modular USB-C hub based on framework adapter cards

http://dockframe.com
1•heatmiser•4m ago•1 comments

Jay Shetty: "I Read 10 Books That Changed My Life"

https://www.youtube.com/watch?v=HfNYp5k2wf8
1•Brysonbw•4m ago•0 comments

The Quantization Robustness of Diffusion Language Models in Coding Benchmarks

https://arxiv.org/abs/2604.20079
1•matt_d•4m ago•0 comments

Memory-harness: Linux Rust CLI for low-overhead peak-RSS and memory profiling

https://github.com/mjgil-rust/memory-harness
1•mjgil•5m ago•0 comments

GoDaddy Gave a Domain to a Stranger Without Any Documentation

https://anchor.host/godaddy-gave-a-domain-to-a-stranger-without-any-documentation/
1•jamesponddotco•5m ago•0 comments

Paramount Is Down (UK)

https://downdetector.co.uk/status/paramountplus/
1•librasteve•5m ago•1 comments

Awesome Codex Automations

https://github.com/onurkanbakirci/awesome-codex-automations
1•onurkanbkrc•8m ago•0 comments

10x Is a Lot

https://www.quarter--mile.com/10x-Is-a-Lot
1•gkolli•8m ago•0 comments

Ben Horowitz on What Makes a Great Founder

https://www.youtube.com/watch?v=dFT4xj57D7U
1•Brysonbw•13m ago•0 comments

I scanned 1M domains and found the web's AI instruction layer

https://dialtoneapp.com/2026/april/i-scanned-1M-domains
2•fcpguru•14m ago•0 comments

Quick tutorial to get a blog online from Org Mode thanks to Org Social

https://en.andros.dev/blog/c68f00c3/quick-tutorial-to-get-a-blog-online-from-org-mode-thanks-to-o...
1•ibobev•17m ago•0 comments

Toolchain Horizons: Exploring Rust Dependency-Toolchain Compatibility

https://tigerbeetle.com/blog/2026-04-24-toolchain-horizons/
1•ibobev•17m ago•0 comments

The predictable failure of the QDay Prize

https://algassert.com/post/2601
1•firefly284•18m ago•0 comments

Staying a Spell with the Exidy Sorcerer

https://bumbershootsoft.wordpress.com/2026/04/25/staying-a-spell-with-the-exidy-sorcerer/
1•ibobev•18m ago•0 comments

A weekend with LoRA on Gemma 4 E2B: instrumenting what fine-tuning changes

https://aiexplr.com/post/fine-tuning-5b-code-assistant-three-lessons
1•mailharishin•18m ago•0 comments

New robotic control software avoids jamming their joints

https://arstechnica.com/science/2026/04/kinematic-intelligence-helps-robots-learn-their-limits/
2•Brajeshwar•20m ago•0 comments

The West forgot how to make things, now it's forgetting how to code

https://conduit.arewefriends.org/s/the-west-forgot-how-to-make-things-now-its-forgetting-how-to-8...
2•01-_-•21m ago•0 comments

The Visible Zorker: Zork 1

https://eblong.com/infocom/visi/zork1/
3•PLenz•21m ago•0 comments

One last trip to the internet in 2009 with The Rough Guide 14

https://www.planetjones.net/blog/19-04-2026/one-last-trip-to-the-internet-in-2009-with-the-rough-...
1•planetjones•21m ago•0 comments

I worked just as hard, failed just as hard–then saw it was rigged

https://comuniq.xyz/post?t=996
2•01-_-•23m ago•0 comments

pvlib: Open-source Python library for solar power modeling

https://github.com/pvlib/pvlib-python
1•ep_jhu•25m ago•0 comments

Invincat – terminal AI coding agent with tiered, auditable long-term memory

https://github.com/dog-qiuqiu/invincat
1•qiuqiu123•26m ago•0 comments

MCP Server and CLI for Accessing Work IQ

https://github.com/microsoft/work-iq
2•saikatsg•26m ago•0 comments

IMaySellIt – A marketplace where every listing is offer-only

https://imaysellit.com/
2•imaysellit•30m ago•1 comments

Mali's Tuareg rebels announce deal for Russian Africa Corps withdrawal

https://www.france24.com/en/africa/20260426-new-fighting-erupts-in-north-mali-s-kidal-as-army-cla...
2•mooreds•30m ago•0 comments

Show HN: Jigs-tiny Rust framework for interactive maps of composable pipelines

https://github.com/ValeriaVG/jigs
2•valeriavg_dev•30m ago•0 comments

AI Reverses the Political Logic of the Internet

https://www.techpolicy.press/how-ai-reverses-the-political-logic-of-the-internet/
3•mooreds•30m ago•0 comments

Arctic Temperatures

https://zacklabe.com/arctic-temperatures/
2•mooreds•31m ago•0 comments