frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•6mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Overconfidence associated with anti-consensus views on scientific issues

https://www.science.org/doi/10.1126/sciadv.abo0038
1•mhb•2m ago•0 comments

ArchtSoft – AI generates software architecture from requirements

1•SougataAS•3m ago•0 comments

Over-Regulation Is Doubling the Cost

https://rein.pk/over-regulation-is-doubling-the-cost
1•gdeglin•3m ago•0 comments

Disallow code usage with a custom `clippy.toml`

https://www.schneems.com/2025/11/19/find-accidental-code-usage-with-a-custom-clippytoml/
1•schneems•3m ago•0 comments

Nomor Call Center Agoda Indonesia. Hubungi 0815-4054-505

1•carmycars•3m ago•0 comments

Call Center Agoda Bandung 0815-4054-505

1•carmycars•5m ago•0 comments

Thunderbird Pro November 2025 Update

https://blog.thunderbird.net/2025/11/thunderbird-pro-november-2025-update/
2•ImJamal•5m ago•0 comments

Layanan Agoda Call Center Indonesia

1•carmycars•6m ago•0 comments

Commercially available mouthguards: First time unearthing trace elements (2024)

https://www.sciencedirect.com/science/article/abs/pii/S0048969724029371
1•robtherobber•6m ago•0 comments

Show HN: Lite³ – A JSON-Compatible Zero-Copy Serialization Format in 9.3 kB of C

https://github.com/fastserial/lite3
1•eliasdejong•6m ago•0 comments

Firefox 147 Will Support the XDG Base Directory Specification

https://www.phoronix.com/news/Firefox-147-XDG-Base-Directory
1•bradrn•6m ago•0 comments

Unlocking Hidden Operational Value in Utility Security Tech

https://www.convergint.com/unlocking-hidden-operational-value-in-utility-security-tech/
1•mooreds•6m ago•0 comments

Cara Reschedule Atau Refund Agoda

1•Basiaflux•8m ago•0 comments

Far-UVC: Continuous, safe, quiet protection against airborne diseases

https://www.faruvc.org/
1•mhb•9m ago•0 comments

Gemini 3 image model is live

https://llmgateway.io/models/gemini-3-pro-image-preview
1•steebchen•10m ago•0 comments

Call Center Agoda 24 Jam 08154054505

1•Basiaflux•13m ago•0 comments

Autism and Vaccines

https://www.cdc.gov/vaccine-safety/about/autism.html
3•thesuperbigfrog•14m ago•2 comments

Debiasing Reward Models by Representation Learning with Guarantees

https://arxiv.org/abs/2510.23751
1•PaulHoule•14m ago•0 comments

Stablecoins and tribal chiefs: Monetary authority after the GENIUS Act

https://duckbucks.com/a/stablecoins-genius-money
1•coloneltcb•16m ago•0 comments

Euipo Study: Major Brand Ads on Pirate Sites Surged 567%

https://torrentfreak.com/euipo-study-major-brand-ads-on-pirate-sites-surged-567/
1•gslin•16m ago•0 comments

How to write a great agents.md: Lessons from over 2,500 repositories

https://github.blog/ai-and-ml/github-copilot/how-to-write-a-great-agents-md-lessons-from-over-250...
5•achow•17m ago•0 comments

Request For Comments: A secure contact import scheme for social networks

https://docs.bsky.app/blog/contact-import-rfc
1•janpio•17m ago•1 comments

Xenofeminism a Politics for Alienation

https://laboriacuboniks.net/manifesto/xenofeminism-a-politics-for-alienation/
1•dweyn•17m ago•0 comments

Upgrading Fans with a Custom Shroud on a RTX3090 – Goodbye Fan Noise

https://boilingsteam.com/upgrading-fans-with-a-custom-shroud-on-a-rtx3090-goodbye-fan-noise/
1•ekianjo•19m ago•0 comments

AI-calls-Editor: IDE-native refactoring for AI coding assistants

https://blog.strnisa.com/p/ai-calls-editor
1•strnisa•19m ago•0 comments

Austin 4 Released

https://github.com/P403n1x87/austin
1•p403n1x87•20m ago•1 comments

User Identity Isn't Complete Without Authorization

https://fusionauth.io/blog/fusionauth-acquires-permify
2•mooreds•21m ago•0 comments

Signal Polls

https://signal.org/blog/polls/
1•fasz•21m ago•0 comments

A 17-Year-Old Nearly Built a Nuclear Reactor in His Backyard

https://scienceclock.com/how-a-17-year-old-nearly-built-a-nuclear-reactor-in-his-backyard/
3•ashishgupta2209•23m ago•0 comments

Boosting "hope molecules" through exercise

https://www.gq.com/story/how-to-boost-your-anti-aging-hope-molecules
1•DaveZale•25m ago•1 comments