Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•6mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extensible across domains (math, retrieval, reasoning, tool use, etc.).

What's in Zeno for now?
- Auditable, stateless reward functions for Python code: docstrings, ruff linting, type hints, recursion, and more.
- Works directly with Hugging Face's TRL or any RL loop; plug reward functions in as needed.
- MIT licensed and minimal.
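To make "verifiable, deterministic reward" concrete, here is a minimal sketch of a docstring-coverage reward. It is not Zeno's actual API: the name `docstring_reward` and the scoring rule (fraction of functions that have a docstring) are assumptions for illustration. The `(completions, **kwargs)` signature that returns one float per completion follows the convention TRL documents for custom reward functions in `GRPOTrainer`, and the sketch assumes completions arrive as plain strings rather than chat messages.

```python
import ast


def docstring_reward(completions, **kwargs):
    """Score each completion by the fraction of its functions that have a docstring.

    Hypothetical example, not taken from the Zeno codebase. The score depends
    only on the completion text, so it is stateless and deterministic.
    """
    rewards = []
    for text in completions:
        try:
            tree = ast.parse(text)
        except SyntaxError:
            # Code that does not parse gets no reward.
            rewards.append(0.0)
            continue

        funcs = [
            node
            for node in ast.walk(tree)
            if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
        ]
        if not funcs:
            # Nothing to document, so nothing to reward.
            rewards.append(0.0)
            continue

        documented = sum(1 for f in funcs if ast.get_docstring(f))
        rewards.append(documented / len(funcs))
    return rewards
```

Because the function only parses the text and never executes it, the same completion always gets the same score, and the reason for any score can be read straight from the code. With TRL, a function of this shape would typically be passed in the `reward_funcs` list of `GRPOTrainer`.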

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning, and agentic behaviors are on the to-do list.

Repo: https://github.com/think-a-tron/zeno

Docs and more details are in the README.

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.
