Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•6mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extensible across domains (math, retrieval, reasoning, tool use, etc.).

What's in Zeno for now?

- Auditable, stateless reward functions for Python code: docstrings, ruff linting, type hints, recursion, and more.
- Works directly with Hugging Face's TRL or any RL loop; plug reward functions in as needed.
- MIT licensed and minimal.
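To make "auditable, stateless reward function" concrete, here is a minimal sketch of what such a check could look like. It is not taken from Zeno's codebase: the function name `docstring_and_hints_reward`, the 50/50 weighting, and the choice to score only docstrings and return annotations are all assumptions made for illustration. It uses only Python's standard `ast` module and assumes each completion is a plain string of Python source.

```python
import ast


def docstring_and_hints_reward(completions, **kwargs):
    """Hypothetical reward: score each completion in [0, 1] using static checks only.

    Half the score is the fraction of functions with a docstring, half is the
    fraction with a return annotation. No state is kept between calls, so the
    same input always yields the same reward.
    """
    rewards = []
    for text in completions:
        try:
            tree = ast.parse(text)
        except SyntaxError:
            rewards.append(0.0)  # code that does not parse earns nothing
            continue

        funcs = [
            node
            for node in ast.walk(tree)
            if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
        ]
        if not funcs:
            rewards.append(0.0)  # nothing to grade
            continue

        documented = sum(ast.get_docstring(f) is not None for f in funcs)
        annotated = sum(f.returns is not None for f in funcs)
        rewards.append(0.5 * documented / len(funcs) + 0.5 * annotated / len(funcs))
    return rewards


if __name__ == "__main__":
    good = 'def add(a: int, b: int) -> int:\n    """Add two ints."""\n    return a + b\n'
    bare = "def add(a, b):\n    return a + b\n"
    print(docstring_and_hints_reward([good, bare]))
```

A callable of this shape (a list of completions in, a list of floats out) is the kind of function TRL's `GRPOTrainer` accepts through its `reward_funcs` argument. Whether Zeno's own functions follow exactly this signature is something to confirm in the README.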

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning, and agentic behaviors are on the to-do list.

Repo: https://github.com/think-a-tron/zeno

Docs and more details are in the README.

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.
