frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•1y ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

How Paris is harnessing the Seine to replace air-con

https://www.theguardian.com/environment/2026/jun/26/underground-revolution-seine-cooling-network-...
1•rocketbop•31s ago•0 comments

WebKit always enables the Copy menu item in every app

https://lapcatsoftware.com/articles/2026/6/5.html
1•Udo_Schmitz•2m ago•0 comments

Dot Net Developer

1•KiranMakkineni•2m ago•0 comments

Texas Man Gets 30 Years in Prison for Transporting 'Anti-Government' Pamphlets

https://reason.com/2026/06/25/texas-man-gets-30-years-in-prison-for-transporting-anti-government-...
1•mrtesthah•2m ago•0 comments

Wallace the 6 inch f/2.8 telescope, building it, and hiking with it

https://lucassifoni.info/blog/hiking-with-wallace/
1•chantepierre•2m ago•0 comments

Google's hand-gesture reCAPTCHA wants access to your camera

https://blog.mega.io/google-hand-gesture-recaptcha
1•dotcoma•5m ago•0 comments

You probably don't need a UUID

https://ssg.dev/you-probably-dont-need-a-uuid/
1•sedatk•5m ago•0 comments

Framingham won't renew Flock Safety contract after months of resident opposition

https://www.boston.com/news/local-news/2026/06/25/framingham-police-will-not-renew-flock-safety-c...
1•pilingual•5m ago•0 comments

Is CRO real? Can you point to one thing hurting my landing page?

https://zenvesto.com/
1•zenvesto•7m ago•0 comments

The Meadows of Medieval Summer

https://www.historytoday.com/archive/out-margins/meadows-medieval-summer
1•lermontov•10m ago•0 comments

Goodbye, Scientific American

https://www.lawyersgunsmoneyblog.com/2026/06/goodbye-scientific-american
2•throwaway81523•12m ago•0 comments

How not to be a tennis parent

https://www.bbc.co.uk/sport/tennis/articles/cx23r1r55npo
1•mmarian•15m ago•0 comments

SpaceX plans to build 'Starpipe' natural gas pipeline to fuel Starship rockets

https://www.reuters.com/business/energy/spacex-plans-build-starpipe-natural-gas-pipeline-fuel-sta...
1•JumpCrisscross•16m ago•0 comments

Terminal Agents in 2026: Goose, Claude Code, OpenCode, and Pi Compared

https://outofcontext.dev/blog/goose-claude-code-opencode-pi/
1•leianixcheese•19m ago•0 comments

Anthropic Alleges Largest-Ever Claude Distillation Attack by Alibaba

https://twitter.com/MTSlive/status/2070141140607832353
1•seviu•21m ago•2 comments

Mapping Networks: CVPR 2026 Best Paper Award Nominee

https://arxiv.org/abs/2602.19134
3•aurenvale•23m ago•0 comments

Samsung readies $648B bet, report says, as AI boom reshapes South Korea

https://www.reuters.com/world/asia-pacific/samsung-invest-1000-trillion-won-south-korea-media-rep...
1•JumpCrisscross•23m ago•0 comments

Top DOJ Official Tells Staff He Wants to Avoid Antitrust Trials

https://www.wsj.com/politics/policy/top-doj-official-tells-staff-he-wants-to-avoid-antitrust-tria...
1•JumpCrisscross•24m ago•0 comments

A curated, non-BS library of the best resources for evaluating agents

https://github.com/benchflow-ai/awesome-evals
1•xdotli•25m ago•0 comments

Gemini Spark

https://gemini.google/overview/agent/spark/
1•czeizel•26m ago•2 comments

A Structured Generation Framework for Transforming Scientific Papers into Patent

https://arxiv.org/abs/2601.02589
1•teleforce•26m ago•0 comments

Rage Against the Dying of Critical Thinking

https://mmaksimovic.dev/rage-against-the-dying-of-critical-thinking
1•Liriel•26m ago•0 comments

PatentScore: Multi-Dimensional Evaluation of LLM-Generated Patent Claims

https://aclanthology.org/2025.emnlp-main.1564/
1•teleforce•27m ago•0 comments

Notion killing Skiff-influenced email app since most users use AI agents instead

https://arstechnica.com/gadgets/2026/06/notion-killing-skiff-influenced-email-app-since-most-user...
1•joozio•29m ago•0 comments

Worldwide X (Twitter) Trends for last 24 hours

https://trends24.in/
4•aurenvale•34m ago•0 comments

List of all UK universities currently having redundancies/restructuring

https://qmucu.org/qmul-transformation/uk-he-shrinking/
1•theanonymousone•37m ago•0 comments

The food science behind designing an ice cream

https://altermag.com/articles/designing-a-summer-ice-cream-for-india
1•trojanalert•39m ago•0 comments

Position: Stop Anthropomorphizing Intermediate Tokens as Reasoning/Thinking Trac

https://arxiv.org/abs/2504.09762
1•xiaoyu2006•45m ago•0 comments

My BASB Implementation in Org Mode

https://ftwynn.com/posts/my-basb-implementation-in-org-mode-v2023-06/
1•ankitg12•46m ago•0 comments

The largest scorpion lived 415M years ago

https://www.sciencenews.org/article/largest-scorpion-lived-415-million-years-ago
4•celadonceladon•46m ago•0 comments