frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•1y ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Analog Activism: Kicking AI Out of New York

https://nowvoyagermag.com/reporting/analog-activism
1•giuliomagnifico•49s ago•0 comments

Show HN: Zanagrams

https://zanagrams.com/
1•pompomsheep•2m ago•0 comments

Hi I M a Hack

1•GaraBoxHACKER•3m ago•0 comments

California legislature agrees to upload driver's licenses to national database

https://papersplease.org/wp/2026/06/27/california-legislature-agrees-to-upload-drivers-licenses-t...
3•iamnothere•5m ago•0 comments

Show HN: ClassicTunes – a from-scratch remake of iTunes 7-10 for Apple Silicon

https://smaran-vallabhaneni.com/ClassicTunes/
1•Eonexus•8m ago•0 comments

Icon

https://en.wikipedia.org/wiki/Icon_(programming_language)
1•tosh•8m ago•0 comments

The Collaboration Layer for AI Intelligence

https://gitix.ai/
1•azolf•11m ago•0 comments

Are You Recommended by AI?

https://www.mentionedby.world/
1•aykhanstoic•12m ago•1 comments

Show HN: Import the HN Home to a reading queue with clean reader view and TL;DR

https://readplace.com/import?mode=from-url
2•fagnerbrack•16m ago•1 comments

Reimagining Systems Thinking as Cybersystemic Researching

https://stream.syscoi.com/2025/12/01/reimagining-systems-thinking-as-cybersystemic-researching-an...
2•andsoitis•16m ago•0 comments

Grok 4.5, based on our 1.5T V9 foundation model, with Cursor data added in su

https://twitter.com/elonmusk/status/2071184354756477041
5•cyrc•20m ago•2 comments

The shift from browsing to commanding: Autonomous web agents in action

https://www.fognitix.com/
2•fognitix•20m ago•2 comments

Five Months in Munich: Revisiting '91 Without Erasing Decades That Made It Scale

https://akmaier.substack.com/p/five-months-in-munich-revisiting
2•felixbraun•21m ago•0 comments

Was Ozempic discovered thanks to "silly" research?

https://www.oscillator.blog/p/was-ozempic-discovered-thanks-to
3•salonium_•21m ago•0 comments

The vibration of the pager has a sound all its own

https://www.notyouremergency.com/triage-intro
3•mooreds•21m ago•0 comments

32BJ Health Fund and Northwell Direct announce direct health care contract

https://www.northwell.edu/news/the-latest/northwell-direct-32bj-largest-direct-health-care-contra...
2•mooreds•21m ago•0 comments

How should founders choose the right tech stack for a startup website?

https://moonsofts.net/
2•MoonSofts•23m ago•0 comments

America's largest companies have no simple way to report security flaws

https://this.weekinsecurity.com/dozens-of-americas-largest-companies-have-no-simple-way-to-report...
3•mooreds•23m ago•0 comments

Installing SerenityOS on My Old ThinkPad T60

https://btxx.org/posts/serenity-t60/
5•jandeboevrie•24m ago•0 comments

Forensic tools as instruments of repression: Cellebrite use in Russia

https://andreafortuna.org/2026/06/28/cellebrite-russia-pivovarov/
2•iamnothere•24m ago•0 comments

"Quality is downstream from caring"

https://graybeard.ing/quality-is-downstream-from-caring/
3•rglover•24m ago•0 comments

Bjorn Lomborg – 'An Inconvenient Truth' 20 Years Later

https://signalscv.com/2026/06/bjorn-lomborg-an-inconvenient-truth-20-years-later/
2•RickJWagner•24m ago•1 comments

CATL online store for direct sales of energy storage to small/medium customers

https://carnewschina.com/2026/06/26/catl-launches-online-store-for-direct-sales-of-energy-storage...
2•DamonHD•25m ago•0 comments

Ask HN: Is Hacker News selling your email?

3•tyleo•28m ago•2 comments

Clarity, Accountability, and Care – The Three Conditions That Make Teams Work

https://nmcqueen.substack.com/p/clarity-accountability-and-care
2•backlit4034•30m ago•0 comments

Cachebox, a small cache server with TTLs, dogpile locks, tags and bounded memory

https://github.com/smarzola/cachebox
3•smarzola•33m ago•0 comments

California's landmark anti-plastics law sparks anger as 17 states move to sue

https://www.theguardian.com/environment/2026/jun/26/california-single-use-plastic-law
4•andsoitis•33m ago•1 comments

Show HN: A REPL for browsers that agents love

https://fuckui.com
2•keepamovin•35m ago•1 comments

The dordolec, the 'evil eye' and superstition in Albania

https://michaelharrison.org.uk/2013/05/the-dordolec-the-evil-eye-and-superstition-in-albania/
2•jruohonen•35m ago•0 comments

The Fake Pilot (2010)

https://www.news.com.au/travel/travel-updates/fake-pilot-thomas-salme-says-passengers-were-never-...
3•redbell•36m ago•0 comments