frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•1y ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

A Rejection on the Eve of Launch (2024)

https://jonofyi.substack.com/p/a-rejection-on-the-eve-of-launch
1•xk3•3m ago•0 comments

Tell HN: Stripe ToS demands biometrics, freezes payments until given

2•cuz-reasons•6m ago•0 comments

Bumblebees can solve problems like chimps and elephants

https://www.npr.org/2026/06/07/nx-s1-5846947/bumblebees-problem-solving-research
2•marojejian•6m ago•1 comments

My automated doubt development process

https://www.alexself.dev/blog/automated-doubt
1•aself101•9m ago•1 comments

Seattle unemployed worker stretches $690 per week

https://www.seattletimes.com/business/seattle-unemployed-worker-stretches-690-per-week-affording-...
1•petethomas•10m ago•0 comments

Where Do F1 Drivers Live? The Monaco Effect

https://www.kymillman.com/blog/where-do-f1-drivers-live-the-monaco-effect/
1•thunderbong•11m ago•0 comments

Making Peace with Your Unlived Dreams

https://nik.art/making-peace-with-your-unlived-dreams/
1•herbertl•11m ago•0 comments

Uni president told graduates to 'end themselves'

https://xcancel.com/TaiwanSpecial/status/2063099174019874882
2•bsgada•12m ago•0 comments

Memory safety is a matter of life and death

https://joshlf.com/posts/memory-safety-life-and-death/
1•birdculture•12m ago•0 comments

The complete IPv4 address space, mapped

https://worldip.io/
2•theanonymousone•13m ago•0 comments

A newly discovered organelle could help reduce cow methane emissions

https://phys.org/news/2026-05-newly-organelle-cow-methane-emissions.html
1•PaulHoule•15m ago•0 comments

Show HN: Axiomax – Cryptographic proof of AI inference carbon footprint

https://axiomaxllc.com
2•axiomaxllc•17m ago•0 comments

Not by AI

https://notbyai.fyi/
1•lopespm•18m ago•0 comments

The plan to give Americans an equity stake in AI

https://www.ft.com/content/8559a3f9-86de-4a1c-8a75-6623e83e6a00
1•marojejian•20m ago•2 comments

Self-Hosted JA4 to combat AI bots

https://blog.miloslavhomer.cz/deploying-ja4/
1•ArcHound•21m ago•0 comments

RTO Stalled: Weekly office visits remain down 30%

https://www.a16z.news/p/charts-of-the-week-rto-stalled
1•simonpure•21m ago•0 comments

Donald Trump, Bernie Sanders and Sam Altman are talking public ownership in AI

https://apnews.com/article/sam-altman-ai-bernie-sanders-trump-public-ownership-772224f9cd138eb79d...
2•breve•23m ago•0 comments

Feedback on my vision? DNS for AI

https://olw.gtll.app/plan
2•gabrielsmartin•26m ago•1 comments

Rebuilding a Web Text Editor

https://blog.readymag.com/rebuilding-web-text-editor/
2•imedvedev•28m ago•0 comments

Manufacturing and design aspects of BYD powertrain commented during disassembly [video]

https://www.youtube.com/watch?v=4LfDuyqmsts
1•2DcAf•29m ago•0 comments

Boomurl.com

https://boomurl.com
3•dorongrinstein•30m ago•1 comments

Building a Gifford-McMahon Cryocooler with 3D-Printed Parts [video]

https://www.youtube.com/watch?v=Jj7Q7OqaW4A
1•skibz•32m ago•0 comments

Desalinated ocean water gets one step closer to helping Arizona with drought

https://www.kjzz.org/politics/2026-06-04/desalinated-ocean-water-gets-one-step-closer-to-helping-...
1•bilsbie•33m ago•0 comments

Researchers Uncover Espionage in Mobile Networks

https://citizenlab.ca/researchers-uncover-espionage-in-mobile-networks/
3•jruohonen•35m ago•0 comments

Two notes on notation (Knuth, 1992)

https://arxiv.org/abs/math/9205211
1•tosh•36m ago•0 comments

SpaceX Perpetual Futures on Hyperliquid

https://hyperdash.com/asset/spcx-hyperliquid
2•davedx•36m ago•1 comments

CodePal: Snap Built an AI Code Reviewer for the Age of AI-Written Code

https://eng.snap.com/codepal
1•Kaedon•37m ago•0 comments

I scraped 743 large employers' careers pages to find their ATS

https://github.com/Kayvan-Zahiri/state-of-ats-2026
1•kzahiri•42m ago•1 comments

GraphRAG – a knowledge graph LLMs can traverse and write back to

https://github.com/mmkumar5401/GraphRag
2•mmkumar•43m ago•0 comments

A "Computer Science-Fiction" novel, Blue Screen, about the AI end of the world

https://www.amazon.com/Blue-Screen-Peter-Gustafson-Defragmented/dp/B084QL16YT
2•WWIII_Historian•45m ago•0 comments