Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•1y ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extensible across domains (math, retrieval, reasoning, tool use, etc.).

What's in Zeno for now?
- Auditable, stateless reward functions for Python code: docstrings, ruff linting, type hints, recursion, and more
- Works directly with Huggingface's TRL or any RL loop: plug reward functions in as needed
- MIT licensed and minimal
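To give a feel for what a stateless, deterministic reward for Python code can look like, here is a minimal sketch of a docstring check. It is not Zeno's actual API: the function name `docstring_reward` and the scoring rule (fraction of functions that carry a docstring) are assumptions made for illustration. The `(completions, **kwargs) -> list[float]` shape is the assumed convention for a custom reward callable in an RL loop; see the README for the real function names.

```python
import ast


def docstring_reward(completions, **kwargs):
    """Score each completion by the fraction of its functions that have a docstring.

    Hypothetical example, not part of Zeno. Assumes each completion is a
    plain string of Python source. Unparseable code and code without any
    function definitions both score 0.0.
    """
    rewards = []
    for source in completions:
        try:
            tree = ast.parse(source)
        except (SyntaxError, ValueError):
            # Code that does not parse cannot be verified, so it gets no reward.
            rewards.append(0.0)
            continue

        functions = [
            node
            for node in ast.walk(tree)
            if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
        ]
        if not functions:
            rewards.append(0.0)
            continue

        documented = sum(1 for fn in functions if ast.get_docstring(fn))
        rewards.append(documented / len(functions))
    return rewards
```

Because the function depends only on its input text, the same completion always gets the same score, which is what makes the reward auditable. In a TRL setup, a callable like this would typically be passed in the trainer's list of reward functions, assuming the completions arrive as plain strings rather than chat messages.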

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning, and agentic behaviors are on the to-do list.

Repo: https://github.com/think-a-tron/zeno

Docs and more details are in the README.

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.
