Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•11mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extensible across domains (math, retrieval, reasoning, tool use, etc.).

What's in Zeno for now?
- Auditable, stateless reward functions for Python code: docstrings, ruff linting, type hints, recursion, and more.
- Works directly with Hugging Face's TRL or any RL loop; plug reward functions in as needed.
- MIT licensed and minimal.
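
To make "stateless, deterministic reward function" concrete, here is a minimal sketch of what a docstring reward could look like. It is not Zeno's actual API: the function name `docstring_reward` and the scoring rule (fraction of functions with a docstring) are assumptions for illustration, and it assumes completions arrive as plain strings of Python source. The `(completions, **kwargs)` shape returning one float per completion follows the convention TRL uses for custom reward functions.

```python
import ast


def docstring_reward(completions, **kwargs):
    """Score each completion by the fraction of its functions that have a docstring.

    Hypothetical example, not taken from Zeno. Code that does not parse,
    or that defines no functions, scores 0.0.
    """
    rewards = []
    for code in completions:
        try:
            tree = ast.parse(code)
        except (SyntaxError, ValueError):
            # Unparseable output gets no reward.
            rewards.append(0.0)
            continue

        funcs = [
            node
            for node in ast.walk(tree)
            if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
        ]
        if not funcs:
            rewards.append(0.0)
            continue

        documented = sum(1 for f in funcs if ast.get_docstring(f) is not None)
        rewards.append(documented / len(funcs))
    return rewards
```

Because the function only inspects the text it is given, the same completion always gets the same score, which is what makes the reward auditable. With TRL, a function like this would typically be passed in the `reward_funcs` list of `GRPOTrainer`; in any other RL loop it can be called directly on a batch of generated strings.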

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning, and agentic behaviors are on the to-do list.

Repo: https://github.com/think-a-tron/zeno

Docs and more details are in the README.

Comments, critiques, and real-world use cases are encouraged, especially if you want to push beyond code.
