frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•11mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

What Is Code?

https://martinfowler.com/articles/what-is-code.html
1•Garbage•50s ago•0 comments

Channelized topography amplifies melt-sensitivity of cold Antarctic ice shelves

https://www.nature.com/articles/s41467-026-71828-8
1•bryanrasmussen•56s ago•0 comments

Fsnotify Maintainer Dispute Sparks Supply Chain Concerns

https://socket.dev/blog/fsnotify-maintainer-dispute-sparks-supply-chain-concerns
1•elashri•1m ago•0 comments

Effects of Algorithmic Flagging on Fairness: Evidence from Wikipedia

https://mako.cc/copyrighteous/effects-of-algorithmic-flagging-on-fairness-quasi-experimental-evid...
1•surprisetalk•1m ago•0 comments

Good QC for RL Data

https://www.seancai.com/philosophy/good_qc_rl_data
1•gmays•1m ago•0 comments

Show HN: We Built OpenClaw but Worse

https://www.supafax.com/
1•rohanmahen•1m ago•0 comments

Molecular Views of Mineral Carbonation: Reaction of CO2 with Wollastonite

https://pubs.acs.org/doi/10.1021/acsnano.5c19629
1•PaulHoule•2m ago•0 comments

Rusternetes: Kubernetes, Reimplemented in Rust

https://github.com/calfonso/rusternetes
1•nateb2022•2m ago•0 comments

Big Tech Keeps Piling on AI Debt. Spending Is Set to Soar

https://www.barrons.com/articles/ai-debt-big-tech-bonds-da5291dc
1•1vuio0pswjnm7•2m ago•0 comments

Party on Spotify (your data since sign up)

https://partyoftheyears.withspotify.com/
1•mdrzn•3m ago•0 comments

Show HN: Self-hosted Spanner-like database that speaks Redis

https://github.com/swytchdb/swytch
1•withinboredom•3m ago•0 comments

Seeking fingerstick beta testers: Glucera Local-first iPhone glucose app

https://glucera.app/
1•kv0•3m ago•0 comments

Files.md – open-source alternative to Obsidian

https://github.com/zakirullin/files.md
2•zakirullin•5m ago•0 comments

Meta Sued by Santa Clara County for Allegedly Enabling Billions of 'Scam' Ads

https://www.law.com/therecorder/2026/05/11/meta-sued-by-santa-clara-county-for-allegedly-enabling...
1•1vuio0pswjnm7•5m ago•0 comments

Cruise-ship hantavirus cluster exposes a wider preparedness gap

https://www.nature.com/articles/d41586-026-01518-4
1•Brajeshwar•5m ago•0 comments

Show HN: JavaScript port of SQLite's parser, 2x-200x faster than others

https://github.com/justjake/sqlite3-parser-js
1•jitl•5m ago•0 comments

Interaction models by Thinking Machines Lab [video]

https://www.youtube.com/watch?v=A12AVongNN4
1•merqurio•8m ago•0 comments

Microsoft, your climate plan ran into a problem and needs to restart

https://microsoftlies.com
1•fermier•8m ago•0 comments

"Can you paint this Apple orange?"

https://blog.backslasher.net/orange-apple.html
2•Backslasher•8m ago•0 comments

Implicit Knowledge Is a Liability

1•gruyaume•9m ago•0 comments

MinIO adds petabyte-scale MemKV cache for Nvidia GPU inference

https://www.blocksandfiles.com/ai-ml/2026/05/12/minio-adds-petabyte-scale-memkv-cache-for-nvidia-...
1•bjflanne•9m ago•0 comments

The anti-minimalist backlash is the bigger story behind KDE oxygen revival

https://filipfila.wordpress.com/2026/05/10/the-anti-minimalist-backlash-is-the-bigger-story-behin...
1•rickcarlino•10m ago•0 comments

Ask HN: How are you collecting video testimonials without annoying ur customers?

1•touseefbuilds•11m ago•0 comments

What happens when everything becomes content?

https://velvetnoise.substack.com/p/what-happens-when-everything-becomes
1•jger15•13m ago•0 comments

Cacheman: A Comprehensive Last-Level Cache Management System

https://dl.acm.org/doi/10.1145/3774934.3786415
1•blakepelton•14m ago•1 comments

Show HN: Rate and Review Recruiters

https://www.recruiter.info/
1•recruiterinfo•14m ago•0 comments

Own your GH stars and HN upvotes

https://github.com/mipmip/startaste
1•mipselaer•15m ago•1 comments

Official Launch: DC-ROMA RISC-V Mainboard III for Framework Laptop 13

https://store.deepcomputing.io/products/dc-roma-risc-v-mainboard-iii-for-framework-laptop-13
1•YesterdayOK94•15m ago•0 comments

A 3.5 MB C++ engine for deterministic RAG deduplication hitting 30 GB/s

https://github.com/corbenicai/merlin-community
1•Corbenic•15m ago•0 comments

Anthropic is proof that SaaS isn't dead

https://www.octavehq.com/post/claude-code-wont-replace-saas
1•cmogni1•15m ago•0 comments