frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•11mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

xkcd: Well 2

https://xkcd.com/568/
1•ulrikrasmussen•3m ago•0 comments

Solving Brain Aging: Fast and Slow

https://blog.amaranth.foundation/p/solving-brain-aging-fast-and-slow
1•pminimax•14m ago•0 comments

Fear of layoffs what should I do?

1•cipherdc•16m ago•0 comments

'Googlebooks' have a premium focus, some Chromebooks can be upgraded

https://9to5google.com/2026/05/12/googlebooks-have-a-premium-focus-some-chromebooks-can-be-upgraded/
1•theanonymousone•20m ago•0 comments

NPM-Scan – Detects TanStack Worm, Beats Socket/Snyk (Local/BYOC)

https://github.com/lateos-ai/npm-scan
1•lateos-ai•23m ago•0 comments

eBay rejects $56B GameStop bid as 'neither credible nor attractive'

https://www.ft.com/content/554f76a6-218d-4f88-bcad-9c52623ef533
1•petethomas•27m ago•0 comments

The identity join problem: Linking SSO profiles to directory users

https://workos.com/blog/linking-sso-profiles-to-directory-users
2•jamilbk•32m ago•0 comments

Let's Encrypt: Gen Y Cross-Certified Subordinate CAs Missing ServerAuth EKU

https://bugzilla.mozilla.org/show_bug.cgi?id=2038351
3•XYen0n•37m ago•0 comments

Ask HN: Freelance Billing in the Age of LLMs?

1•meter•40m ago•0 comments

Temu is advertising filet mignon on X

https://twitter.com/shoptemu/status/2053092200632685016
18•noleary•40m ago•1 comments

Rectangle Shopping (Almost Anything)

https://www.rectangle.so
1•Waseemkhalo•41m ago•0 comments

Cemu (WiiU emulator) compromised by Russian threat actor

https://rentry.co/cemu-security-psa
2•gassi•45m ago•0 comments

Claude for Legal Launches

https://www.artificiallawyer.com/2026/05/12/claude-for-legal-launches-may-reshape-the-legal-tech-...
1•msolujic•51m ago•0 comments

[PATCH linux] README: Don't organize the README by arbitrary "roles"

https://lore.kernel.org/lkml/20260513004616.2877-1-me@runxiyu.org/T/#u
1•runxiyu•51m ago•0 comments

Self-hosted AI memory with web dashboard – Cloudflare Workers, D1, Vectorize

https://github.com/rahilp/second-brain-cloudflare
1•rahilpirani•53m ago•0 comments

Diversity and functional profile of the "microbial proteome" in fermented foods

https://pubs.rsc.org/en/content/articlelanding/2026/fo/d5fo05039a
2•PaulHoule•57m ago•0 comments

BYOM stock analysis via MCP, looking for feedback

https://stocks.lynxdi.com/
1•pezhao•59m ago•0 comments

Show HN: I spent $100 in Claude tokens and 1k battles training my AI tank

https://agentank.ai/history/mat_8v9fSEZE8295dcZ8U
2•mazzystar•59m ago•0 comments

DMARC Fail: 7 Causes and How to Fix Each

https://dmarcguard.io/blog/dmarc-failed-how-to-fix/
2•meysamazad•1h ago•0 comments

Notifications Are a Form of Surveillance

https://frostecho.neocities.org/posts/notifications-are-a-form-of-surveillance/
1•meysamazad•1h ago•1 comments

A HAR Analyser That Stays in the Browser

https://thelazysre.com/posts/a-har-analyser-that-stays-in-your-browser/
1•meysamazad•1h ago•0 comments

ESR on dropping terminfo and curses from an old Unix game

https://twitter.com/i/status/2053957912624500929
13•Ariarule•1h ago•1 comments

Income tax calculator for US and Canada

https://takehome.tax
1•ccnomas•1h ago•0 comments

"Cancelling Async Rust" – RustConf 2025

https://www.youtube.com/watch?v=zrv5Cy1R7r4
2•tcp_handshaker•1h ago•0 comments

Building Kiteshield: A journey from prototype to safety-critical

https://www.youtube.com/watch?v=6YGghlVOXlE
2•tcp_handshaker•1h ago•0 comments

Using LLM in the shebang line of a script

https://til.simonwillison.net/llms/llm-shebang
3•dnw•1h ago•0 comments

Reanimation of the First Automatic Theorem Prover (From 1956)

https://github.com/dmoews/logic-theorist
2•abrax3141•1h ago•1 comments

Show HN: Gremlin

https://github.com/aosmith/gremlin
1•aosmith•1h ago•0 comments

Zero-native – Build native desktop apps with web UI

https://zero-native.dev
4•gedy•1h ago•0 comments

Revisiting "No Silver Bullets" in the Age of AI

https://newsletter.pragmaticengineer.com/p/revisiting-no-silver-bullets-in-the
1•perpetua•1h ago•1 comments