frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•9mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Speed Can Reindustrialize America

https://www.austinvernon.site/blog/manufacturing.html
1•mfiguiere•17s ago•0 comments

Show HN: ContextLedger – CLI to track and handoff context b/w AI coding sessions

https://github.com/manthan787/context-ledger
1•EmTekker•41s ago•0 comments

Possible identification of the Luna 9 Moon landing site using machine learning

https://www.nature.com/articles/s44453-025-00020-x
1•marcodiego•2m ago•0 comments

New and Upcoming IRCv3 Features

https://libera.chat/news/new-and-upcoming-features-3
1•iamnothere•2m ago•0 comments

Karma Engineering

https://aimlbling-about.ninerealmlabs.com/blog/karma-engineering/
1•namnnumbr•3m ago•1 comments

With Apple: Fortify your app: Essential strategies to strengthen security

https://developer.apple.com/events/view/TUHA23T82K/dashboard
2•pjmlp•6m ago•0 comments

AI analysis for UK Parliament bills

https://ukparliament.vercel.app/
1•ArisC•8m ago•0 comments

iPhotron 4.10 Is Released

https://github.com/OliverZhaohaibin/iPhotron-LocalPhotoAlbumManager/releases/tag/v4.1.0
1•main-protect•9m ago•0 comments

Court orders Acer and Asus to stop selling PCs in Germany over H.265 patents

https://videocardz.com/newz/acer-and-asus-are-now-banned-from-selling-pcs-and-laptops-in-germany-...
2•ledoge•10m ago•0 comments

The Prompt of Babel

https://joemclean.github.io/writing/the-prompt-of-babel.html
1•jjjjjjjjoe•11m ago•3 comments

How Can Something Fall Faster Than Gravity? [video]

https://www.youtube.com/watch?v=dosAbCCKXLs
1•zahlman•13m ago•0 comments

Top AI SDR tools analysis

https://revenuesystemslab.substack.com/p/ai-sdr-tools
1•Atbech•14m ago•0 comments

Pentagon threatens to cut off Anthropic in AI safeguards dispute

https://www.reuters.com/technology/pentagon-threatens-cut-off-anthropic-ai-safeguards-dispute-axi...
1•MKais•14m ago•0 comments

Baseband, Bessel and Beyond

https://www.youtube.com/watch?v=0GjWRQMFVA8
1•michh•15m ago•0 comments

Addicted to your phone? Try "bricking" it

https://economist.com/culture/2026/02/15/addicted-to-your-phone-try-bricking-it
1•andsoitis•16m ago•0 comments

Codeberg is why developers are broke

https://sharemygit.com/
2•onesandofgrain•23m ago•1 comments

Show HN: Claude-relais – A plan/build/judge loop mixing Claude with Cursor

https://github.com/clementrog/claude-relais
1•crog•24m ago•0 comments

Can agentic coding raise the quality bar?

https://lpalmieri.com/posts/agentic-coding-raises-quality/
2•LukeMathWalker•25m ago•1 comments

Learning Kubernetes with the official docs and NotebookLM

https://randomwrites.com/
1•mutahirs•25m ago•0 comments

List of Sports Clichés

https://en.wikipedia.org/wiki/List_of_sports_clich%C3%A9s
1•carlos-menezes•26m ago•0 comments

State Attorneys General Want to Tie Online Access to ID

https://reclaimthenet.org/40-attorneys-general-back-ids-online-safety-act
21•computerliker•27m ago•9 comments

Python Fiddle – Online Python IDE, Compiler, and Interpreter

https://python-fiddle.com
2•Curiositry•27m ago•0 comments

Large Language Model Reasoning Failures

https://arxiv.org/abs/2602.06176
1•kawera•28m ago•0 comments

When Your Ally Turns Narcissistic: Manual for Navigating Transatlantic Relations

https://gppi.net/2025/10/12/when-your-ally-turns-narcissistic
3•rendx•30m ago•1 comments

The Second Half of the Chessboard

https://joshs.bearblog.dev/the-second-half-of-the-chessboard/
2•psychedare•34m ago•1 comments

Peter Steinberger: I need AI that scans every PR and Issue and de-dupes

https://twitter.com/steipete/status/2023057089346580828
1•vibeprofessor•35m ago•0 comments

Show HN: VOOG – Moog-style polyphonic synthesizer in Python with tkinter GUI

https://github.com/gpasquero/voog
6•gpasquero•35m ago•1 comments

Accessibility Is All You Need – Why agent protocols for the web are redundant

https://github.com/webmachinelearning/webmcp/issues/91
2•lulzx•36m ago•1 comments

Miller – CLI tool for querying, shaping, and reformatting data in many formats

https://miller.readthedocs.io/en/6.16.0/
4•smartmic•37m ago•0 comments

What Happened in El Paso? – By James Fallows

https://fallows.substack.com/p/what-happened-in-el-paso
2•MaysonL•37m ago•0 comments