frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•9mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

He lived on the street. Now turns raw dryland into frugal homestead [video]

https://www.youtube.com/watch?v=xa63fqDHmTQ
1•fallinditch•1m ago•0 comments

How I used Cursor to Migrate Frameworks

https://kentcdodds.com/blog/how-i-used-cursor-to-migrate-frameworks
1•abraham•2m ago•0 comments

How are engineering teams handling AI compliance?

1•partycat•2m ago•0 comments

A Program Binary Becomes a Running Process

https://buildsoftwaresystems.com/post/program-binary-to-running-process/
1•ThierryBuilds•3m ago•1 comments

Show HN: Cosmic horror newspaper engine for an Amsterdam art collective

https://unsafejournal.com/
1•webhouwer•5m ago•0 comments

DVC: Data Version Control

https://dvc.org/
1•Olshansky•8m ago•0 comments

Show HN: AluminatiAi – Per-job GPU energy cost tracking (open source)

1•AluminatiAi•9m ago•0 comments

Did the president need Congress to attack Iran?

https://docs.google.com/document/d/1KONKQuQkkU3lbWalkkvAdpJFk6xEPulCKmDPHHz-dLw/edit?usp=sharing
1•mrjoshuacraigt•12m ago•1 comments

Show HN: AI agent that works autonomously while I'm offline

https://hire-your-ai-guide.vercel.app
1•ZeroDayCyber•13m ago•0 comments

Show HN: Minecraft music player, with random intervals like in the game

https://paul-andre.github.io/minecraft-music-simulator/
1•bogdanoff_2•15m ago•0 comments

Show HN: Laid – Chrome extension that detects AI-generated LinkedIn posts

https://github.com/oldeucryptoboi/linkedin-ai-detector
1•oldeucryptoguy•17m ago•1 comments

Show HN: Tiny Toasts – Collect video toasts using a QR code (no app required)

https://tinytoasts.co
1•jtowers2010•20m ago•0 comments

Show HN: Cc-reaper – Three-layer cleanup for orphan Claude Code processes

https://github.com/theQuert/cc-reaper
1•thequert•24m ago•0 comments

Cab-Rank Rule

https://en.wikipedia.org/wiki/Cab-rank_rule
1•valzevul•24m ago•0 comments

The Whispering Earring

https://gwern.net/doc/fiction/science-fiction/2012-10-03-yvain-thewhisperingearring.html
3•iamsam123•25m ago•1 comments

Software Engineering in the Agentic Era

https://sidv.dev/blog/software-engineering-agentic-era/
1•fangpenlin•27m ago•0 comments

Show HN: Ralphex – autonomous GPT Codex agent loop for ChatGPT Pro users

https://github.com/SmolNero/ralphex
1•edgar_ortega•29m ago•0 comments

Xuv: X11 user daemon to automatically run commands triggered by user specified

https://codeberg.org/NRK/xuv
2•todsacerdoti•30m ago•0 comments

SaaS-pocalypse chatter is doomster pr0n

https://www.theregister.com/2026/03/01/saaspocalypse_opinion/
1•Bender•32m ago•1 comments

Lenovo shows off snap-together laptop with removable keyboard, screen, and ports

https://www.theregister.com/2026/03/01/lenovo_shows_off_modular_laptop/
1•Bender•33m ago•0 comments

South Korea's tax office apologizes for leaking seed phrase to seized crypto

https://www.theregister.com/2026/03/02/south_korea_tax_office_cryptocurrency_leak/
1•Bender•33m ago•0 comments

Ultrafast neural sampling with spiking nanolasers

https://www.nature.com/articles/s41467-025-66818-1
1•PaulHoule•38m ago•0 comments

Show HN: Port Forwarding Wrapper for Mosh

https://github.com/liyu1981/moshpf
1•liyu1981au•39m ago•0 comments

GPL as the Best Licence – Governance and Philosophy

https://blog.hansenpartnership.com/gpl-as-the-best-licence-governance-and-philosophy/
2•pabs3•41m ago•0 comments

The Mystery of Asjo.org

https://acid.vegas/blog/the-mystery-of-asjo-org/
1•reubn•41m ago•0 comments

If Trump attacks Iran, western media will be cheering him on

https://www.middleeasteye.net/opinion/if-trump-attacks-iran-western-media-will-be-cheering-him
2•lyu07282•45m ago•0 comments

Say Goodbye to the Undersea Cable That Made the Global Internet Possible

https://www.wired.com/story/say-goodbye-to-the-undersea-cable-that-made-the-global-internet-possi...
2•CHB0403085482•46m ago•1 comments

We Made the Isospectral Drums and It Went Fine

https://prismika.github.io/2026/03/01/we-made-the-isospectral-drums.html
2•nill0•47m ago•0 comments

Floor113.com – A Scarcity-Driven Dating System Built on Deterministic Access

https://floor113.com/
1•chainbuilder•51m ago•1 comments

DeepSeek to release long-awaited AI model in new challenge to US rivals

https://www.ft.com/content/e3366881-0622-40a7-9c34-a0d82e3d573e
3•freely0085•51m ago•1 comments