frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•1y ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Data Fundamentals Primer for Learning LLM

https://algo-rhythm.dev/en/data/
2•vlumm•48s ago•0 comments

Execs Are Deploying Digital Twins to Do Their Work

https://www.wsj.com/tech/ai/execs-are-deploying-digital-twins-to-do-their-work-9547b375
1•Brajeshwar•53s ago•0 comments

Data Centers Now Consume 6% of US Electricity–and the Backlash Has Begun

https://singularityhub.com/2026/05/22/data-centers-now-consume-6-of-electricity-in-the-us-and-the...
1•cdrnsf•1m ago•0 comments

Agent Is Still a Hardcoded Workflow. It Is Not a Digital Employee Yet

https://kimura.yumiwillems.com/p/your-agent-is-still-a-hardcoded-workflow
1•BehaviorGraph•2m ago•0 comments

When Steve Jobs Grew Up

https://www.thefp.com/p/steve-jobs-leadership-transformation
1•Michelangelo11•4m ago•0 comments

NegPy – Open-source (GPL-3) film negative converter

https://github.com/marcinz606/NegPy
1•marcinz606•6m ago•0 comments

Read-only developer endpoint scanner for on-disk package, extension

https://github.com/perplexityai/bumblebee
1•taubek•9m ago•0 comments

Scotland Yard can keep using live facial recognition on people in London- judges

https://www.theregister.com/security/2026/04/22/high-court-approves-met-polices-facial-recog-afte...
1•gnabgib•10m ago•0 comments

AI Translate All Formats

1•cadic2603•12m ago•0 comments

Cisco Foundry Security Spec: Open specification for agentic security evaluation

https://github.com/CiscoDevNet/foundry-security-spec
2•cpard•13m ago•0 comments

Why Japan has abandoned houses

https://thehustle.co/newsletters/13-05-2026
1•stephsmithio•15m ago•1 comments

Google vs. Perplexity Chrome Extension

https://github.com/sarons/dual-ai-chat
1•cybermango•16m ago•1 comments

Quantum Dynamics Breakthrough Overturns Claim of 'Quantum Supremacy'

https://www.simonsfoundation.org/2026/05/21/quantum-dynamics-breakthrough-overturns-claim-of-quan...
4•SiempreViernes•23m ago•0 comments

Free admission and discounted overnight stays with Parks Canada

https://parks.canada.ca/voyage-travel/conseils-tips/choisis-canada-choose/admission-camping
2•bookofjoe•25m ago•0 comments

Marimo: A Reactive Python Notebook

https://marimo.io
1•pmaddams•26m ago•0 comments

Why Most Senior Devs Plateau, and What to Do

https://stackandscale.substack.com/p/why-most-senior-developers-plateau
3•lucyb0207•29m ago•1 comments

Onfim

https://en.wikipedia.org/wiki/Onfim
3•Michelangelo11•31m ago•0 comments

You will not be a member of the permanent underclass

https://thingofthings.substack.com/p/you-will-not-be-a-member-of-the-permanent
1•paulpauper•34m ago•1 comments

Why reviewing AI-generated code is devilishly hard

https://www.spinellis.gr/blog/20260523/
2•DSpinellis•40m ago•0 comments

The Forgotten Art of the LAN Party (2023)

https://www.superjumpmagazine.com/the-forgotten-art-of-the-lan-party/
1•susam•42m ago•0 comments

Italian authorities shut down major streaming piracy network

https://www.engadget.com/2180075/italian-authorities-shut-down-major-streaming-piracy-network-cin...
3•01-_-•46m ago•0 comments

ANCI: The Agent Infrastructure for Scheduling

https://meetanci.com
2•rajl•47m ago•0 comments

What's in a Codebase?

https://www.moderndescartes.com/essays/codebase_spec/
2•brilee•47m ago•0 comments

Elon, stop trying to make Grok happen

https://www.theverge.com/ai-artificial-intelligence/936219/elon-stop-trying-to-make-grok-happen
4•01-_-•48m ago•2 comments

Verytis – shared error memory for AI coding agents (MCP)

https://www.verytis.com
1•TychiqueY•48m ago•0 comments

Show HN: A satirical idle game about running an AI startup

https://game.trae.academy/
4•haebom•48m ago•0 comments

Show HN: Running BitNet b1.58 inside DRAM by breaking DDR4 timing rules

1•pcdeni•49m ago•0 comments

A Mysterious Children's Search Engine Is Misleading Kids

https://www.city-journal.org/article/kiddle-search-engine-kids
3•bushwart•50m ago•0 comments

NeuralNote

https://github.com/DamRsn/NeuralNote
1•hyperific•52m ago•0 comments

Kanban board web app powered by the Redmine API

https://ricardoborges.github.io/RedKanban/
1•r2ob•52m ago•0 comments