frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•7mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Recaf – A Java Bytecode Editor

https://recaf.coley.software/home.html
1•0x54MUR41•2m ago•0 comments

Show HN: URL Shortener Jo4.io

https://jo4.io/a/hackernews
1•anandchakru•10m ago•0 comments

Show HN: NpgsqlRest Automatic PostgreSQL Web Server

https://npgsqlrest.github.io/
1•vbilopav•12m ago•0 comments

Data Structures in Practice

https://github.com/djiangtw/data-structures-in-practice-public
1•atomlib•18m ago•0 comments

Minish – A Property-Based Testing Framework for Zig

https://github.com/CogitatorTech/minish
1•TheWiggles•18m ago•0 comments

Why Your AI Agents Are Hallucinating (and How to Stop It)

https://noveum.ai/en/blog/why-your-ai-agents-are-hallucinating-and-how-to-stop-it
1•imshashank•22m ago•1 comments

Fucking Approachable Swift Concurrency

https://fuckingapproachableswiftconcurrency.com/en/
2•todsacerdoti•26m ago•0 comments

The 9th Circuit Upholds Professor's Right to Mock 'Land Acknowledgments'

https://reason.com/2025/12/22/the-9th-circuit-upholds-a-university-of-washington-professors-right...
3•osnium123•31m ago•0 comments

CEO blasts companies with billions in funding but zero revenue

https://fortune.com/2025/12/24/databricks-ceo-ali-ghodsi-bubble-insane-zero-revenue-ai-circular/
2•kermatt•34m ago•1 comments

Tyler Cowen, the man who wants to know everything

https://www.economist.com/1843/2025/02/28/tyler-cowen-the-man-who-wants-to-know-everything
2•nanfinitum•36m ago•1 comments

How to copy the current Christmas CSS of HN?

https://news.ycombinator.com/
1•brightmood•37m ago•1 comments

More App Store Ad Spots

https://mjtsai.com/blog/2025/12/24/more-app-store-ad-spots/
2•ksec•40m ago•1 comments

Yarbo's Pop-Up Signals the Future of Smart Snow Tech

1•darius88•43m ago•1 comments

Mercury – Multimodal Drone

https://mercuriustech.com/mercury/
1•thunderbong•48m ago•0 comments

Show HN: Rhettilator – An exact-fraction calculator in base 360

https://the-rhettilator-9352543e.base44.app/
1•AllSeenEye•50m ago•1 comments

WhatIFF?, a modern Amiga Guide magazine for creative Amiga users

https://www.whatiff.info/
1•nickt•53m ago•0 comments

Command Line Interface Guidelines

https://clig.dev/
1•vinhnx•54m ago•1 comments

Show HN: Got tired of searching for AI news daily so I built my own AI news page

https://dreyx.com/
1•lilsquid•54m ago•1 comments

Creating General User Models from Computer Use

https://arxiv.org/abs/2505.10831
1•handfuloflight•55m ago•0 comments

Show HN: Web playground for Qwen-Image-Edit-2511

https://z-image.app/ja/models/qwen-image-edit-2511
1•yeekal•56m ago•0 comments

The Frontend Auth Middleware: Cross-Origin Iframes Without Third-Party Cookies

https://seg6.space/posts/the-frontend-auth-middleware/
2•seg6•57m ago•0 comments

Why I'm Treating Health as Infrastructure

https://healthasinfrastructure.substack.com/p/why-im-treating-health-as-infrastructure
1•zekrom•58m ago•0 comments

Show HN: Claude Code in Cursor

https://github.com/mergd/ccproxy
1•wyxuan•1h ago•0 comments

Is the Dictionary Done For?

https://www.newyorker.com/magazine/2025/12/29/unabridged-the-thrill-of-and-threat-to-the-modern-d...
2•mitchbob•1h ago•1 comments

Tinyfront

http://tinyfront.mooo.com/
2•pabs3•1h ago•0 comments

Show HN: Secret MCP: Let AI write your .env files without seeing your secrets

https://github.com/AKarenin/Secret-mcp
2•akarenin•1h ago•0 comments

Husqvarna 350 iB Leaf Blower Running VESC with 2070 Wh Battery [video]

https://www.youtube.com/watch?v=Q8c5QOmafpw
1•ProllyInfamous•1h ago•2 comments

Words Matter: Alternatives for Charged Terminology in the Computing Profession

https://www.acm.org/diversity-inclusion/words-matter
1•linguae•1h ago•5 comments

What's New in Ruby 4.0

https://nithinbekal.com/posts/ruby-4-0/
2•nithinbekal•1h ago•0 comments

Python-Tiny-HTTP-Server

https://github.com/johann-petrak/python-tiny-http-server
1•kamaraju•1h ago•0 comments