Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•11mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extensible across domains (math, retrieval, reasoning, tool use, etc.).

What's in Zeno for now?

- Auditable, stateless reward functions for Python code: docstrings, ruff linting, type hints, recursion, and more
- Works directly with Hugging Face's TRL or any RL loop: plug reward functions in as needed
- MIT licensed and minimal
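To make "stateless, verifiable reward" concrete, here is a minimal sketch of a docstring check. It is not taken from Zeno's codebase: the function name `docstring_reward` and the scoring rule (fraction of functions that carry a docstring) are assumptions for illustration. The signature, a list of completion strings in and a list of floats out, is also an assumption, chosen so the function can be dropped into a generic RL loop.

```python
import ast


def docstring_reward(completions, **kwargs):
    """Score each completion by the fraction of its functions that have a docstring.

    Hypothetical example, not Zeno's API. Deterministic and stateless:
    the same input string always yields the same reward.
    """
    rewards = []
    for code in completions:
        try:
            tree = ast.parse(code)
        except (SyntaxError, ValueError):
            # Code that does not parse earns no reward.
            rewards.append(0.0)
            continue

        funcs = [
            node
            for node in ast.walk(tree)
            if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
        ]
        if not funcs:
            # Nothing to document, so nothing to reward.
            rewards.append(0.0)
            continue

        documented = sum(1 for f in funcs if ast.get_docstring(f))
        rewards.append(documented / len(funcs))
    return rewards
```

Because the check only parses the text and never executes it, the score can be audited by reading the completion alongside the function.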

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning, and agentic behaviors are on the to-do list.

Repo: https://github.com/think-a-tron/zeno

Docs and more details are in the README.

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.
