frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•6mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Ask HN: Health/Ocean builders: What problem would you pay to get off your plate?

1•Frannky•1m ago•0 comments

Show HN: UK Butchers Meat Price Tracker

https://offer-spider.onrender.com
1•wolfer•1m ago•0 comments

Show HN: WineBar: A yet another Wine prefix manager, with Asahi Linux support

https://github.com/Tulon/WineBar
1•JosifA•2m ago•0 comments

Skills-kit/Framework for AI-generated, testable automation skills for every LLM

https://github.com/gabrielekarra/skills-kit
1•gabrielekarra•4m ago•1 comments

Our effort to improve the Mintlify assistant

https://www.mintlify.com/blog/assistant-improvements
1•samuel246•5m ago•0 comments

One Agent Isn't Enough

https://benr.build/blog/one-agent-isnt-enough
1•bisonbear•6m ago•0 comments

10K Docker images spray live cloud creds across the internet

https://www.theregister.com/2025/12/11/docker_hub_secrets_leak/
1•Brajeshwar•7m ago•0 comments

New window insulation blocks heat, but not your view

https://techxplore.com/news/2025-12-window-insulation-blocks-view.html
1•Brajeshwar•7m ago•0 comments

Show HN: Solodash – A single player, Balderdash-style daily word game

https://solodash.net
1•Nathanadian•8m ago•0 comments

Playing Santa Does Things to a Man. What It Did to Bob Rutan Was Even Stranger

https://www.esquire.com/lifestyle/a69597294/santaland-bob-rutan/
1•Lightbody•8m ago•0 comments

Show HN: Kinkora – A creative playground for experimenting with video models

https://kinkora.fun/
5•heavenlxj•14m ago•1 comments

Show HN: Oscilla – Free ngrok alternative for sharing localhost on the 'net

https://oscilla.tech/
1•clowerweb•14m ago•1 comments

What NeurIPS shows about where robotics and physical AI research is flourishing

https://robotsandstartups.substack.com/p/increased-focus-on-robotics-at-neurips
1•robotlaunch•18m ago•0 comments

Elegant Types in Ruby

https://github.com/low-rb/low_type
2•x3qt•22m ago•0 comments

Multiple Indicted on Charges of Theft and Re-Sale of Restaurant Cooking Oil

https://www.justice.gov/usao-sdia/pr/multiple-chinese-nationals-indicted-charges-related-theft-an...
6•737min•23m ago•1 comments

Science sleuths raise concerns about scores of bioengineering papers

https://www.nature.com/articles/d41586-025-03870-3
2•bookofjoe•24m ago•1 comments

Show HN: AI Fiction Duel – adversarial storytelling structure for LLMs

https://aifictionduel.com/
1•pfeaster•24m ago•0 comments

Molecular Effects of Indoor Tanning

https://www.science.org/doi/10.1126/sciadv.ady4878
1•thunderbong•26m ago•0 comments

Name your projects something fun

https://substack.com/inbox/post/181413808
2•lazy_afternoons•30m ago•1 comments

Show HN: Logforth, A versatile and extensible Rust logging framework

https://github.com/fast/logforth
1•tison•30m ago•0 comments

"Gentle" Lecture on the Wigner's Semicircle Law in Python Package Leymosun

https://github.com/msuzen/leymosun/blob/main/lectures/wigner_semicircle.ipynb
1•northlondoner•32m ago•0 comments

Supervisor Jackie Fielder is about to ban all new R&D in the Mission SF

https://twitter.com/terronk/status/1999532633097712090
3•donsupreme•33m ago•0 comments

Hacking Chromium Source Code: Replace DevTools HTTP Handler with Redis PubSub

https://www.deadf00d.com/post/chromium-pub-sub-redis.html
2•deadf00d•35m ago•0 comments

Palmeiras and Flamengo became South America's football superpowers

https://www.theguardian.com/football/2025/nov/28/how-palmeiras-and-flamengo-became-south-americas...
2•PaulHoule•35m ago•0 comments

Meta "deletes" years of conversations with friends/family

4•effectkai•36m ago•1 comments

I built this free app Word monitor, check it out

https://catchwords-app.onrender.com
1•ardi_c_cc•36m ago•0 comments

Undefinable yet Indispensable

https://aeon.co/essays/the-word-religion-resists-definition-but-remains-necessary
2•Brajeshwar•37m ago•0 comments

Building the Weir Language

https://elijahpotter.dev/articles/building-the-weir-language
1•chilipepperhott•38m ago•0 comments

Silk Road-linked Bitcoin wallets move $3M to new address

https://cointelegraph.com/news/silk-road-wallets-transfer-3m-bitcoin-new-address
2•flipped•39m ago•1 comments

Ad-Free Social Media

https://treechat.com
1•mitya777•41m ago•0 comments