frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•1y ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Tokenminning: Because Tokenmaxxing Is a Bad Idea

https://www.tokenminning.com/
1•robmay•33s ago•0 comments

Show HN: Webhix – Self-hosted webhook.site alternative in a single Go binary

https://github.com/GaIsBax/Webhix
1•Joseph_SPF•3m ago•0 comments

Show HN: RainBreak App – The AI doesn't need a break. But you do

https://rainbreak.franzai.com/
1•franze•4m ago•0 comments

Treating LLMs as Programming Books

https://jola.dev/posts/treating-llms-as-programming-books
2•shintoist•4m ago•0 comments

Linear Tape-Open

https://en.wikipedia.org/wiki/Linear_Tape-Open
1•CGMthrowaway•5m ago•0 comments

Repro-Bot, our GitHub issue triage agent

https://www.metabase.com/blog/reprobot-github-issue-triage-agent
2•mooreds•5m ago•0 comments

Toyota uses superconducting motor in race for first time

https://www.asahi.com/ajw/articles/16626731
1•Tor3•6m ago•0 comments

Lego Education SPIKE portfolio retiring

https://education.lego.com/en-us/spike-update-2026/
2•etruong42•7m ago•0 comments

Reorgs Happen

https://ben.balter.com/2026/06/07/reorgs-happen/
1•mooreds•7m ago•0 comments

Premature Optimization is Fun Sometimes (2025)

https://invlpg.com/posts/2025-06-19-premature-optimization.html
1•birdculture•8m ago•0 comments

Steve Yegge

https://yegge.ai/
1•tosh•12m ago•1 comments

AI Traffic Grew 6.5x Faster Than Human Traffic This Year

https://www.fastly.com/blog/ai-traffic-grew-6-5x-faster-than-human-traffic-this-year
1•HieronymusBosch•12m ago•0 comments

Non-Alcoholic Beer Sold Out Before the Booze

https://medium.com/@dmitry_titov/spatanism-a-photo-zone-with-coffins-and-wrestling-how-i-attended...
1•Dmitry_Titov•13m ago•0 comments

Is music a distraction for my teenager while they revise?

https://www.bbc.co.uk/bitesize/articles/zx228p3
1•mmarian•14m ago•0 comments

xAI Taps Starlink Staffer to Run Grok Training Team

https://www.bloomberg.com/news/articles/2026-06-09/musk-s-xai-taps-starlink-staffer-to-run-grok-t...
1•petethomas•14m ago•0 comments

Show HN: Standing Questions – agent memory that stores questions, not answers

https://github.com/Rocco-alt/standing-questions
1•Kadiwar•17m ago•0 comments

Visualizing and identifying electrophysiological cell types in vivo

https://www.nature.com/articles/s41467-026-71331-0
1•PaulHoule•17m ago•0 comments

Loop Engineering

https://addyo.substack.com/p/loop-engineering
1•RyeCombinator•17m ago•0 comments

Career Isn't Eroding – You're Just Holding the Wrong Moat

https://blog.herlein.com/post/domain-plus-software-superpower/
3•speckx•17m ago•1 comments

Why SQLite succeeded as a database (2016)

https://changelog.com/podcast/201
2•downbad_•19m ago•0 comments

Show HN: First Batch – A pay-per-campaign QA testing sandbox

https://firstbatch.io/
1•bryden_cruz•20m ago•0 comments

Maplid: Place identification using data supplied by mobile network operators

https://www.tandfonline.com/doi/full/10.1080/13658816.2026.2617932
2•PaulHoule•21m ago•0 comments

Federal judge strikes down $100k fee on new H-1B visas

https://www.npr.org/2026/06/09/nx-s1-5851474/federal-judge-fee-h1b-visa
1•geox•23m ago•0 comments

Donut Lab's solid-state battery claim debunked by Ziroth

https://www.theverge.com/science/946608/donut-labs-debunk-solid-state-battery
2•timpera•24m ago•0 comments

Exposing the Solid State Donut Battery. It's over [video]

https://www.youtube.com/watch?v=j5oyVNjrUPI
2•xbmcuser•24m ago•0 comments

How to Use Claude Better Than 90% of Marketers

https://aiforcontentmarketing.ai/how-to-use-claude-better-than-90-of-marketers/
1•pakostina•25m ago•0 comments

DeepSeekV4 1.6T Day 0 to Day 43 Performance Over Time

https://newsletter.semianalysis.com/p/deepseekv4-16t-day-0-to-day-43-performance
1•nsoonhui•25m ago•0 comments

NomadTracks – A GPS tracker that syncs through your own iCloud, no server

https://apps.apple.com/us/app/nomadtracks/id6764303399
1•donkeycat•25m ago•0 comments

Should You Mock the Database?

https://dominik.info/blog/mocking-the-database
2•EspressoGPT•26m ago•0 comments

Show HN: OfferUnlock – Check if your tech job offer is underpaid

https://offerunlock.app
2•tino8383•26m ago•0 comments