frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•7mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

A Multitasker's Guide to Regaining Focus [2024]

https://www.nytimes.com/2024/03/11/well/mind/multitasking-tips.html
1•vinni2•3m ago•0 comments

The Destruction of Human Potential

https://rodgercuddington.substack.com/p/the-destruction-of-human-potential
1•freespirt•5m ago•0 comments

Evaluating Deep Agents: Our Learnings

https://twitter.com/langchainai/status/2006589207196930109
1•Anon84•7m ago•0 comments

Last Orders, London? A fifth of London's pubs have closed in the last 20 years

https://www.nytimes.com/2025/12/31/opinion/london-pubs.html
1•smurda•8m ago•0 comments

Everything I read, watched, and played in 2025

https://www.taranusaur.us/media
1•happyraul•9m ago•0 comments

Take a tour of the Antarctica-bound icebreaker. It has a gym

https://www.nytimes.com/live/2025/climate/antarctica-thwaites-glacier/tour-antarctica-icebreaker
1•fleahunter•11m ago•0 comments

Haters Are the Best Marketers: Mentava as Case Study

https://twitter.com/NielsHoven/status/2006404645078401195
1•barry-cotter•13m ago•0 comments

Even the Sky May Not Be the Limit for A.I. Data Centers

https://www.nytimes.com/2026/01/01/technology/space-data-centers-ai.html
3•fleahunter•13m ago•0 comments

Authenticity After Abundance

https://www.threads.com/@mosseri/post/DS76UiklIDf
1•admp•18m ago•0 comments

When AI Fails: Reasoning Visibility and Governance in Regulated Systems

https://zenodo.org/records/18114669
1•businessmate•20m ago•1 comments

If childhood is half of subjective life, how should that change how we live?

https://moultano.wordpress.com/2025/12/30/children-and-helical-time/
3•moultano•21m ago•0 comments

Show HN: Impostor Juego Online – Juega Gratis Al Juego Del Impostor

https://impostorjuego.org/
1•tomstig•23m ago•0 comments

Labour's Employment Cost Crisis

https://rodgercuddington.substack.com/p/labours-employment-cost-crisis-the
1•freespirt•29m ago•0 comments

All of Advent of Code 2025 in SQLite [video]

https://www.youtube.com/watch?v=PGuruDhK-YA
2•vismit2000•30m ago•0 comments

Performance Evaluation of Brokerless Messaging Libraries

https://arxiv.org/abs/2508.07934
1•tosh•30m ago•0 comments

NYC mayoral inauguration bans Flipper Zero, Raspberry Pi devices

https://www.bleepingcomputer.com/news/security/nyc-mayoral-inauguration-bans-flipper-zero-raspber...
1•taubek•32m ago•0 comments

Static Protocols in Python: Behaviour over Inheritance

https://patrickm.de/static-protocols-in-python/
2•sneakyPad•33m ago•1 comments

What if the world is made of cubes? Uncovering the universal geometry of geology

https://www.quantamagazine.org/scientists-uncover-the-universal-geometry-of-geology-20201119/
3•fanf2•33m ago•0 comments

A Distributed Systems Reliability Glossary

https://jepsen.io/blog/2025-10-20-distsys-glossary
1•tosh•35m ago•0 comments

Show HN: Phi – A meta-language where grammar = implementation (Cofree-based)

https://github.com/eurisko-info-lab/phi-autonomous
1•eurisko_2026•36m ago•1 comments

The Long Shot – Preventive Health Screening Reminders

https://longshot.invertedpassion.com/
2•twapi•36m ago•0 comments

NATS Messaging

https://en.wikipedia.org/wiki/NATS_Messaging
2•tosh•37m ago•0 comments

Show HN: Replacing $5K industrial signal towers with a webapp [video]

https://www.youtube.com/shorts/t-ROd1hx20I
2•edmundsparrow•42m ago•1 comments

Show HN: CalPal – A browser-based literate calculator with BYOK AI

https://trycalpal.app/
1•s1dd4rth•43m ago•0 comments

Google Co-Founder Sergey Brin's Unretirement Is a Lesson for the Rest of Us

https://www.inc.com/jessica-stillman/google-co-founder-sergey-brins-unretirement-is-a-lesson-for-...
1•iancmceachern•45m ago•1 comments

Can Applications Recover from fsync Failures? (2020)

https://www.usenix.org/conference/atc20/presentation/rebello
2•rdpintqogeogsaa•51m ago•0 comments

Leadership Lab: The Craft of Writing Effectively [video]

https://www.youtube.com/watch?v=vtIzMaLkCaM
2•williamtrask•54m ago•0 comments

Show HN - Automate commit messages with gitz (Rust and AI)

https://github.com/Tenuka22/gitz
2•tenuka_o_22•55m ago•1 comments

Top HN Stories in 2025

https://hn.algolia.com/?dateEnd=1767139200&dateRange=custom&dateStart=1735689600&page=0&prefix=fa...
2•r721•56m ago•0 comments

Show HN: GitHub-style Git activity visualizer for terminal

https://github.com/chaosprint/hindsight
1•chaosprint•1h ago•0 comments