frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•1y ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Top Bank Regulator Spoke at an Invite-Only Client Dinner After Fed Meeting

https://www.wsj.com/economy/central-banking/top-bank-regulator-spoke-at-an-invite-only-client-din...
1•petethomas•2m ago•0 comments

Shrinking a NixOS ISO from 458 MiB to 183 MiB

https://natkr.com/2026-06-19-nixos-but-smol/
1•logickkk1•2m ago•0 comments

Pre-Modern Armies for Worldbuilders: Officials, Contractors and Professionals

https://acoup.blog/2026/06/19/collections-pre-modern-armies-for-worldbuilders-part-iib-officials-...
3•JohnHammersley•3m ago•0 comments

Trust Identity Protocol v1.0, an open post-quantum standard for human identity

https://theailab.org/whitepaper
1•dineshmendhe•4m ago•0 comments

Ask HN: Do you find it annoying to share your availability over text/DMs?

2•siddhibee•4m ago•0 comments

Zenzizenzizenzic

https://en.wikipedia.org/wiki/Zenzizenzizenzic
5•gyosifov•7m ago•0 comments

Ask HN: Do you use Apache Iceberg in Production?

1•vira28•11m ago•0 comments

Optimizing for Decision Points

https://narphorium.com/blog/decision-points/
1•azhenley•12m ago•0 comments

The data black hole at the center of AI

https://www.dwarkesh.com/p/the-sample-efficiency-black-hole-2
1•jedixit•12m ago•0 comments

The Wrong End of the Problem

https://schrottner.at/2026/06/18/The-Wrong-End-of-the-Problem.html
1•aepfli•15m ago•0 comments

OCaml 5.5.0

https://discuss.ocaml.org/t/ocaml-5-5-0-released/18265
3•cod1r•18m ago•0 comments

Gemini CLI vs. Claude Code: Why agent capabilities matter more than prompts

https://imaxxs.com/behavioral-induction-capabilities-shape-execution
1•imaxxs•18m ago•0 comments

HN: A free book on how "AI easy money" scams work

https://singularityforge.space/2026/06/20/di-and-easy-money-on-the-edge-of-a-myth/
1•Voice_of_Void•18m ago•0 comments

Cervical cancer mortality trends after HPV vaccination in England 2001–24

https://www.thelancet.com/journals/lancet/article/PIIS0140-6736(26)00918-9/fulltext
4•bookofjoe•18m ago•0 comments

Europe buys the future, America builds it

https://economist.com/finance-and-economics/2026/06/18/europe-buys-the-future-america-builds-it
1•andsoitis•19m ago•0 comments

Google Is Using Nvidia's Playbook to Build a Rival AI Chip Business

https://www.wsj.com/tech/ai/google-is-using-nvidias-playbook-to-build-a-rival-ai-chip-business-1e...
2•Brajeshwar•24m ago•1 comments

Show HN: SkillsGuard – static scanner for malicious AI agent skills

https://github.com/Teycir/SkillsGuard
1•teycir•24m ago•0 comments

Computer Vision – Lecture 1.1 (Introduction: Organization) [video]

https://www.youtube.com/watch?v=YW1cIaOBkI8
1•skidrow•25m ago•0 comments

Anthropic Lacks Emotional Intelligence

https://www.lawfaremedia.org/article/anthropic-lacks-emotional-intelligence
1•ano-ther•26m ago•0 comments

Get notified the second Fable 5 comes back

https://fablewatch.com
1•markksantos•27m ago•0 comments

What happens if the EU loses the AI race [video]

https://www.youtube.com/watch?v=uVPoq1Svz7g
1•SpyCoder77•31m ago•0 comments

Show HN: Konxios a local first AI OS that connects LM Studio, Ollama and cloud

https://konxios.com
1•ifrosted•35m ago•0 comments

Eerie landscape of 100.000 monoliths made with fable 5(when it was available)

https://sand-morph.up.railway.app/atlas-of-quiet-monoliths
1•echohive42•37m ago•0 comments

The science behind Qantas' bet on 20-hour flights

https://www.reuters.com/business/aerospace-defense/sleep-light-science-behind-qantas-bet-20-hour-...
2•1659447091•38m ago•0 comments

Shellular: Run agents, terminals and browser DevTools from your phone

https://shellular.dev/
3•aquajet•41m ago•0 comments

GLM-5.2 vs. Claude Opus 4.8: Full Comparison

https://llm-stats.com/blog/research/glm-5-2-vs-claude-opus-4-8
3•gjvc•45m ago•0 comments

An offline earthquake monitor that detects P-waves before S-waves

https://play.google.com/store/apps/details?id=com.muhsin.mkdepremmonitor&hl=en_US
2•Muhsinkosoglu•48m ago•0 comments

How to Get Fired Using Switch Statements and Statement Expressions (2016)

https://blog.robertelder.org/switch-statements-statement-expressions/
1•downbad_•48m ago•0 comments

Show HN: Let agents send/receive SMS using your old Android phone

https://simlink.snaz.cz/
1•LukBrezina•50m ago•0 comments

HN with Pics

https://hn.is-ai-good-yet.com/
1•ilyaizen•51m ago•1 comments