Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•1y ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extensible across domains (math, retrieval, reasoning, tool use, etc.).

What's in Zeno for now?
- Auditable, stateless reward functions for Python code: docstrings, ruff linting, type hints, recursion, and more
- Works directly with Hugging Face's TRL or any RL loop: plug reward functions in as needed
- MIT licensed and minimal
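To make "stateless, deterministic reward function" concrete, here is a minimal sketch of a docstring-coverage reward. It is not taken from Zeno: the name `docstring_reward` and the scoring rule are assumptions for illustration. It follows the calling convention TRL's GRPOTrainer uses for custom reward functions (completions passed as a keyword argument, one float returned per completion) and assumes each completion is a plain string of Python source.

```python
import ast


def docstring_reward(completions, **kwargs):
    """Score each completion by the fraction of its functions that have a docstring.

    Hypothetical example, not Zeno's API. Returns one float in [0, 1] per
    completion. Code that does not parse, or that defines no functions,
    scores 0.0.
    """
    rewards = []
    for source in completions:
        try:
            tree = ast.parse(source)
        except (SyntaxError, ValueError):
            # Unparseable output earns nothing; no state is kept between calls.
            rewards.append(0.0)
            continue

        functions = [
            node
            for node in ast.walk(tree)
            if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
        ]
        if not functions:
            rewards.append(0.0)
            continue

        documented = sum(1 for fn in functions if ast.get_docstring(fn))
        rewards.append(documented / len(functions))
    return rewards
```

Because the score depends only on the text of the completion, the same input always yields the same reward, and a surprising value can be audited by reading the function. In TRL, a function like this would be passed in the `reward_funcs` list of `GRPOTrainer`, alongside any other reward functions.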

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning, and agentic behaviors are on the to-do list.

Repo: https://github.com/think-a-tron/zeno

Docs and more details are in the README.

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.
