Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•1y ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extensible across domains (math, retrieval, reasoning, tool use, etc.).

What's in Zeno for now?
- Auditable, stateless reward functions for Python code: docstrings, ruff linting, type hints, recursion, and more
- Works directly with Hugging Face's TRL or any RL loop: plug reward functions in as needed
- MIT licensed and minimal
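To make "stateless, deterministic reward function" concrete, here is a minimal sketch of a docstring check. It is not Zeno's actual code: the name `docstring_reward` and the scoring rule (fraction of functions that have a docstring) are assumptions made for illustration. It only assumes that completions arrive as plain strings and that the RL loop expects one float per completion.

```python
import ast


def docstring_reward(completions, **kwargs):
    """Score each completion by the fraction of its functions that have a docstring.

    Hypothetical example, not part of Zeno. Completions that fail to parse,
    or that define no functions, score 0.0.
    """
    rewards = []
    for text in completions:
        try:
            tree = ast.parse(text)
        except (SyntaxError, ValueError):
            # Unparseable output gets no reward.
            rewards.append(0.0)
            continue
        funcs = [
            node
            for node in ast.walk(tree)
            if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
        ]
        if not funcs:
            rewards.append(0.0)
            continue
        documented = sum(1 for f in funcs if ast.get_docstring(f))
        rewards.append(documented / len(funcs))
    return rewards
```

Because the function keeps no state and depends only on its input, the same completion always gets the same score, which is what makes the reward auditable. A callable of this shape (a list of completions in, a list of floats out, extra keyword arguments ignored) should be usable as a custom reward in TRL's `GRPOTrainer` via its `reward_funcs` argument, assuming the dataset uses plain-text rather than conversational completions.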

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning, and agentic behaviors are on the to-do list.

Repo: https://github.com/think-a-tron/zeno

Docs and more details are in the README.

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.
