frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•1y ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

TIL: You can make HTTP requests without curl using Bash /dev/TCP

https://mareksuppa.com/til/bash-dev-tcp-http-without-curl/
1•mrshu•58s ago•1 comments

The British Social Media Ban Is Silly

https://dogdogfish.com/blog/2026/06/15/social-media-ban/
1•matthewsharpe3•1m ago•0 comments

British Colombia, Time Zones, and Postgres

https://www.crunchydata.com/blog/british-columbia-and-time-zone-changes
1•winslett•1m ago•0 comments

The AiCopalypse

https://www.objecthunter.net//articles/2026/06/16/the-aicopalypse.html
1•floken•2m ago•1 comments

Sam Bankman-Fried's Prison Experiment

https://nymag.com/intelligencer/article/sam-bankman-fried-prison-donald-trump-pardon-appeal.html
1•aanet•2m ago•0 comments

Show HN: NumUp – A Daily Math Puzzle

https://vasanth.fun/numup/
1•vasanthv•3m ago•0 comments

A Vision for a Rust Formal Specification

https://nadrieril.github.io/blog/2026/06/16/formal-spec-vision.html
1•emschwartz•3m ago•0 comments

The Pokémon Trading Card Game AI Battle Challenge

https://ptcg-abc.pokemon.co.jp/
2•esnard•4m ago•0 comments

Show HN: Memento – Self-hosted agentic search and LLM wiki over your email

3•georgeck•5m ago•0 comments

Show HN: Dino – An AI coding agent that keeps you in the loop

https://smartdino.dev
1•ylian•5m ago•0 comments

The European Commission is turning Google Search into a national-security risk

https://techletters.substack.com/p/the-european-commission-is-turning
1•miohtama•6m ago•0 comments

Snyk VulnBench JavaScript 1.0: Can LLMs Find the Same Bugs Twice?

https://arxiv.org/abs/2606.15762
1•ilreb•9m ago•0 comments

Fake Viral Guitarists Strike Again

https://www.youtube.com/watch?v=-Eik8uBvdxY
1•root-parent•10m ago•0 comments

Hackers Publish Knicks and Madison Square Garden Data Online

https://www.404media.co/hackers-publish-knicks-and-madison-square-garden-data-online/
3•Cider9986•10m ago•2 comments

Biff.fx: lightweight effects system for Clojure

https://biffweb.com/p/fx/
1•jacobobryant•11m ago•0 comments

Yes, we still need engineers

https://mattsayar.com/yes-we-still-need-engineers/
2•MattSayar•11m ago•0 comments

Ask HN: What do you think about blockchain's current trajectory

4•mobear•11m ago•0 comments

Google's Training Supercomputers from TPU v2 to Ironwood: Five Generations

https://arxiv.org/abs/2606.15870
1•matt_d•13m ago•0 comments

Can LLM Agents Infer World Models? Evidence from Agentic Automata Learning

https://arxiv.org/abs/2606.16576
1•Anon84•13m ago•0 comments

Apache Fory Serialization 1.2.0: JDK 25 support without sun.misc.Unsafe

https://github.com/apache/fory/releases/tag/v1.2.0
1•chaokunyang•13m ago•1 comments

Earth's underground fungal network would span 10% of the Milky Way

https://www.livescience.com/planet-earth/plants/earths-underground-fungal-network-is-so-massive-i...
3•gmays•14m ago•0 comments

Qwen-RobotWorld Technical Report

https://arxiv.org/abs/2606.17030
1•ilreb•15m ago•0 comments

JoyAI-VL-Interaction: Real-Time Vision-Language Interaction Intelligence

https://arxiv.org/abs/2606.14777
1•ilreb•15m ago•0 comments

Making ast.walk 220x Faster

https://reflex.dev/blog/why-ast-walk-when-you-can-ast-sprint/
8•palashawas•16m ago•0 comments

A Short Explanation of the Zettelkasten Method

https://isgin01.github.io/posts/explanation-of-zettelkasten/
1•pullshark91•16m ago•0 comments

Robinhood to cut 10% of workforce in restructuring

https://www.reuters.com/sustainability/robinhood-cut-10-its-full-time-workforce-2026-06-16/
2•indiesense•16m ago•0 comments

The ongoing debacle of hiring a fake coworker

1•blinkbat•17m ago•0 comments

Uncritical use of AI causes countrywide scandal at Starbucks Korea

https://www.theguardian.com/world/2026/jun/16/starbucks-korea-shut-all-stores-tank-day-promotionB...
2•Blackthorn•18m ago•0 comments

Agent Architecture Is a Compute Allocation Problem: The Advisor Strategy

https://harrisonsec.com/blog/agent-architecture-compute-allocation-advisor-strategy/
1•gzxharrison001•19m ago•0 comments

How we evaluate our LLM judge

https://build.forus.com/how-we-evaluate-our-llm-judge-a-perturbation-based-approach
2•abeinstein•19m ago•0 comments