frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•11mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Who Knew? 1 in 5 Americans Are Convinced They're Psychic

https://studyfinds.com/1-in-5-americans-convinced-theyre-psychic/
1•t-3•10s ago•0 comments

Esp-Claw: Chat Coding Edge AI Agent Framework for IoT

https://esp-claw.com/en/
1•hasheddan•14s ago•0 comments

AI Agents Are Selfish and Biology Solved It

https://eversole.dev/blog/signaling-is-the-intelligence/
1•kennethops•1m ago•0 comments

I ran a Hormuz Crisis emergent SIM: AIs started lying to hide a stalemate

3•vinserello•1m ago•1 comments

Artemis 2's Heat Shield Performed as Expected: First Results Are In

https://gizmodo.com/so-how-did-artemis-2s-heat-shield-hold-up-the-first-results-are-in-2000749198
1•bookofjoe•1m ago•0 comments

Sys. Review: The Impact of Covid-19 Vaccination on Myocarditis Risk and Recovery

https://www.mdpi.com/2039-7283/16/4/77
1•cratermoon•2m ago•0 comments

Gemini Enterprise Agent Platform, powering the next wave of agents

https://cloud.google.com/blog/products/ai-machine-learning/introducing-gemini-enterprise-agent-pl...
1•xnx•3m ago•0 comments

I refuse to play the imitation game

https://einarwh.no/blog/2026/04/15/i-refuse-to-play-the-imitation-game/
1•speckx•3m ago•0 comments

We discovered the speed limit of arithmetic – and broke it

https://www.newscientist.com/article/2521354-how-we-discovered-the-speed-limit-of-arithmetic-and-...
1•Brajeshwar•4m ago•0 comments

Kazam – my answer to static sites in the age of Claude being my main author

https://tdiderich.github.io/kazam/index.html
1•tylerdiderich•4m ago•1 comments

GPT Image 2 is here in Samsar T2V agent

https://www.samsar.one/blog/gpt-image-2-is-here-we-tried-giving-it-some-of-the-hardest-battles/
2•proy24•4m ago•1 comments

Is Claude Code going to cost $100/month? Probably not–it's all confusing

https://simonwillison.net/2026/Apr/22/claude-code-confusion/
1•gmays•4m ago•0 comments

Scaling Sameness

https://www.gradientinstitute.org/research-publications/scaling-sameness
1•dbaupp•5m ago•0 comments

Non-engineers don't know how to work with agents

https://mrprompty.com/features
1•ViktorPetrov•5m ago•1 comments

Brooks' Surgical Team Model and AI

https://jschof.dev/posts/2026/4/brooks-surgical-team-model-and-ai/
1•babybjornborg•5m ago•0 comments

Treetops glowing during storms captured on film for first time

https://www.psu.edu/news/earth-and-mineral-sciences/story/treetops-glowing-during-storms-captured...
2•t-3•5m ago•0 comments

Geometry Nodes in WebGPU

https://whoisryosuke.com/blog/2026/webgpu-node-graph/
1•juretriglav•6m ago•0 comments

Switching from Uv to PDM

https://stuartm.nz/2026/04/pdm-rocks/
1•birdculture•8m ago•0 comments

The Scraping Wiki: An LLM-maintained knowledge base indexing 400 articles

https://github.com/TheWebScrapingClub/scraping-wiki/blob/main/index.md
1•PigiVinci83•8m ago•0 comments

Crypto billionaire [Justin Sun] sues Trump-linked project alleging extortion

https://www.msn.com/en-us/money/companies/crypto-billionaire-sues-trump-linked-project-alleging-e...
2•bhouston•11m ago•1 comments

TeamPCP strikes again: Xinference (v2.6.0-2.6.2) PyPI package compromised

https://research.jfrog.com/post/xinference-compromise/
1•lukecarr•11m ago•1 comments

Books Are Not Remotely Too Expensive

https://www.millersbookreview.com/p/no-books-are-not-remotely-too-expensive
2•gHeadphone•12m ago•0 comments

Early data from Vera C. Rubin Observatory reveals over 11,000 new asteroids

https://phys.org/news/2026-04-early-vera-rubin-observatory-reveals.html
1•speckx•12m ago•0 comments

Can't wait to see this movie

https://www.youtube.com/watch?v=H-43VeYGiPM
1•alvineh•12m ago•0 comments

Python suite for neuroscience research across all modalities

https://github.com/facebookresearch/neuroai
1•MADEinPARIS•14m ago•0 comments

Qwen3.6-27B: Flagship-Level Coding in a 27B Dense Model

https://qwen.ai/blog?id=qwen3.6-27b
1•mfiguiere•14m ago•0 comments

The Single Dumbest Conspiracy Theory of 2026

https://www.theatlantic.com/science/2026/04/missing-scientists/686885/
2•Jtsummers•15m ago•0 comments

Larry McMurtry's Tall Tales

https://www.thenation.com/article/culture/larry-mcmurtry-biography/
1•samclemens•15m ago•0 comments

Google Cloud Fraud Defense, the Next Evolution of reCAPTCHA

https://cloud.google.com/blog/products/identity-security/introducing-google-cloud-fraud-defense-t...
1•throwaway29303•16m ago•0 comments

Qwen3.6-27B

https://twitter.com/i/status/2046939764428009914
1•NiekvdMaas•20m ago•0 comments