frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•11mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

The Queen's Duck

https://bwiggs.com/notebook/queens-duck/
1•SEJeff•49s ago•0 comments

US clears H200 chip sales to 10 China firms as Nvidia CEO looks for breakthrough

https://www.reuters.com/business/retail-consumer/us-clears-h200-chip-sales-10-china-firms-nvidia-...
3•layer8•2m ago•0 comments

Texas county pauses data center construction in rural areas

https://www.texastribune.org/2026/05/12/texas-hill-county-approves-data-center-construction-pause...
2•gmays•4m ago•0 comments

Deal reached with hackers to delete data stolen from the Canvas platform

https://www.nbcnews.com/tech/tech-news/deal-reached-hackers-delete-data-stolen-canvas-educational...
3•fortran77•5m ago•1 comments

Show HN: Nanci, CI written in plain Python, locally debuggable

https://nanci.dev/
1•Hex08•5m ago•0 comments

One in seven in UK prefer consulting AI chatbots to seeing doctor, study finds

https://www.theguardian.com/society/2026/may/13/one-in-seven-prefer-ai-chatbots-to-seeing-doctor-...
2•Brajeshwar•5m ago•0 comments

Blanet

https://en.wikipedia.org/wiki/Blanet
1•JumpCrisscross•7m ago•0 comments

A field manual for Deutsche Bahn

https://blog.hofstede.it/a-field-manual-for-three-years-on-deutsche-bahn/
1•fanf2•8m ago•0 comments

Plasma secrets: Windows position for naughty apps

https://www.dedoimedo.com/computers/plasma-window-position-2026.html
2•speckx•10m ago•0 comments

World's first laughing gas breathalyser trialled in England

https://news.sky.com/video/worlds-first-laughing-gas-breathalyser-trialled-in-england-13544036
1•austinallegro•10m ago•0 comments

Austin's population tops 1M residents for the first time

https://www.statesman.com/business/article/austin-population-tops-1-million-22258805.php
2•_JamesA_•10m ago•0 comments

Celebrating 10 Years of the MITx MicroMasters Programs

https://impact-openlearning.mit.edu/celebrating-10-years-of-the-mitx-micromasters-programs
1•raybb•11m ago•0 comments

GitHub Copilot's new desktop app

https://github.com/github/app
1•prosim•12m ago•1 comments

Bun's Rust rewrite has been merged

https://old.reddit.com/r/rust/comments/1tcrmjs/rewrite_bun_in_rust_has_been_merged/
4•ale•12m ago•0 comments

AI, open code and vulnerability risk in the public sector (UK)

https://www.gov.uk/guidance/ai-open-code-and-vulnerability-risk-in-the-public-sector
1•RobinL•15m ago•0 comments

How the Bird Eye Was Pushed to an Evolutionary Extreme

https://www.quantamagazine.org/how-the-bird-eye-was-pushed-to-an-evolutionary-extreme-20260513/
2•Brajeshwar•15m ago•0 comments

Why Do We Interface?

https://whydoweinterface.com/
2•structuredPizza•16m ago•0 comments

Jane Street Interview Simulator

https://janestreet.gg/
1•Jeanbu•16m ago•0 comments

A Single Infusion Could Suppress HIV for Years

https://www.nytimes.com/2026/05/11/health/hiv-infusion-immunotherapy.html
1•gmays•16m ago•0 comments

Discover Crosspad the best finger drumming web app

https://crosspad.app/
1•Brosper•21m ago•0 comments

Physics Guarantees the Datasphere Keeps Expanding (and What It Means for Agents)

https://twitter.com/i/status/2054961517767061668
1•dataranger•22m ago•0 comments

Show HN: BlitzGraph – Supabase for graphs, designed for LLM agents

https://blitzgraph.com
1•lveillard•23m ago•0 comments

Ambient Intents

https://xcancel.com/timourxyz/status/2054589504934273373
1•yurivish•23m ago•0 comments

Cannabis and driving? Studies reveal big risks

https://news.cuanschutz.edu/news-stories/cannabis-and-driving-studies-reveal-big-risks
3•PaulHoule•23m ago•0 comments

AI models are being used to predict conflict

https://www.economist.com/science-and-technology/2026/05/13/ai-models-are-being-used-to-predict-c...
2•Brajeshwar•24m ago•0 comments

Entire - How We Improved Agentic Search

https://entire.io/blog/improving-agentic-search-in-coding-agents
1•tanishqkanc•25m ago•0 comments

Claude Code cost observability to prevent tokenmaxxing

https://github.com/delta-hq/cc-ledger
1•tsv650•25m ago•1 comments

Which programming language is fastest?

https://benchmarksgame-team.pages.debian.net/benchmarksgame/index.html
1•tosh•26m ago•0 comments

Synthetic evaluation datasets for testing AI agents before production deployment

https://paixblox.github.io/learned/
1•cemillxchange•26m ago•0 comments

What's in a GGUF, besides the weights – and what's still missing?

https://nobodywho.ooo/posts/whats-in-a-gguf/
2•bashbjorn•28m ago•0 comments