frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•11mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Three cruise ship passengers die in suspected hantavirus outbreak

https://www.reuters.com/business/healthcare-pharmaceuticals/three-passengers-dead-one-case-hantav...
2•vld_chk•2m ago•0 comments

No Joke. Unification of (GR/Qt)

https://twitter.com/CTibedo/status/2051071923849723947
1•GeometryKernel•3m ago•0 comments

H4ckf0r0day/obscura: The headless browser for AI agents and web scraping

https://github.com/h4ckf0r0day/obscura
2•rezaprima•7m ago•2 comments

Show HN: Optical Design and Simulation in Matlab

https://www.mathworks.com/help/images/optical-system-design-and-analysis.html
1•ashishuthama•9m ago•0 comments

Could the X-Bat Stealth Fighter Drone Change the Air Combat Game?

https://www.twz.com/air/could-the-x-bat-stealth-fighter-drone-change-the-air-combat-game
1•breve•13m ago•0 comments

I Replaced Kiro with a Free Plugin – Here's What Happened

https://www.getdraft.dev/blog/replaced-kiro-with-free-plugin/
1•mayurpise•14m ago•0 comments

Drones are getting drugs, escape tools and crab legs to inmates

https://www.cnn.com/2026/05/03/us/drone-deliveries-contraband-prison-inmates
1•breve•16m ago•0 comments

Show HN: Cryptographic receipt authority for ISO 20022 financial messages

https://20022validator.com
1•NextGenRails•18m ago•0 comments

Show HN: Stigmem – open-source federated knowledge fabric for AI agents (v1.0)

2•barryjones20•18m ago•0 comments

Agentic Coding Is a Trap

https://larsfaye.com/articles/agentic-coding-is-a-trap
1•ayoisaiah•18m ago•0 comments

Local, deterministic, version-controlled knowledge graph

https://www.getdraft.dev/blog/local-graph-engine/
1•mayurpise•24m ago•0 comments

The PHP License, Simplified

https://ben.ramsey.dev/blog/2026/05/the-php-license-simplified
2•gslin•33m ago•0 comments

Open source intelligence about Palantir

https://palantirwatch.org
1•seb1204•34m ago•0 comments

Using the "Sandwich Method" to Teach Mathematics

https://pikuma.com/blog/sandwich-method-math-education
1•atan2•37m ago•0 comments

Kloak keeps secrets out of your application's memory

https://getkloak.io/blog/kloak-50000-feet-view/
1•spinningfactory•37m ago•0 comments

PyFlue – Python-Native Agent Harness Framework (Python Clone of Flue)

https://super-agentic.ai/pyflue
2•sebst•39m ago•0 comments

Show HN: Zuma Portable

https://drive.google.com/drive/folders/1hDRvlY707VrO_UztEtIt1EoPgKBICL8Q?usp=sharing
1•zeeeeeebo•43m ago•0 comments

Simpson's Paradox

https://en.wikipedia.org/wiki/Simpson%27s_paradox
5•basilikum•45m ago•1 comments

California man uses elaborate drone show to help delivery drivers find his house

https://www.dexerto.com/entertainment/california-man-uses-elaborate-drone-show-to-help-delivery-d...
2•gnabgib•47m ago•0 comments

Exit, Voice, and Loyalty

https://en.wikipedia.org/wiki/Exit,_Voice,_and_Loyalty
2•akyuu•48m ago•0 comments

Why should a Trace-ID be 128 bits? (A Surprisingly Long Answer)

https://newsletter.signoz.io/p/why-should-a-trace-id-be-128-bits
2•birdculture•48m ago•0 comments

HN Signal (Last 24 hours) | Curated top stories from HN in the last 24 hrs

https://www.heydebrief.com/dubkc/hn-best-24
1•baetylus•55m ago•2 comments

DeepClaude – Claude Code agent loop with DeepSeek V4 Pro, 17x cheaper

https://github.com/aattaran/deepclaude
18•alattaran•57m ago•13 comments

Show HN: Triggering anti-cheats with just a browser tab title

https://github.com/elliott-diy/DontTrustTitles
6•Elliott-Diy•57m ago•1 comments

Broadcasting GPS on the Local Network

https://evertpot.com/broadcasting-gps-on-local-network/
1•treve•59m ago•0 comments

Brutal in production, lands like a verdict

1•Non_Von_Neumann•1h ago•0 comments

ReactOS Introduces Unified Live/Install Media, New Storage Driver

https://www.phoronix.com/news/ReactOS-Unified-ISO
2•kykat•1h ago•1 comments

Introduction to Atom

https://validator.w3.org/feed/docs/atom.html
4•susam•1h ago•1 comments

The Google Cloud Knowledge Catalog

https://cloud.google.com/blog/products/data-analytics/introducing-the-google-cloud-knowledge-catalog
2•laxmena•1h ago•0 comments

Next-Token Predictor Is An AI's Job, Not Its Species

https://www.astralcodexten.com/p/next-token-predictor-is-an-ais-job
1•optimalsolver•1h ago•0 comments