frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•11mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

ISO Free Testers – Post Edutainment Events

https://dream2career.org/list/
1•EDU-ADVISOR•28s ago•1 comments

Dopamine bends time in our brain, making novel moments memorable

https://refractor.io/learning-memory/dopamine-dilates-time-novel-events/
1•geox•44s ago•0 comments

Today I've made the difficult decision to reduce the size of Coinbase by ~14%

https://twitter.com/brian_armstrong/status/2051616759145185723
1•adrianmsmith•58s ago•0 comments

AI data centers appear to be creating their own microclimates

https://www.sfgate.com/bayarea/article/ai-data-center-microclimates-22236756.php
1•cainxinth•2m ago•0 comments

Datapoint 2200: the machine laid the foundation for PC from Apple and IBM

https://spectrum.ieee.org/legacy-of-datapoint-2200-microcomputer
2•giuliomagnifico•4m ago•0 comments

Google Gemini Down

https://downdetector.ca/status/googlegemini/
3•caonidaye•5m ago•1 comments

How are the knives on this website,paragon-knives.com?

2•bgzlsxaz•6m ago•0 comments

FastLapackInterface: Allocation-free (eigen)decomposition for Julia

https://dynarejulia.github.io/FastLapackInterface.jl/stable/
2•phoebos•7m ago•0 comments

Design Twice, Ship Once

https://brunokiafuka.substack.com/p/design-twice-ship-once
2•brunokiafuka•7m ago•0 comments

Show HN: I used computer vision to annotate human concepts it wasn't built for

https://howtosee.life/
2•tomascarlson•8m ago•0 comments

Trump Is Losing a Second War

https://paulkrugman.substack.com/p/trump-is-losing-a-second-war
2•rbanffy•8m ago•1 comments

OpenClaw Had a Rough Week

https://openclaw.ai/blog/openclaw-rough-week
2•aliasocracy•9m ago•0 comments

AI firms should face 'minimum wage for robots' to limit job cuts, says tech boss

https://www.bbc.co.uk/news/articles/cjep33w1q7wo
3•dijksterhuis•9m ago•0 comments

McCLIM 1.0 – a GUI toolkit for Common Lisp

https://mcclim.common-lisp.dev/posts/McCLIM-100-Koliada-release.html
2•slyrus•10m ago•0 comments

New AI-Powered Age Assurance Measures [Meta]

https://about.fb.com/news/2026/05/ai-age-assurance-teens/
2•_1•10m ago•0 comments

Show HN: Likewise – a protocol for decentralized personal knowledge graphs

https://getlikewise.ai/spec/
2•danielrmay•11m ago•1 comments

Show HN: I created a Neovim plugin to replace Xcode

https://github.com/wojciech-kulik/xcodebuild.nvim
2•wojciech-kulik•12m ago•0 comments

Collapse is not random – run a minimal test in 30 seconds (Colab)

https://colab.research.google.com/drive/1FGu7_G1avub-0PUV6cpdox8cojuOvY9C
2•hiroakiaizawa•13m ago•1 comments

Rolling the Root Key

https://blog.apnic.net/2026/05/05/rolling-the-root-key/
2•jandeboevrie•14m ago•0 comments

OpenCL 3.1 is here

https://www.khronos.org/blog/opencl-3.1-is-here
2•jrepinc•16m ago•0 comments

Topaz vs. Azurite: what works locally and what doesn't

https://topaz.thecloudtheory.com/blog/topaz-vs-azurite/
2•kamilmrzyglod•16m ago•0 comments

Why ADHD Is the Cheat Code of the AI Era

https://www.airsugar.com/p/why-adhd-is-the-cheat-code-of-the
4•herbertl•18m ago•1 comments

Show HN: PulsePages – Multi-page websites for $9/year (Carrd alternative

https://www.pulsepages.co
2•erichensley•21m ago•0 comments

A Reddit commenter warned me..I laughed it off. Then they wiped everything

https://onetile.me
3•omara123•21m ago•0 comments

We Can Do Hard Things

https://allenpike.com/2026/we-can-do-hard-things/
5•herbertl•22m ago•0 comments

Cerebras targets $26.6B valuation in US IPO as AI chip demand surges

https://www.reuters.com/business/ai-chipmaker-cerebras-targets-115-125-share-price-us-ipo-source-...
2•giuliomagnifico•22m ago•0 comments

Fizz Buzz Through Monoids

https://entropicthoughts.com/fizzbuzz-through-monoids
2•ibobev•23m ago•0 comments

Accountants in Ilford

https://skzee.co.uk/accountants-in-ilford/
2•syedsherazahmed•23m ago•0 comments

Show HN: Instantly understand any GitHub repo

https://gitdiagram.com
2•ahmedkhaleel•24m ago•0 comments

A Love Letter to Flashcards

https://lesleylai.info/en/flashcards/
2•ibobev•24m ago•0 comments