Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•11mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extensible across domains (math, retrieval, reasoning, tool use, etc.).

What's in Zeno for now?
- Auditable, stateless reward functions for Python code: docstrings, ruff linting, type hints, recursion, and more.
- Works directly with Hugging Face's TRL or any RL loop; plug reward functions in as needed.
- MIT licensed and minimal.
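To make "stateless, deterministic reward function" concrete, here is a minimal sketch of what such checks could look like. This is not Zeno's actual API: the names `docstring_reward` and `type_hint_reward` and the helper functions are hypothetical, and the sketch assumes completions arrive as plain strings of Python source. The `(completions, **kwargs) -> list[float]` shape is modeled on the custom reward callables that TRL's GRPO trainer accepts, so check the TRL docs for the exact contract in your version.

```python
import ast

# Hypothetical helpers for illustration only; not taken from the Zeno codebase.
_FUNC_TYPES = (ast.FunctionDef, ast.AsyncFunctionDef)


def _functions(source):
    """Return all function definitions in `source`, or None if it does not parse."""
    try:
        tree = ast.parse(source)
    except SyntaxError:
        return None
    return [node for node in ast.walk(tree) if isinstance(node, _FUNC_TYPES)]


def _fully_annotated(func):
    """True if every positional/keyword-only argument and the return value are annotated."""
    args = func.args.posonlyargs + func.args.args + func.args.kwonlyargs
    return func.returns is not None and all(a.annotation is not None for a in args)


def _fraction_reward(completions, predicate):
    """Score each completion as the fraction of its functions satisfying `predicate`."""
    rewards = []
    for source in completions:
        funcs = _functions(source)
        if not funcs:  # unparseable code or no functions at all scores 0.0
            rewards.append(0.0)
            continue
        rewards.append(sum(1 for f in funcs if predicate(f)) / len(funcs))
    return rewards


def docstring_reward(completions, **kwargs):
    """Reward in [0, 1]: share of functions that carry a docstring."""
    return _fraction_reward(completions, lambda f: ast.get_docstring(f) is not None)


def type_hint_reward(completions, **kwargs):
    """Reward in [0, 1]: share of functions with fully annotated signatures."""
    return _fraction_reward(completions, _fully_annotated)
```

Because each function depends only on the text of the completion, the same input always yields the same score, which is what makes the reward auditable. In a TRL setup, callables like these would typically be passed as a list through the trainer's `reward_funcs` argument; in a custom RL loop they can be called directly on a batch of generated strings.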

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning, and agentic behaviors are on the to-do list.

Repo: https://github.com/think-a-tron/zeno

Docs and more details are in the README.

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.
