frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•1y ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Magnets Are Bad for Hardware Again

https://hackaday.com/2026/05/21/magnets-are-bad-for-hardware-again/
1•speckx•30s ago•0 comments

Notes from the Mistral AI Now Summit in Paris

https://koenvangilst.nl/lab/mistral-ai-now-summit
1•vnglst•35s ago•0 comments

Liquid AI reveals 8B-A1B MoE trained on 38T

https://www.liquid.ai/blog/lfm2-5-8b-a1b
1•simjnd•2m ago•0 comments

Strace-UI, Bonsai_term, and the TUI Renaissance

https://blog.janestreet.com/strace-ui-bonsai-term-and-the-tui-renaissance/
1•hardwaregeek•2m ago•0 comments

Same Driver, new vehicle: Welcoming our first riders trips in the Ojai

https://waymo.com/blog/2026/05/welcoming-riders-in-the-ojai/
1•xnx•3m ago•0 comments

Which LLM is the best at finding real vulnerabilities?

https://medium.com/@lp1/which-llm-is-the-best-at-finding-real-vulnerabilities-part-1-2c51802cd55b
1•leakr•3m ago•0 comments

Lipstick on a Pig

https://blog.fredrb.com/2026/05/29/lipstick-on-a-pig/
1•fredrb•4m ago•0 comments

95% of Canceled Annual App Subscribers Never Come Back

https://9to5mac.com/2026/05/27/new-report-shows-annual-app-subscribers-rarely-return-after-they-c...
1•karakoram•4m ago•0 comments

Ask HN: Is anybody providing deterministic LLMs?

1•julienreszka•5m ago•1 comments

The Unsustainable AI Subsidy

https://tomtunguz.com/ai-model-inflation/
1•djha-skin•5m ago•0 comments

Slang.net added a new AI word: Braging

https://slang.net/meaning/braging
2•jamestarr474•6m ago•1 comments

Online (One-Pass) Algorithms

https://www.johndcook.com/blog/2026/05/29/online-one-pass-algorithms/
1•ibobev•7m ago•0 comments

Embodied Cognition and Agentic AI

https://lemire.me/blog/2026/05/28/embodied-cognition-and-agentic-ai/
1•ibobev•7m ago•0 comments

Ask HN: Any advice on how to learn good software architecture practices?

3•jimsojim•8m ago•0 comments

Pill That Regrows And Repairs Teeth

https://www.futura-sciences.com/en/say-goodbye-to-dental-implants-the-pill-that-regrows-and-repai...
1•karakoram•9m ago•0 comments

Changing a Title Can Make an Epoch-Making Article Invisible

1•kokhanserhii•9m ago•0 comments

Satradar – Track 10k+ Satellites up to 120 FPS

https://satradar.com/
2•davidwhodge•10m ago•0 comments

An attempt to calculate how far behind each AI lab is from the frontier

https://labgaps.com
1•cusos•10m ago•0 comments

Gen Z Men Want Babies. Gen Z Women Don't

https://www.vox.com/podcasts/480877/gen-z-men-wanna-be-dads
3•karakoram•11m ago•1 comments

Ask HN: How would you benchmark your engineering team's AI adoption?

1•cby•11m ago•1 comments

Predicting AI Job Exposure

https://www.ben-evans.com/benedictevans/2026/5/24/ai-job-exposure
1•iamskeole•11m ago•0 comments

Flathub disallows AI-assisted code and documentation

https://social.treehouse.systems/@barthalion/116657011366876079
2•jarek-foksa•12m ago•0 comments

Study finds AI chose nuclear signalling in 95% of simulated crises

https://www.kcl.ac.uk/news/artificial-intelligence-under-nuclear-pressure-first-large-scale-kings...
2•pseudolus•14m ago•2 comments

Use all AI futer for free& unlimited. try now

https://sites.google.com/view/rrrpromex/home
1•rrrpro123•15m ago•0 comments

Show HN: Oort – A prompt library where every listing has a shipped project

https://oortstack.com
1•Wesearchpress•15m ago•0 comments

Show HN: Promptloop – create, run, and improve prompt evals from the terminal

https://github.com/Bella3202019/promptloop
1•velapod•15m ago•0 comments

Master Your Online Meetings Workflow with CallBro: The Intro

https://medium.com/@MSalnikov/master-your-workflow-with-callbro-the-intro-ad6d7fae4104
2•zlat1997•15m ago•0 comments

Why the failure of Blue Origin's New Glenn rocket is so catastrophic

https://arstechnica.com/space/2026/05/heres-why-the-failure-of-blue-origins-new-glenn-rocket-is-s...
2•Brajeshwar•16m ago•0 comments

AI can chart a course to disaster faster than humans can notice

https://thebulletin.org/2026/05/ai-can-chart-a-course-to-disaster-faster-than-humans-can-notice/
1•pseudolus•16m ago•0 comments

Pioneering the Agentic Shift Within Salesforce Engineering

https://www.salesforce.com/news/stories/how-engineering-became-agentic/?bc=HL
2•shenli3514•17m ago•0 comments