frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•11mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Blog widget that switches 14 languages instantly, scroll preserved

https://blog.devforkhire.com/tech-ep1-en.html
1•hashedit•1m ago•0 comments

Ask HN: Is the job market actually bad or just different?

1•sovenyr•3m ago•0 comments

Gibraltar dumping all of its raw sewage into Mediterranean

https://www.theguardian.com/world/2026/may/06/uk-territory-gibraltar-dumps-raw-sewage-mediterranean
1•akyuu•3m ago•0 comments

Show HN: Is he OK? Senior safety monitoring app

https://howareu.app/
1•sminchev•4m ago•0 comments

The first lie about entrepreneurship

1•danish00111•7m ago•0 comments

InMusic will acquire Native Instruments, as NI joins brands from Akai to Moog

https://cdm.link/inmusic-will-acquire-native-instruments/
1•mrzool•7m ago•0 comments

In coal country, black lung surges as federal protections stall

https://e360.yale.edu/features/black-lung-pennsylvania
1•speckx•7m ago•0 comments

"ClaudeBleed" allows any Chrome extension to control Anthropic's AI assistant

https://cyberinsider.com/claudebleed-allows-any-chrome-extension-to-control-anthropics-ai-assistant/
1•flyaway123•10m ago•0 comments

Write programs you can still hack when you feel dumb

https://www.draketo.de/software/programs-you-can-still-hack-when-dumb.html
1•xhevahir•12m ago•0 comments

Open Source AI App Store Screenshot Designer

https://ai-app-store-screenshots.vercel.app/
1•jonnyjackson26•13m ago•0 comments

Gemma Chat: Offline Vibe Coding on Apple Silicon

https://github.com/ammaarreshi/gemma-chat
1•steveharing1•14m ago•0 comments

New Orleans needs to prepare to relocate residents

https://www.npr.org/2026/05/06/nx-s1-5810941/new-orleans-ocean-study
1•measurablefunc•16m ago•0 comments

Movies Are Too Long

https://www.slowboring.com/p/why-movies-are-getting-longer
2•paulpauper•25m ago•0 comments

Smoking, chromosomal aberrations, and cancer incidence in healthy subjects

https://www.sciencedirect.com/science/article/pii/S1383571821000644
1•paulpauper•26m ago•0 comments

Show HN: ChonkLM – Tiny language models running offline in the browser

https://chonklm.com
3•bilalba•26m ago•0 comments

Two kinds of work, and where AI belongs

https://twitter.com/talhof8/status/2051337721151455509
1•talhof8•26m ago•0 comments

Three Model Organisms for Taste

https://www.astralcodexten.com/p/three-model-organisms-for-taste
1•paulpauper•26m ago•0 comments

Off-Grid Boat Communications with Meshtastic

https://blog.noforeignland.com/off-grid-boat-communications-with-meshtastic/
1•tmalsburg2•26m ago•0 comments

I've Banned Query Strings

https://chrismorgan.info/no-query-strings
2•Brajeshwar•28m ago•0 comments

The XM30 program: The Army's Bradley replacement

https://taskandpurpose.com/tech-tactics/army-xm30-bradley-replacement/
1•lorenzohess•29m ago•0 comments

APRS Messaging 36 Miles with Two HTs

https://midnightcheese.com/2026/05/aprs-message-36-miles-two-ht-radios/
1•thcipriani•31m ago•0 comments

Simpson Chalkboard Generator: A Tool for Creating Bart Chalkboard Stills

https://enufstyle.com/generators/bart/
3•stefankuehnel•40m ago•0 comments

Extraterrestrial intelligent beings do not exist

https://articles.adsabs.harvard.edu/pdf/1980QJRAS..21..267T
2•Topfi•40m ago•1 comments

Show HN: Mlx-code – I built a "backyard shed" AI coding agent for Mac

https://github.com/JosefAlbers/mlx-code
1•JosefAlbers•41m ago•0 comments

Claude Code Sandboxing

https://code.claude.com/docs/en/sandboxing
3•Destiner•41m ago•0 comments

Merle Tuve and the development of the proximity fuze

https://www.youtube.com/watch?v=yjQtzk_4czg
1•bane•41m ago•1 comments

Drivers of success – the gap between actual drivers and what we read about

https://alearningaday.blog/2018/12/26/drivers-of-success-the-gap-between-actual-drivers-and-what-...
2•Olshansky•43m ago•0 comments

New financed PostmarketOS project: q6voice(d)

https://postmarketos.org/blog/2026/05/08/q6voice-project/
1•wicket•46m ago•0 comments

Musk, Altman Management Styles Under Fire at OpenAI Trial

https://www.bloomberg.com/news/articles/2026-05-08/musk-altman-management-styles-come-under-fire-...
1•1vuio0pswjnm7•49m ago•1 comments

Musk has never built a wafer fab, but he wants to burn $119B on one anyway

https://www.theregister.com/systems/2026/05/06/spacex-plots-119b-wafer-fab-to-make-elons-orbital-...
4•Bender•51m ago•1 comments