Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•11mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now?
- Auditable, stateless reward functions for Python code: docstrings, ruff linting, type hints, recursion, and more
- Works directly with Huggingface's TRL or any RL loop: plug reward functions in as needed
- MIT licensed and minimal
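To make "stateless, verifiable reward function" concrete, here is a minimal sketch of what a docstring reward could look like. It is not Zeno's actual API: the name `docstring_reward` and the scoring rule (fraction of functions that carry a docstring) are assumptions for illustration. The signature assumes the convention TRL documents for custom reward functions, where the callable receives `completions` plus extra keyword arguments and returns one float per completion, and it assumes each completion is a plain string of Python source.

```python
import ast


def docstring_reward(completions, **kwargs):
    """Score each completion by the fraction of its functions that have a docstring.

    Hypothetical example, not taken from Zeno. Deterministic and stateless:
    the same completion always gets the same score.
    """
    rewards = []
    for code in completions:
        try:
            tree = ast.parse(code)
        except (SyntaxError, ValueError):
            # Code that does not parse earns no reward.
            rewards.append(0.0)
            continue

        # Collect every function definition, including nested and async ones.
        funcs = [
            node
            for node in ast.walk(tree)
            if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
        ]
        if not funcs:
            # Nothing to document, so nothing to reward.
            rewards.append(0.0)
            continue

        documented = sum(1 for f in funcs if ast.get_docstring(f) is not None)
        rewards.append(documented / len(funcs))
    return rewards
```

Because the function only inspects the completion text, it can be called from any RL loop, and a training run can be audited afterwards by re-scoring the logged completions.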

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning, and agentic behaviors are on the to-do list.

Repo: https://github.com/think-a-tron/zeno

Docs and more details are in the README.

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.
