frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•11mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Ask HN: What Is the "Lore" of HN?

1•Cider9986•1m ago•0 comments

Show HN: CoPilot for Project Management

https://quickapproveai.com/
1•xvok•1m ago•0 comments

Compu-Global-Hyper-Mega-Net: A Retro Internet for Retro Computers (LFNW 2026) [video]

https://www.youtube.com/watch?v=cSJsGNIDjtc
1•CursedSilicon•2m ago•0 comments

A man who decides when and where your next flight will be going

https://www.cnn.com/travel/airline-planning-officer-aviation-decisions
1•charrington•3m ago•0 comments

Event Clash of Prompts: A Real-Time Prompt Battle Royale

https://builder.aws.com
1•symbiotic_sec•3m ago•1 comments

Another supply-chain attack: elementary-data Python package compromised

https://arstechnica.com/security/2026/04/open-source-package-with-1-million-monthly-downloads-sto...
1•mil22•4m ago•1 comments

Live coverage: ULA to launch 29 Amazon Leo satellites on Atlas 5 LIVE in ~2hrs

https://spaceflightnow.com/2026/04/27/live-coverage-ula-to-launch-29-amazon-leo-satellites-on-atl...
1•bookmtn•6m ago•0 comments

Immigrants' Recent Effects on Government Budgets: 1994–2023

https://www.cato.org/white-paper/immigrants-recent-effects-government-budgets-1994-2023
3•Anon84•6m ago•0 comments

Talkie: a 13B vintage language model from 1930

https://talkie-lm.com/introducing-talkie
1•jekude•7m ago•0 comments

Ask HN: Will hardware ever be cheap again?

1•bjourne•8m ago•0 comments

Talkie: An LM from 1930

https://talkie-lm.com/chat
1•yusufozkan•8m ago•0 comments

ChatGPT Images 2.0 Still Can't Draw the Seven-Legged Spider I Want

https://will-keleher.com/posts/chatgpt-image-2-still-cant-draw-a-seven-legged-spider/
2•bsgada•13m ago•0 comments

AMD used AI to reimplement slurm in Rust

https://github.com/ROCm/spur
1•latchkey•13m ago•0 comments

Trump Administration Will Pay More Energy Firms to Cancel Wind Farms

https://www.nytimes.com/2026/04/27/climate/trump-administration-wind-farms.html
2•duxup•13m ago•1 comments

OlmoEarth: Custom embedding exports for downstream analysis

https://allenai.org/blog/olmoearth-embeddings
1•gmays•13m ago•0 comments

Using Rust to Build a $1 Handheld Gaming Console

https://chrisdell.info/using-rust-to-build-a-1-dollar-handheld-gaming-console/
2•kianryan•15m ago•0 comments

David Silver of DeepMind raises $1B to build AI that learns without human data

https://techcrunch.com/2026/04/27/deepminds-david-silver-just-raised-1-1b-to-build-an-ai-that-lea...
1•ryan_j_naughton•15m ago•0 comments

Gitglimpse – a CLI that turns your Git history into structured context

https://github.com/dino-zecevic/gitglimpse
1•dinoze•17m ago•1 comments

California's Billionaire Tax Has the Signatures to Make the Ballot

https://sfstandard.com/2026/04/26/california-billionaire-tax-2026/
3•m463•19m ago•0 comments

Switched from robot_localization to a single-node GPS fusion setup

https://github.com/manankharwar/fusioncore
1•kharwarm•23m ago•0 comments

Tell HN: Spam from Fridayaicore.in

2•HotGarbage•24m ago•0 comments

A History of Live Programming (2013)

https://liveprogramming.github.io/liveblog/2013/01/a-history-of-live-programming/
1•_doctor_love•26m ago•0 comments

AI strategy is all wrong

https://www.computerworld.com/article/4162557/your-ai-strategy-is-all-wrong.html
1•mikelgan•26m ago•1 comments

It's the Age of Electricity and America Isn't Ready

https://www.nytimes.com/interactive/2026/04/27/opinion/electricity-power-grid-infrastructure.html
4•rafaelc•27m ago•0 comments

Noctua releases official 3D CAD models for its cooling fans

https://www.noctua.at/en/3d-cad-models
1•embedding-shape•28m ago•1 comments

Tell HN: One Medical Is a Nightmare

4•rincebrain•29m ago•0 comments

My husband and son dived to see the wreck of the Titanic, and never came back

https://www.theguardian.com/world/2026/apr/25/my-husband-and-son-titan-submersible-christine-dawo...
2•makerdiety•31m ago•0 comments

CETaS Paper – From Cybercrime to Vibercrime?

https://cetas.turing.ac.uk/publications/cybercrime-vibercrime-assessing-generative-ai-adoption-cr...
1•susan_segfault•31m ago•0 comments

Smolvm

1•fukuzaki•32m ago•0 comments

Ex-GitHub CTO on Git internals

https://twitter.com/tnm/status/2046815943414935648
1•conormccarter•33m ago•0 comments