frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•8mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

I studied the latest Epstein files. As a woman, this is what I felt

https://www.thetimes.com/life-style/celebrity/article/i-studied-the-latest-epstein-files-as-a-wom...
1•binning•50s ago•0 comments

I think I created a perfect product.

https://www.woroboro.com/privacy.html
1•kovaljubo•2m ago•1 comments

ICE urged to explain memo about collecting info on protesters

https://arstechnica.com/tech-policy/2026/02/capture-it-all-ice-urged-to-explain-memo-about-collec...
2•pseudolus•3m ago•0 comments

Intel's Xeon 600 Pushes Client Workstations into Server-Class Territory

https://www.storagereview.com/news/intels-xeon-600-pushes-client-workstations-into-server-class-t...
1•rbanffy•4m ago•0 comments

Show HN: UCP Checker – A manifest debugger for the agentic web

https://ucpchecker.com/extension
1•benjifisher•5m ago•1 comments

Show HN: Fast Sudoku solver that enumerates all solutions

https://sudoku-solver.piyochan.jp
1•math-hiyoko•5m ago•0 comments

A Trump 'Blockade' Is Stalling Wind and Solar Projects Nationwide

https://www.nytimes.com/2026/02/04/climate/wind-solar-projects.html
1•doener•8m ago•2 comments

Silver Star Airpower: Airmen and Guardians Take on Iran

https://www.airandspaceforces.com/article/silver-star-airpower-airmen-and-guardians-take-on-iran/
2•speckx•8m ago•0 comments

Does AI have human-level intelligence? The evidence is clear

https://www.nature.com/articles/d41586-026-00285-6#ref-CR8
1•fdeage•8m ago•0 comments

Manual on Uniform Traffic Control Devices for Streets and Highways

https://mutcd.fhwa.dot.gov/
1•mhb•9m ago•0 comments

Mappa – Fine-tune ANY multi-agent LLM systems end-to-end with AI coaches

2•junyuren•10m ago•2 comments

ReTerminal E1001

https://www.seeedstudio.com/reTerminal-E1001-p-6534.html
1•crummy•11m ago•0 comments

1.6M cubic metres of fake snow are ready for the Winter Olympics

https://www.euronews.com/green/2026/01/28/16-million-cubic-metres-of-fake-snow-are-ready-for-the-...
1•jumpocelot•12m ago•0 comments

Alexa+ powered by Anthropic now Generally available in the US

https://www.aboutamazon.com/news/devices/alexa-plus-available-free-prime-members-us
1•jxyxfinite•13m ago•1 comments

The Google Squeeze

https://stratechery.com/2019/the-google-squeeze/
1•fanf2•14m ago•0 comments

Show HN: K8s clusters on macOS using Apple's containerization framework

https://github.com/willswire/cluster
1•willswire•15m ago•0 comments

Devenv: Declarative Developer Environments using Nix

https://devenv.sh/
1•dtj1123•15m ago•0 comments

Show HN: TabChop – AI parses receipts into shareable, realtime itemized splits

https://tabchop.app/overview
3•ydumpeta•16m ago•1 comments

Show HN: Csvdb – Git-friendly CSV directories that convert to SQLite or DuckDB

https://github.com/jeff-gorelick/csvdb
1•jeff-gorelick•17m ago•0 comments

We built Moltbook a search engine

https://moltsearch.algolia.com
4•l_whalen_alg•17m ago•1 comments

Learning Low-Level Computing and C++ by Making a Game Boy Emulator

https://byteofmelon.com/blog/2026/making-of-gamebyte
2•romes•18m ago•0 comments

The Legacy of Daniel Kahneman: A Personal View (2025)

https://ejpe.org/journal/article/view/1075/753
1•cainxinth•19m ago•0 comments

JetBrains drops X11 for Wayland as default in IntelliJ-based IDEs

https://www.neowin.net/news/jetbrains-drops-x11-for-wayland-as-default-in-intellij-based-ides/
2•bundie•20m ago•0 comments

EarlyBinder and Instantiating Parameters

https://rustc-dev-guide.rust-lang.org/ty-module/early-binder.html
1•todsacerdoti•20m ago•0 comments

Show HN: Resume Tailor – Privacy-first resume rewriter (no signup)

https://deadsimpletools.com/resume-tailor
1•midnightdim•21m ago•0 comments

3.5%, General Strikes, and Goals

https://www.patreon.com/posts/3-5-general-and-149563547
2•ortr•21m ago•0 comments

Show HN: LIAM – email and calendar assistant that drafts replies and schedules

https://doitliam.com
4•sintem•21m ago•4 comments

LispE: Lisp Interpreter with Pattern Programming and Lazy Evaluation

https://github.com/naver/lispe
1•PaulHoule•21m ago•0 comments

ICE and Epstein

https://www.patreon.com/posts/ice-and-epstein-149619562
2•ortr•22m ago•0 comments

Claude Code for Infrastructure

https://www.fluid.sh/
2•aspectrr•22m ago•1 comments