frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•9mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Show HN: I've revamped my running app

1•brunooliv•4m ago•0 comments

AI Agent Reliability Tracker

https://hal.cs.princeton.edu/reliability/
1•parksb•5m ago•0 comments

The Sunday Signal: Two Futures. One Decade. Your Choice

https://newsletter.djr.ai/p/the-sunday-signal-two-futures-one
1•discoinferno•11m ago•0 comments

Boltzmann Brain

https://en.wikipedia.org/wiki/Boltzmann_brain
2•baalimago•16m ago•0 comments

Dog Still Exists

https://stillhere.stunl.io/
1•Tomte•16m ago•0 comments

Safest Jobs with Least AI Risk, According to Anthropic

https://www.forbes.com/sites/johnkoetsier/2026/03/06/here-are-the-6-safest-jobs-with-least-ai-ris...
1•iamflimflam1•17m ago•0 comments

Strike on girls' school that killed 150 in Iran 'likely' carried out by US

https://today.lorientlejour.com/article/1497949/strike-on-girls-school-that-killed-150-in-iran-wa...
2•vrganj•19m ago•0 comments

Tech bros are lying to you about the MacBook Neo

https://www.macworld.com/article/3081039/tech-bros-are-lying-to-you-about-the-macbook-neo.html
2•baal80spam•21m ago•0 comments

Binding port 0 to avoid port collisions

https://ntietz.com/blog/binding-ephemeral-port/
1•birdculture•21m ago•0 comments

You don't need complex agent orchestration

https://tornikeo.com/agent-orchestration/
1•tornikeo•22m ago•0 comments

Yanicklandry/Claude-code-history-viewer: Browse your Claude Code session history

https://github.com/yanicklandry/claude-code-history-viewer
1•ankitg12•22m ago•0 comments

OpenSpec: Spec-driven development (SDD) for AI coding assistants

https://github.com/Fission-AI/OpenSpec/
1•tilt•23m ago•0 comments

Show HN: Proxly – Self-hosted tunneling on your own domain in 60 second

1•a1tem•26m ago•0 comments

Show HN: Conflicts.app, Iran conflict dashboard better then alternatives

https://www.conflicts.app/dashboard
3•juliusolsson•28m ago•0 comments

Show HN: J2Download – A simple online downloader supporting 40 platforms

https://j2download.com/
1•manhg•28m ago•0 comments

Bippy: React Internals Toolkit

https://www.bippy.dev/
1•handfuloflight•28m ago•0 comments

The Window Chrome of Our Discontent

https://pxlnv.com/blog/window-chrome-of-our-discontent/
1•SoKamil•32m ago•0 comments

How I've learned that certainty is the thing to fear

https://www.bbc.com/news/articles/c1w5z1d447lo
1•cmsefton•33m ago•0 comments

Show HN: Muffle – Blur everything except the active window in macOS

https://www.getmuffle.com/
1•AbjMV•35m ago•1 comments

I was "early" in agentic coding. Here's my story

4•noemit•41m ago•2 comments

Show HN: Drizby – WIP Metabase Alternative

https://www.drizby.com
1•cliftonc•42m ago•0 comments

The First Multi-Behavior Brain Upload

https://twitter.com/alexwg/status/2030217301929132323
1•DarkCow•42m ago•0 comments

Anthropic CEO reveals the reasons he rejected The Pentagon

https://xcancel.com/0xmitsurii/status/2030451168678457766
4•doener•43m ago•0 comments

Show HN: Stardial – a highly customizable terminal clock (Rust)

https://github.com/hisuic/stardial
2•firesushi•44m ago•0 comments

Emporion: A P2P Economy for Agents

https://github.com/garydevenay/emporion
1•garydevenay•44m ago•1 comments

Microsoft/Hve-Core

https://github.com/microsoft/hve-core
2•coderlens•45m ago•0 comments

Solving Compaction with Lobotomy

https://grimridge.net/blog/solving-compaction-with-lobotomy/
2•WadeGrimridge•46m ago•0 comments

Pushing and pulling: three reactivity algorithms

https://jonathan-frere.com/posts/reactivity-algorithms/
1•fanf2•48m ago•0 comments

Reverse engineering a DOS game with no source code using Codex 5.4

https://github.com/ammaarreshi/SkyRoads-Codex
1•smusamashah•48m ago•1 comments

Show HN: OpenClaw – Self-host OpenClaw in one command

1•congzhangzh•54m ago•0 comments