Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•9mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extensible across domains (math, retrieval, reasoning, tool use, etc.).

What's in Zeno for now?

- Auditable, stateless reward functions for Python code: docstrings, ruff linting, type hints, recursion, and more.
- Works directly with Hugging Face's TRL or any RL loop: plug reward functions in as needed.
- MIT licensed and minimal.
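
To make "verifiable, deterministic reward" concrete, here is a minimal sketch of what a stateless docstring reward could look like. It is not Zeno's actual API: the name `docstring_reward` and the scoring rule (fraction of functions that carry a docstring) are assumptions for illustration, and it uses only the standard library. The `(completions, **kwargs) -> list[float]` shape is meant to resemble the custom reward callables that TRL's GRPO trainer accepts, but check the TRL docs before relying on that.

```python
import ast


def docstring_reward(completions, **kwargs):
    """Score each completion by the fraction of its functions that have a docstring.

    Hypothetical example, not taken from Zeno. Returns one float in [0, 1]
    per completion. Code that does not parse, or that defines no functions,
    scores 0.0.
    """
    rewards = []
    for text in completions:
        try:
            tree = ast.parse(text)
        except (SyntaxError, ValueError):
            # Unparseable output earns nothing; the check stays deterministic.
            rewards.append(0.0)
            continue

        # Collect every function definition, including nested and async ones.
        funcs = [
            node
            for node in ast.walk(tree)
            if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
        ]
        if not funcs:
            rewards.append(0.0)
            continue

        documented = sum(1 for f in funcs if ast.get_docstring(f))
        rewards.append(documented / len(funcs))
    return rewards
```

Because the function only parses text and keeps no state, the same completion always gets the same score, and the score can be audited by reading the code.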

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning, and agentic behaviors are on the to-do list.

Repo: https://github.com/think-a-tron/zeno

Docs and more details are in the README.

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Claws are now a new layer on top of LLM agents

https://twitter.com/karpathy/status/2024987174077432126
1•Cyphase•54s ago•0 comments

SleepFM model predicts 130 conditions (C-Index >0.75) from one night of sleep

https://www.nature.com/articles/s41591-025-04133-4
1•marojejian•2m ago•1 comments

Information Wants to Be Surveiled

https://netwars.pelicancrossing.net/2026/02/20/information-wants-to-be-surveiled/
1•ColinWright•5m ago•0 comments

The Interest Rate on Your Codebase: A Financial Framework for Technical Debt

https://www.chiply.dev/post-technical-debt
1•gashad•5m ago•0 comments

Claude Code's compaction discards data that's still on disk

https://github.com/anthropics/claude-code/issues/26771
1•aciccarelli2•6m ago•1 comments

Show HN: Give your OpenClaw agent a face and voice with LiveKit and LemonSlice

https://github.com/openserv-labs/openclaw-voice-avatar
1•arbayi•7m ago•0 comments

Show HN: WatchTurm – an open-source release visibility layer I use in my work

1•WatchTurm•8m ago•0 comments

Show HN: Cellarium: A Playground for Cellular Automata

https://github.com/andrewosh/cellarium
1•andrewosh•10m ago•0 comments

Code Mode: give agents an API in 1k tokens

https://blog.cloudflare.com/code-mode-mcp/
1•janpio•10m ago•0 comments

The Two Things

https://www.csun.edu/~dgw61315/thetwothings.html#The%20Story%20of%20the%20Two%20Things
2•haunter•11m ago•0 comments

Show HN: MephistoMail – A RAM-only, tracker-free disposable email client

https://mephistomail.site?1
1•benmxrt•12m ago•0 comments

Gamedate – A site to revive dead multiplayer games

https://gamedate.org/
1•msuniverse2026•12m ago•0 comments

How Real-Time Voice Agents Work: Media Infrastructure and Latency

1•gokuljs•16m ago•0 comments

Show HN: Agent Passport – OAuth-like identity verification for AI agents

1•samerismail•17m ago•0 comments

Show HN: HN-Stories – Rust CLI to browse and open HN stories from the terminal

https://crates.io/crates/hn-stories
1•Brysonbw•18m ago•0 comments

eBay buys Depop for $1.2B in effort to lure younger shoppers

https://www.theguardian.com/technology/2026/feb/19/ebay-buys-depop-from-etsy
1•iamben•23m ago•0 comments

I Let Claude Read My Email

https://ericbrookfield.com/2026/02/20/i-let-claude-read-my-email/
2•surprisetalk•25m ago•0 comments

The Unbearable Weight of Cruft

https://www.joanwestenberg.com/the-unbearable-weight-of-cruft/
1•zdw•25m ago•0 comments

Cybernetic practices for design research pedagogy (2023)

https://onlinelibrary.wiley.com/doi/10.1002/sres.2974
1•andsoitis•26m ago•0 comments

Show HN: Routype – typed REST client in ~200 lines, no codegen

https://github.com/jbingen/routype
1•jbingen•27m ago•0 comments

Agent Compromised by Agent to Deploy an Agent

https://www.mbgsec.com/posts/2026-02-19-agent-repo-compromised-by-agent-to-install-an-agent/
2•chha•31m ago•0 comments

DHS Admits Its Website the 'Worst of the Worst' Immigrants Was Rife with Errors

https://www.cnn.com/2026/02/19/politics/homeland-security-worst-immigrants-website
3•TigerUniversity•32m ago•0 comments

The Stanford Emerging Technology Review 2026 [pdf]

https://setr.stanford.edu/sites/default/files/2026-01/SETR2026_web-260109.pdf
3•cantaloupe•33m ago•0 comments

How to Die Optimally – A Theory of Consumption When AI Takes Your Job

https://ngrislain.github.io/static/projects/ai-economics/ai-economics.html
2•ngrislain•36m ago•0 comments

ATAboy is a USB adapter for legacy CHS only style IDE (PATA) drives

https://github.com/redruM0381/ATAboy
5•zdw•38m ago•0 comments

Your tech or my tech: make up your mind quickly (2024)

https://berthub.eu/articles/posts/your-tech-my-tech/
3•pabs3•40m ago•0 comments

Show HN: Murl – Curl for MCP Servers

https://github.com/turlockmike/murl
4•turlockmike•41m ago•0 comments

Fork, Explore, Commit: OS Primitives for Agentic Exploration

https://arxiv.org/abs/2602.08199
3•wang_cong•42m ago•0 comments

Show HN: Are – Rule engine for JavaScript, C#, and Dart with playground

https://are-playground.netlify.app/
4•beratarpa•42m ago•0 comments