Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•9mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extensible across domains (math, retrieval, reasoning, tool use, etc.).

What's in Zeno for now?

- Auditable, stateless reward functions for Python code: docstrings, ruff linting, type hints, recursion, and more.
- Works directly with Hugging Face's TRL or any RL loop: plug reward functions in as needed.
- MIT licensed and minimal.
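To make "stateless, deterministic reward function" concrete, here is a minimal sketch of a docstring check. It is not taken from Zeno: the name `docstring_reward` and the scoring rule (fraction of functions with a docstring) are assumptions for illustration, and it assumes each completion is a plain string of Python source. The `(completions, **kwargs) -> list[float]` shape is the one TRL's `GRPOTrainer` accepts for custom reward callables.

```python
import ast


def docstring_reward(completions, **kwargs):
    """Score each completion by the fraction of its functions that have a docstring.

    Hypothetical example, not Zeno's implementation. Unparseable code and
    code with no functions both score 0.0.
    """
    rewards = []
    for source in completions:
        try:
            tree = ast.parse(source)
        except (SyntaxError, ValueError):
            # Code that does not parse gets the minimum reward.
            rewards.append(0.0)
            continue

        # Collect every function definition, including nested ones and methods.
        functions = [
            node
            for node in ast.walk(tree)
            if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
        ]
        if not functions:
            rewards.append(0.0)
            continue

        documented = sum(1 for fn in functions if ast.get_docstring(fn))
        rewards.append(documented / len(functions))
    return rewards
```

Because the function only parses the text and never executes it, the same completion always gets the same score, which is what makes it easy to audit. In a TRL setup, a callable like this would typically be passed through the trainer's `reward_funcs` argument.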

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning, and agentic behaviors are on the to-do list.

Repo: https://github.com/think-a-tron/zeno

Docs and more details are in the README.

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.
