frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•6mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Klarna to launch dollar-backed stablecoin as race in digital payments heats up

https://www.reuters.com/business/finance/klarna-launch-dollar-backed-stablecoin-race-digital-paym...
1•krrishd•18s ago•0 comments

Optimzing Our Jax LLM RL Pipeline

https://notes.kvfrans.com/7-misc/rl-infra.html
1•lnyan•1m ago•0 comments

The Nostromo Project (2011) [video]

https://www.youtube.com/watch?v=9NoCsZvYeEQ
1•exvi•2m ago•0 comments

80.1 % on LoCoMo Long-Term Memory Benchmark with a pure open-source RAG pipeline

1•ViktorKuz•6m ago•0 comments

Nostromo: A Legend Born and Born Again - Part 1 (2011)

https://web.archive.org/web/20110519073318/http://www.therpf.com/f45/prop-store-first-look-nostro...
1•exvi•7m ago•0 comments

<5KB demoscene intro by Claude

https://demo-blue-fog-5621.fly.dev/
1•MattPearce•10m ago•1 comments

A Math Horror Show at UC San Diego

https://www.wsj.com/opinion/a-math-horror-show-at-cal-at-san-diego-c91f2035
1•delichon•11m ago•0 comments

US triples national park fee for non-residents, amid 'new' fee for Americans

https://www.theguardian.com/us-news/2025/nov/25/national-park-fee-non-residents
1•c420•12m ago•1 comments

Client Registration and Enterprise Management in the Nov 2025 MCP Auth Spec

https://aaronparecki.com/2025/11/25/1/mcp-authorization-spec-update
1•gz5•12m ago•0 comments

Markets are getting more concerned about Oracle's AI data center debt

https://sherwood.news/markets/markets-are-getting-more-concerned-about-oracles-ai-data-center-debt/
1•zerosizedweasle•13m ago•0 comments

Plug it in and make it magic

https://doingsoftwarewrong.com/blog/plug-in-ai-magic/
1•ChunkyAu•16m ago•0 comments

Java Quirks: Bridge and Synthetic Methods for Reflection

https://www.ptidej.net/blog/bridge-methods-java/
2•yann-gael•18m ago•1 comments

CS QLola

https://news.ycombinator.com
2•bappaforjio•21m ago•0 comments

Lifetime Safety in Clang – 2025 US LLVM Developers' Meeting [video]

https://www.youtube.com/watch?v=3zWK7Lx96vI
1•matt_d•24m ago•0 comments

Joe Armstrong – The mess we are in

https://youtu.be/lKXe3HUG2l4?si=YEbsd9xOCH_yP_C2
1•lifeisstillgood•28m ago•0 comments

Ask HN: Hard and deep tech – why are Jira and Confluence the go-to PM tools?

1•dnlh_lvg•28m ago•2 comments

Dr. Chainlove Or: How I Learned to Stop Worrying and Love On-Chain Gaming

https://organizedplayer.substack.com/p/dr-chainlove-or-how-i-learned-to
1•0north•29m ago•0 comments

Prosecutor Used Flawed A.I. To Keep a Man in Jail, His Lawyers Say

https://www.nytimes.com/2025/11/25/us/prosecutor-artificial-intelligence-errors-lawyers-californi...
3•perihelions•31m ago•0 comments

BebboSSH: SSH2 implementation for Amiga systems (68000, GPLv3)

https://franke.ms/git/bebbo/bebbossh
1•snvzz•31m ago•0 comments

Genesis Mission – A National Mission to Accelerate Science Through AI

https://genesis.energy.gov/
1•Anon84•35m ago•0 comments

Design Follows Data Structures

https://www.tedinski.com/2019/01/29/data-structures-are-fundamental.html
3•plutonium3345•36m ago•0 comments

Maybe some people should just give up [video]

https://www.youtube.com/watch?v=rsoEipuwXiI
1•koakuma-chan•39m ago•0 comments

I tracked 609 food additives across 817K products to find awareness gaps

https://compareadditives.com
4•markvitals•39m ago•2 comments

GrapheneOS ceases operations in France amid pressure and legal threats

https://alternativeto.net/news/2025/11/grapheneos-ceases-operations-in-france-amid-pressure-and-l...
3•airhangerf15•42m ago•0 comments

Are LLMs the Best That They Will Ever Be?

https://asimovaddendum.substack.com/p/are-llms-the-best-that-they-will
3•rufusrock•43m ago•2 comments

Scientists can now watch metal crystals grow inside liquid metal

https://theconversation.com/scientists-can-now-watch-metal-crystals-grow-inside-liquid-metal-270451
3•billybuckwheat•49m ago•0 comments

Automating Linux Backups with Rsync: A Set-and-Forget Strategy

https://orioninsist.org/blog/linux-automated-backup-rsync-guide/
1•orioninsist•53m ago•0 comments

Show HN: Free macro dashboards with downloadable charts (e.g., EUR/USD)

https://fxmacrodata.com/dashboard/EUR_USD
1•roberttidball•55m ago•1 comments

Show HN: Real-time system that tracks how news spreads across 200k websites

https://yandori.io/news-flow/
1•antiochIst•56m ago•1 comments

Credits Are Not It

https://hengar.pika.page/posts/credits-are-not-it
2•hengar•59m ago•0 comments