Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•8mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool use, etc.).

What's in Zeno for now?
- Auditable, stateless reward functions for Python code: docstrings, ruff linting, type hints, recursion, and more.
- Works directly with Hugging Face's TRL or any RL loop: plug reward functions in as needed.
- MIT licensed and minimal.
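To give a rough idea of what a stateless, deterministic reward for Python code can look like, here is a minimal sketch. It is not Zeno's actual API: the name `docstring_reward` and the scoring rule (fraction of functions that carry a docstring) are made up for illustration. It assumes completions arrive as plain strings of Python source and follows the "take a list of completions, return a list of floats" shape that TRL's GRPO trainer accepts for custom reward callables.

```python
import ast


def docstring_reward(completions, **kwargs):
    """Hypothetical reward: fraction of functions in each completion with a docstring.

    Assumes each completion is a plain string of Python source.
    Extra keyword arguments (prompts, dataset columns, ...) are ignored.
    """
    rewards = []
    for code in completions:
        try:
            tree = ast.parse(code)
        except (SyntaxError, ValueError):
            # Code that does not parse gets the minimum reward.
            rewards.append(0.0)
            continue

        # Collect every sync and async function definition, nested ones included.
        funcs = [
            node
            for node in ast.walk(tree)
            if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
        ]
        if not funcs:
            # Nothing to document, so nothing to reward (an arbitrary choice).
            rewards.append(0.0)
            continue

        documented = sum(1 for f in funcs if ast.get_docstring(f) is not None)
        rewards.append(documented / len(funcs))
    return rewards


if __name__ == "__main__":
    sample = 'def add(a, b):\n    """Add two numbers."""\n    return a + b\n'
    print(docstring_reward([sample, "def broken(:"]))
```

The function keeps no state and depends only on its input, so the same completion always gets the same score, which is what makes a reward like this easy to audit.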

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning, and agentic behaviors are on the to-do list.

Repo: https://github.com/think-a-tron/zeno

Docs and more details are in the README.

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.
