frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•8mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Behind Trump vs. Powell Is a Battle over US Empire's Future

https://jacobin.com/2026/01/trump-powell-fed-europe-dollars
1•kaycebasques•3m ago•0 comments

It Can Apply and Positive in Favor the Newton III Law on an Engine System Device

1•monterrey•5m ago•0 comments

State Ofthe Art Novel InFlow 1Gearturbine/Reaction 2Imploturbocompressor/Impulse

1•monterrey•8m ago•0 comments

San Francisco to offer free childcare to people making up to $230000

https://www.theguardian.com/us-news/2026/jan/15/san-francisco-childcare-families
1•darth_avocado•9m ago•0 comments

Podcasting Could Use a Good Asteroid

https://www.joanwestenberg.com/podcasting-could-use-a-good-asteroid/
1•zdw•11m ago•0 comments

Ask HN: What are Claude's skills/what skills does Claude possess?

1•Obscurity4340•12m ago•0 comments

Glyphhanger – Your web font utility belt

https://www.zachleat.com/web/glyphhanger/
1•doodlesdev•14m ago•0 comments

The Myth of the ThinkPad

https://innovintageblog.wordpress.com/2026/01/08/the-myth-of-the-thinkpad/
1•volemo•16m ago•2 comments

Jeff Bezos Needs to Speak Up

https://www.theatlantic.com/ideas/2026/01/raid-washington-post/685621/
1•JumpCrisscross•17m ago•1 comments

Ericsson Silent Layoffs in the US

2•allabouttech•21m ago•1 comments

Trump Moves to Make Tech Giants Pay for Surging Power Costs

https://www.bloomberg.com/news/articles/2026-01-15/trump-to-direct-key-us-grid-operator-to-hold-e...
2•jmcdonald-ut•21m ago•1 comments

America's Throwaway Spies: How the CIA Failed Iranian Informants in Tehran

https://www.reuters.com/investigates/special-report/usa-spies-iran/
2•koolhead17•22m ago•0 comments

Mark Carney and Xi Jinping meet to mend ties as Donald Trump disrupts globe

https://www.ft.com/content/9eeff245-2081-4f97-bc8e-6bbdaf59074e
3•KnuthIsGod•25m ago•0 comments

Fontello – Combine icon webfonts for your own project

https://github.com/fontello/fontello
1•doodlesdev•25m ago•0 comments

Is there any way we can help Stack Overflow Website get back up?

https://stackoverflow.com/questions/79867766/is-there-any-way-we-can-help-stack-overflow-website-...
1•nomilk•25m ago•0 comments

AI as a Compression Problem

https://dkg.fifthhorseman.net/blog/2025-ai-and-compression.html
1•pabs3•26m ago•0 comments

PanoptiCity – interactive map reveals the scale of mass surveillance worldwide

https://panopticity.fr/
2•pabs3•28m ago•0 comments

How Safe Is the Rust Ecosystem? A Deep Dive into Crates.io

https://mr-leshiy-blog.web.app/blog/crates_io_analysis/
1•RustSupremacist•32m ago•0 comments

Trump accepts Nobel Peace medal from Venezuelan opposition leader

https://www.smh.com.au/world/north-america/venezuelan-opposition-leader-says-she-presented-trump-...
2•KnuthIsGod•33m ago•2 comments

Gen X and Millennials Will Inherit Trillions in Real Estate over the Next Decade

https://www.wsj.com/real-estate/luxury-homes/millennial-genx-inherit-real-estate-wealth-d78b4454
2•alephnerd•38m ago•1 comments

From AI agent prototype to product: Lessons from building AWS DevOps Agent

https://aws.amazon.com/blogs/devops/from-ai-agent-prototype-to-product-lessons-from-building-aws-...
1•malahay•42m ago•1 comments

TranslateGemma: A new suite of open translation models

https://blog.google/innovation-and-ai/technology/developers-tools/translategemma/
2•anigbrowl•42m ago•0 comments

Show HN: Buildzr: Python DSL for Authoring C4 Models

https://github.com/amirulmenjeni/buildzr
1•amenji•43m ago•0 comments

Apple's Tactics Could Prevent Japan from Improving Browser Competition

https://open-web-advocacy.org/blog/how_apples_key_tactic_could_prevent_japans_smartphone_act_from...
1•donohoe•46m ago•0 comments

Boeing knew of flaw in part linked to UPS plane crash

https://www.bbc.com/news/articles/cly56w0p9e1o
31•1659447091•49m ago•6 comments

Microsoft Xbox Manufacturing in 2002

https://www.youtube.com/watch?v=YeQrQYFVlXA
1•guidedlight•51m ago•0 comments

Image FX – Free One-Click AI Photo Editor and Image Generator

https://image-fx.app
1•julian2026•52m ago•0 comments

European Alternatives for Digital Products

https://european-alternatives.eu
1•memset•54m ago•0 comments

Show HN: Dev Utility Hub – Client-side only developer tools (JSON, JWT, Cron)

1•hun-ing•57m ago•0 comments

vLLM-MLX – Run LLMs on Mac at 464 tok/s

https://github.com/waybarrios/vllm-mlx
2•waybarrios•1h ago•1 comments