frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•8mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Don't Waste Your Back Pressure

https://banay.me/dont-waste-your-backpressure/
1•emersonmacro•58s ago•0 comments

Deaths in Detention Warn of Horrors Behind ICE's Prison Walls

https://truthout.org/articles/deaths-in-detention-warn-of-horrors-behind-ices-prison-walls/
1•wahnfrieden•1m ago•0 comments

I built visual search for tattoo artists

1•rbaten•2m ago•0 comments

Prominent PR firm accused of commissioning favourable changes to Wikipedia pages

https://www.theguardian.com/technology/2026/jan/16/pr-firm-portland-accused-of-commissioning-favo...
1•1equalsequals1•2m ago•0 comments

Capital Moves: RCA's 70-year quest for cheap labor

https://www.cornellpress.cornell.edu/book/9780801435256/capital-moves/
1•hhs•3m ago•0 comments

No knives, only cook knives

https://kellykozakandjoshdonald.substack.com/p/no-knives-only-cook-knives
1•firloop•4m ago•0 comments

Nourish Food Club – Pasture-Raised, Corn and Soy-Free, Low PUFA

https://nourishfoodclub.com/
1•bilsbie•5m ago•0 comments

MIT Researchers Destroy the Context Window Limit [video]

https://www.youtube.com/watch?v=huszaaJPjU8
1•amichail•5m ago•0 comments

The Setapp Mobile iOS store is shutting down on February 16th

https://www.theverge.com/news/863978/setapp-mobile-ios-store-shutdown
1•raybb•6m ago•0 comments

Show HN: Dokly – Hosted documentation platform for devs

https://www.dokly.co/
1•gsharma1•8m ago•0 comments

Vulnerable WhisperPair Devices – Hijack Bluetooth Accessories Using Fast Pair

https://whisperpair.eu/vulnerable-devices
1•gnabgib•9m ago•0 comments

GitHub Has to Change

https://solmaz.io/log/2026/01/17/github-has-to-change/
1•hosolmaz•9m ago•0 comments

Sunglasses Weaken Your Eyes: Untold Story of Light Sensitivity and Dopamine

https://twitter.com/zaidkdahhaj/status/2012640228864070003
1•bilsbie•10m ago•0 comments

Compiling Scheme to WebAssembly

https://eli.thegreenplace.net/2026/compiling-scheme-to-webassembly/
1•chmaynard•12m ago•0 comments

Sequoia's Mission Accomplished

https://colinsteele.org/blog/sequioas_mission_accomplished/
2•cvillecsteele•14m ago•0 comments

How to be a good conference talk audience member (2022)

https://www.mooreds.com/wordpress/archives/3522
1•mooreds•30m ago•0 comments

Who Gets to Inherit the Stars?

https://techcrunch.com/2026/01/17/who-gets-to-inherit-the-stars-a-space-ethicist-on-what-were-not...
2•zansara•30m ago•1 comments

A Hit Movie Set Deep Inside an AI Lab

https://www.wsj.com/tech/ai/google-deepmind-documentary-youtube-thinking-game-732bfa06
1•bookofjoe•32m ago•1 comments

My Personal Financial Strategy (2020)

https://www.rdegges.com/2020/my-personal-financial-strategy/
1•mooreds•32m ago•0 comments

The Suicide Pact: what happens the moment we invade Greenland

https://substack.com/inbox/post/184398789
6•Eric_WVGG•35m ago•6 comments

DetLLM – Deterministic Inference Checks

https://github.com/tommasocerruti/detllm
1•cerru905•37m ago•1 comments

Musk wants up to $134B in OpenAI lawsuit, despite $700B fortune

https://techcrunch.com/2026/01/17/musk-wants-up-to-134b-in-openai-lawsuit-despite-700b-fortune/
2•SilverElfin•41m ago•2 comments

Camden County Police in New Jersey expands drone program

https://www.cbsnews.com/philadelphia/news/camden-nj-homicides-drone-program/
1•pilingual•42m ago•0 comments

Ask HN: Duterte EJK, 2025-09, US extrajudicial killing in the Caribbean?

1•stopbulying•42m ago•1 comments

Authenticating Digital Evidence in US Courts [pdf]

https://law.baylor.edu/sites/g/files/ecbvkj1546/files/2023-11/7_grimm_capra_joseph.pdf
1•colonCapitalDee•43m ago•0 comments

EU Set to Halt US Trade Deal over Trump's New Tariff Threat

https://www.bloomberg.com/news/articles/2026-01-17/eu-set-to-halt-us-trade-deal-over-trump-s-late...
11•ekjhgkejhgk•44m ago•7 comments

Ask HN: Why is the $0 hijacking of intellectual labor so normalized in OSS?

1•fumi2026•47m ago•6 comments

My Rube Goldberg RSS Pipeline

https://taoofmac.com/space/blog/2026/01/17/2130
1•rcarmo•48m ago•0 comments

Global trust crisis deepfakes AI

https://techfusiondaily.com/global-trust-crisis-deepfakes-ai/
1•nelkazzu•49m ago•0 comments

Ask HN: How AliExpress gets its recommendation as priority in Gmail?

2•RicoElectrico•50m ago•1 comments