Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•8mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool use, etc.).

What's in Zeno for now?
- Auditable, stateless reward functions for Python code: docstrings, ruff linting, type hints, recursion, and more.
- Works directly with Huggingface's TRL or any RL loop: plug reward functions in as needed.
- MIT licensed and minimal.
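To make the idea of a stateless, deterministic reward concrete, here is a minimal sketch of a docstring check. It is not taken from Zeno: the function name `docstring_reward` and the scoring rule (fraction of functions that carry a docstring) are assumptions made for illustration, and it assumes completions arrive as plain strings of Python source.

```python
import ast


def docstring_reward(completions, **kwargs):
    """Return one score per completion: the share of functions with a docstring.

    Hypothetical example, not Zeno's API. Completions that do not parse,
    or that define no functions, score 0.0.
    """
    rewards = []
    for code in completions:
        try:
            tree = ast.parse(code)
        except (SyntaxError, ValueError):
            # Unparseable output gets the lowest reward.
            rewards.append(0.0)
            continue
        funcs = [
            node
            for node in ast.walk(tree)
            if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
        ]
        if not funcs:
            rewards.append(0.0)
            continue
        documented = sum(1 for f in funcs if ast.get_docstring(f) is not None)
        rewards.append(documented / len(funcs))
    return rewards


samples = [
    'def f():\n    """Add nothing."""\n    return 1\n',
    "def g():\n    return 2\n",
    "def (",
]
print(docstring_reward(samples))
```

Because the function takes `completions` plus arbitrary keyword arguments and returns a list of floats, it should fit the shape TRL expects for custom reward callables (for example via the `reward_funcs` argument of `GRPOTrainer`), though that wiring is not shown or verified here.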

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning, and agentic behaviors are on the to-do list.

Repo: https://github.com/think-a-tron/zeno

Docs and more details are in the README.

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

How Etsy Uses LLMs to Improve Search Relevance

https://www.etsy.com/codeascraft/how-etsy-uses-llms-to-improve-search-relevance
1•zdw•2m ago•0 comments

Nvidia suppliers halt H200 output after China blocks chip

https://www.ft.com/content/02a3eb7c-684f-4e39-87b8-36e9595ef800
1•SanjayMehta•6m ago•1 comment

What are best products for creating video or interactive demos for a SaaS?

1•rishabhpoddar•8m ago•0 comments

Intuition why the derivative of $e^x$ is itself

https://math.stackexchange.com/questions/3511144/intuition-why-the-derivative-of-ex-is-itself
1•throwoutway•10m ago•0 comments

A Calif. teen trusted ChatGPT's drug advice. He died from an overdose

https://www.sfgate.com/tech/article/calif-teen-chatgpt-drug-advice-fatal-overdose-21266718.php
2•freediver•12m ago•0 comments

Gas Town is a glimpse into the future

https://johncodes.com/archive/2026/01-16-a-glimpse-into-the-future/
1•jpmcb•15m ago•0 comments

Building a Quake PC

https://fabiensanglard.net/quake_pc/index.html
2•zdw•16m ago•0 comments

China Clamps Down on High-Speed Traders, Removing Servers

https://www.bloomberg.com/news/articles/2026-01-16/china-clamps-down-on-high-speed-traders-removi...
4•petethomas•22m ago•0 comments

Astronaut Charlie Duke (2021) [video]

https://www.youtube.com/watch?v=U7jWk0u4K-E
1•ceroxylon•30m ago•1 comment

I skipped Japan's university exam to write a "computational metaphysics" exam

2•fumi2026•32m ago•0 comments

Grand Illusion

https://chrishedges.substack.com/p/grand-illusion-read-by-eunice-wong
1•chmaynard•35m ago•0 comments

Artisanal Code

https://sunnyamrat.com/posts/2026-01-17-artisanal-code/
2•sunnyam•35m ago•0 comments

Lucasfilm President Kathleen Kennedy to step down

https://www.latimes.com/entertainment-arts/business/story/2026-01-15/lucasfilm-president-kathleen...
4•iancmceachern•37m ago•1 comment

Stay-at-home sons are here – and they're not going anywhere

https://www.washingtonpost.com/style/trends/2026/01/16/stay-at-home-sons/
2•blondie9x•38m ago•2 comments

Show HN: Agent Coworking, Multi-agent networks for AI collaboration (open source)

https://openagents.org/showcase
2•snasan•39m ago•0 comments

Ask HN: Can companies claim copyright over their LLM-generated codebases?

2•mks_shuffle•43m ago•2 comments

GitHub Copilot now supports OpenCode

https://github.blog/changelog/2026-01-16-github-copilot-now-supports-opencode/
4•todsacerdoti•46m ago•0 comments

Show HN: I gave AI persistent memory. Someone didn't like that

1•Shaximus•46m ago•0 comments

Dispute with Russian billionaire leads to 4 Bay Area bankruptcies

https://www.sfgate.com/tech/article/russian-billionaire-four-biotech-bankruptcies-21299298.php
4•nradov•48m ago•0 comments

Crypto grifters are recruiting open-source AI developers

https://www.seangoedecke.com/gas-and-ralph/
3•lalitmaganti•55m ago•1 comment

GoodJob, Solid Queue, Sidekiq, Active Job, in 2026

https://island94.org/2026/01/goodjob-solid-queue-sidekiq-active-job-in-2026
1•thunderbong•57m ago•0 comments

Ben Affleck and Matt Damon on the Limits of AI in Filmmaking [video]

https://www.youtube.com/watch?v=O-2OsvVJC0s
4•karakoram•58m ago•0 comments

Every Inch Matters

https://mercurialsolo.substack.com/p/every-inch-matters
1•mercurialsolo•59m ago•1 comment

Chinese Fishing Boats Form Sea Barriers

https://www.nytimes.com/interactive/2026/01/16/world/asia/china-ships-fishing-militia-blockade.html
6•SubiculumCode•59m ago•3 comments

Video Analysis of ICE Shooting Sheds Light on Contested Moments

https://www.nytimes.com/2026/01/15/video/ice-shooting-renee-good-minneapolis-videos.html
5•treetalker•1h ago•1 comment

Alabama Snowfall Forecast

https://ema.alabama.gov/2026/01/15/thursday-afternoon-update-on-possible-snowfall-this-weekend/
1•qwertyuiop_•1h ago•0 comments

Show HN: Making Claude Code sessions link-shareable

3•reflectivetrap•1h ago•0 comments

Claude Code sessions are now link-shareable

https://github.com/OmkarKovvali/claude-session-share
1•reflectivetrap•1h ago•1 comment

Nearly 5M Accounts Removed Under Australia's New Social Media Ban

https://www.nytimes.com/2026/01/15/world/australia/social-media-ban-australia.html
4•bookofjoe•1h ago•1 comment

Anything Will Work (In AI)

https://publish.obsidian.md/ueaj/Machine+Learning/Theory/Anything+WILL+work
1•qouteall•1h ago•0 comments