frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•6mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Show HN: TrailWrightQA – local-first, AI-assisted Playwright UI testing

https://github.com/marktl/TrailWrightQA
1•marktl•3m ago•0 comments

Accommodation Nation: America's colleges have an extra-time-on-tests problem

https://www.theatlantic.com/magazine/2026/01/elite-university-student-accommodation/684946/
1•petethomas•7m ago•0 comments

A Trajetória Do Assistente Social No Contexto Do Terceiro SETOr

https://minutocaptamais.substack.com/p/a-trajetoria-do-assistente-social
1•drallanvieira•8m ago•0 comments

When the Boss Is Always Right, the AI Will Be Wrong

https://www.bloomberg.com/opinion/articles/2025-12-02/ai-will-be-bad-if-the-tech-ceo-is-always-right
1•petethomas•10m ago•0 comments

Ask HN: What fiction books would you recommend for programmers?

3•superconduct123•11m ago•1 comments

Most Agentic AI failures I've debugged turned out to be ingestion drift

2•wehadit•12m ago•0 comments

Thoughts of a Neopagan / the Spirituality

1•5wizard5•13m ago•0 comments

I wrote JustHTML using coding agents

https://friendlybit.com/python/writing-justhtml-with-coding-agents/
1•EmilStenstrom•14m ago•0 comments

What I learned building an opinionated and minimal coding agent

https://mariozechner.at/posts/2025-11-30-pi-coding-agent/
1•the_mitsuhiko•17m ago•0 comments

Git read-tree: Carbon-Copy without Merge Hell

https://blog.zenosmosis.com/posts/5-git-read-tree/
1•rustic-indian•22m ago•1 comments

Id Software was Lazy – DOOM could have had PC Speaker Music

https://lenowo.org/viewtopic.php?t=45
2•minki_the_avali•26m ago•0 comments

Ask HN: Do you think you have your location services on?

2•jacquesm•27m ago•1 comments

Ivan Sutherland Sketchpad Demo 1963 [video]

https://www.youtube.com/watch?v=6orsmFndx_o
1•fs_software•31m ago•0 comments

AI Mathematical Olympiad – Progress Prize 3

https://www.kaggle.com/competitions/ai-mathematical-olympiad-progress-prize-3
1•kristianp•32m ago•0 comments

MADvent – A Math and Logic Advent Calendar for Your Kids

https://madvent.amithm.ca/about
1•amitpm•33m ago•1 comments

Noodl.ist

https://jetgirl.art/introducing-noodlist/
1•jetgirl•33m ago•0 comments

Agents need good developer experience too

https://modal.com/blog/agents-devex
1•birdculture•35m ago•0 comments

Richard Feldman, "New Ways to Roc" [video]

https://www.youtube.com/watch?v=VnPw9rk8FI8
1•stephdin•35m ago•0 comments

Show HN: A "what-if" budget planner app born from new-parent chaos

https://planstheapp.com
1•riario•35m ago•0 comments

Helping Agents Debug Webapps

https://blog.fsck.com/2025/12/02/helping-agents-debug-webapps/
1•Ch00k•38m ago•0 comments

Oracle Credit Fear Gauge Hits Highest Since 2009 on AI Bubble Fears

https://www.bloomberg.com/news/articles/2025-12-02/oracle-credit-fear-gauge-hits-highest-since-20...
1•petethomas•39m ago•0 comments

Honduran ex-president released from US prison after Trump pardon

https://www.bbc.com/news/articles/cpvdr8k7xjro
5•wslh•44m ago•0 comments

H-1B to Plan B: India's top tech talent looks beyond the U.S.

https://restofworld.org/2025/india-tech-talent-diversifies-beyond-us/
1•nanfinitum•47m ago•0 comments

Comparison of Waymo Rider-Only crash rates by crash type to human benchmarks

https://www.tandfonline.com/doi/full/10.1080/15389588.2025.2499887
1•agnosticmantis•50m ago•0 comments

Rebinding for Observer-Safe Information Design

https://rebinding.is/
1•isaacbowen•54m ago•0 comments

Ask HN: Which web browser are you using and why?

4•throwaway81998•55m ago•7 comments

Claude the albino alligator in Cal Academy passed away at age 30

https://www.calacademy.org/press/releases/claude-the-albino-alligator-passes-away-at-age-30
3•elinear•57m ago•0 comments

Claude Died

https://abc7news.com/post/cal-academy-announces-beloved-claude-albino-alligator-has-died-30/18241...
15•jumploops•1h ago•3 comments

Cloth Simulation

https://cloth.mikail-khan.com/
2•adamch•1h ago•0 comments

Show HN: Build CLI apps with Ink that run in the browser

https://www.ink-web.dev/
2•thoughtfulchris•1h ago•0 comments