frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•6mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

AI receptionist, look for GTM cofounder

https://callpal.com
1•franklin_yao•3m ago•1 comments

Show HN: An AI environment to understand sources or topics

https://www.kerns.ai/
1•kanodiaayush•4m ago•0 comments

Monument Valley Blog

https://monumentvalley.blog/
1•Nancy1230•4m ago•0 comments

Google OCS Apollo: The >$3B Game-Changer in Datacenter Networking (2023)

https://newsletter.semianalysis.com/p/google-apollo-the-3-billion-game
1•smj-edison•7m ago•0 comments

Collagen amino acid composition supplementation reduces biological age in humans

https://www.nature.com/articles/s41514-025-00280-7
1•walterbell•8m ago•0 comments

ChatGPT's Biggest Foe: Poetry

https://nautil.us/chatgpts-biggest-foe-poetry-1252100/
3•billybuckwheat•10m ago•0 comments

Search for Malaysia Airlines flight MH370 to resume

https://www.theguardian.com/world/2025/dec/03/search-for-malaysia-airline-flight-mh370-to-resume-...
1•n1b0m•11m ago•0 comments

China's first orbital booster landing attempt

https://visitor.passport.weibo.cn/visitor/visitor?entry=sinawap&a=enter&url=https%3A%2F%2Fm.weibo...
4•nkoren•13m ago•1 comments

Apple Aperture: Senior QA (2004-2005)

https://substack.techreflect.org/p/aperture-senior-qa-2004-2005
1•Austin_Conlon•14m ago•0 comments

AI's Missing UI

https://www.fujimon.com/blog/missing-ui
2•yuyafujimoto•18m ago•0 comments

RFC-8927: JSON Type Definition

https://datatracker.ietf.org/doc/html/rfc8927
1•rapnie•20m ago•0 comments

Why are Transformers replacing CNNs? [video]

https://www.youtube.com/watch?v=KnCRTP11p5U
1•chii•21m ago•0 comments

GitHub Trending Page Stuck for a Month

https://github.com/orgs/community/discussions/179946
4•keksis_leo•38m ago•0 comments

Ask HN: Vibe Coded Apps in Production

1•rajeshrajappan•40m ago•2 comments

Easy way bulk process color pairs for wcag a11y compliance

https://stackoverflow.com/questions/53639685/color-contrast-accessibility-checker
1•lalithaar•44m ago•0 comments

AI Monetization: Turn user ideas into shared IP via AI+Human screening

https://medium.com/@zaranur848/a-new-monetization-pathway-for-ai-platforms-using-multi-layer-ai-e...
1•haizei•49m ago•0 comments

How Can Interpretability Researchers Help AGI Go Well?

https://www.alignmentforum.org/posts/MnkeepcGirnJn736j/how-can-interpretability-researchers-help-...
1•gmays•50m ago•0 comments

Researchers Find Microbe Capable of Producing Oxygen from Martian Soil

https://scienceclock.com/microbe-that-could-turn-martian-dust-into-oxygen/
1•ashishgupta2209•54m ago•1 comments

Cellular Blueprint for How We Think, Feel

https://news.gsu.edu/2025/12/02/georgia-state-brain-researchers-draw-cellular-blueprint-for-how-w...
1•XzetaU8•58m ago•0 comments

Glass-detect: a detector for Ray-Ban Meta glasses

https://github.com/sh4d0wm45k/glass-detect
1•holysoles•1h ago•0 comments

We built a database of 290k English medieval soldiers; here's what it reveals

https://theconversation.com/we-built-a-database-of-290-000-english-medieval-soldiers-heres-what-i...
2•mellosouls•1h ago•0 comments

Show HN: An emotional steering website for Qwen 2.5 7B

https://aifeels.chat/
1•nicetomeetyu•1h ago•0 comments

China's first reusable rocket Zhuque-3 makes maiden voyage but recovery fails

https://www.scmp.com/news/china/science/article/3335004/chinas-first-reusable-rocket-zhuque-3-mak...
2•perihelions•1h ago•0 comments

AI Is Breaking the Moral Foundation of Modern Society

https://eyeofthesquid.com/ai-is-breaking-the-moral-foundation-of-modern-society-a145d471694f
70•TinyBig•1h ago•61 comments

Website Task Flowchart

https://xkcd.com/3175/
2•pabs3•1h ago•0 comments

Quad9 DOH HTTP/1.1 Retirement, December 15, 2025

https://quad9.net/news/blog/doh-http-1-1-retirement/
20•pickledoyster•1h ago•0 comments

From Code Foundation Models to Agents and Applications: A Practical Guide

https://arxiv.org/abs/2511.18538
2•eunos•1h ago•0 comments

The Math Crisis at UC San Diego [video]

https://www.youtube.com/watch?v=qYfDQdcVKaQ
2•chii•1h ago•0 comments

Motif – Draw Tile Patterns

https://motif.works/
2•bryanrasmussen•1h ago•0 comments

The Vibe Coding Labyrinth

https://connelllocke.substack.com/p/the-vibe-coding-labyrinth
2•nervous-energy•1h ago•0 comments