frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•6mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Ask HN: Is ChatGPT Experiencing a Degradation?

1•spIrr•45s ago•0 comments

Same Product, Same Store, but on Instacart, Prices Might Differ

https://www.nytimes.com/2025/12/09/business/instacart-algorithmic-pricing.html
1•subhero•2m ago•1 comments

RealGen: Photorealistic Text-to-Image Generation via Detector-Guided Rewards

https://yejy53.github.io/RealGen/
1•doener•3m ago•0 comments

Did Hitler really Have a 'Micropenis'?

https://www.theguardian.com/tv-and-radio/2025/nov/13/did-hitler-really-have-a-micropenis-hitlers-...
1•wjSgoWPm5bWAhXB•3m ago•0 comments

Show HN: Iceberg-JS, a TypeScript Client for the Apache Iceberg REST Catalog

https://github.com/supabase/iceberg-js
1•kiwicopple•5m ago•0 comments

Learning a new programming language with an LLM

https://feeding.cloud.geek.nz/posts/learning-new-programming-language-with-ai/
1•speckx•6m ago•0 comments

SSE sucks for transporting LLM tokens

https://zknill.io/posts/sse-sucks-for-transporting-llm-tokens/
1•zknill•7m ago•0 comments

Hyper-Scalers Are Using CXL to Lower the Impact of DDR5 Supply Constraints

https://www.servethehome.com/hyper-scalers-are-using-cxl-to-lower-the-impact-of-ddr5-supply-const...
1•giuliomagnifico•7m ago•0 comments

Creativity and Mental Health

https://en.wikipedia.org/wiki/Creativity_and_mental_health
1•wseqyrku•7m ago•0 comments

The Rise of the 0.1x Engineer

https://www.jerpint.io/blog/2025-12-08-01x-engineer/
2•jerpint•8m ago•0 comments

A new way to trigger responses in the body by simulating psychological pressure

https://medicalxpress.com/news/2025-11-streak-trigger-responses-body-simulating.html
1•PaulHoule•8m ago•0 comments

Evidence That Humans Now Speak in a Chatbot-Influenced Dialect Getting Stronger

https://gizmodo.com/chatbot-dialect-2000696509
1•pseudolus•9m ago•0 comments

Show HN: Avatune – SSR-friendly avatars with browser ML for attribute prediction

https://www.avatune.dev/
2•teimurjan•10m ago•0 comments

DeepMath: A lightweight math reasoning Agent with smolagents

https://huggingface.co/blog/intel-deepmath
3•ibobev•11m ago•0 comments

Advent of Code in Dialog

https://entropicthoughts.com/advent-of-code-in-dialog
2•ibobev•11m ago•0 comments

Multibase CLI

http://www.chriswarbo.net/blog/2025-12-07-multibase_cli.html
1•ibobev•12m ago•0 comments

Offline cybersecurity AI using RAG and local LLM (Python, FAISS, Llama 3.1)

https://gitlab.com/sydsec1/Syd
1•todsacerdoti•12m ago•0 comments

AWS re:Invent re:Watch tool

https://myrewatch.link/
2•cebert•14m ago•0 comments

All the Places

https://alltheplaces.xyz/
2•djoldman•15m ago•0 comments

Atomic time source failure at NIST Gaithersburg campus

https://groups.google.com/a/list.nist.gov/g/internet-time-service/c/Zd7VaR-vqV4
2•ahlCVA•16m ago•0 comments

Which Roadmap Path Are You On?

https://holenventures.substack.com/p/which-roadmap-path-are-you-on
1•hholen•17m ago•0 comments

My App Will Harm You Physically, Using Math

https://prolost.com/blog/drinkingbuddy
1•ingve•17m ago•0 comments

Show HN: I built a dual-engine diagram app to fix my own workflow

https://diagram-generator.com/
1•dongjiewu•20m ago•0 comments

Spending too much time on one AoC problem

https://richclubb.github.io/blog/spending_way_too_much_time/
1•jamesbelchamber•21m ago•1 comments

60 Years of Artificial Intelligence at Stanford (2023) [video]

https://www.youtube.com/watch?v=Cn6nmWlu1EA
1•swatson741•25m ago•0 comments

Torture Techniques from CIA Black Sites Were Used at Alligator Alcatraz

https://www.forever-wars.com/torture-techniques-from-cia-black-sites-were-used-at-alligator-alcat...
15•perihelions•27m ago•2 comments

The AI bust scenario that no one is talking about

https://www.noahpinion.blog/p/the-ai-bust-scenario-that-no-one
2•maelito•28m ago•0 comments

Show HN: I Built an Plug and Play UI Library with Motion Animations

https://ogblocks.dev/
2•Karanzk•30m ago•0 comments

Join the on-call roster, it'll change your life

https://serce.me/posts/2025-12-09-join-oncall-it-will-change-your-life
2•todsacerdoti•34m ago•0 comments

Pydantic-AI Deepagents

https://github.com/vstorm-co/pydantic-deep
3•kacper-vstorm•34m ago•1 comments