frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•7mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

I Stopped Using the Notion Mobile App for Tasks

https://medium.com/@kresstudios/why-i-stopped-using-the-notion-mobile-app-for-tasks-94213ea20164
1•luis_journey•1m ago•0 comments

8-Bit Boléro by Linus Åkesson [video]

https://www.youtube.com/watch?v=kMGbGGllQoE
1•msk-lywenn•2m ago•0 comments

K-12 National staffing and enrollment trends

https://edunomicslab.org/staffing-v-enrollment-trends/
1•mhb•3m ago•0 comments

Show HN: Crypto Inheritance with Shamir Secret

https://shardium.maxcomperatore.com
1•maxcomperatore•4m ago•0 comments

Braid: Bounded Reasoning for Autonomous Inference and Decisions

https://arxiv.org/abs/2512.15959
1•arbayi•4m ago•0 comments

Ask HN: What do you look for when evaluating privacy-first ad platforms or MVPs?

1•frndsprotocol•5m ago•0 comments

Ask HN: Do you allow vibecoded submissions in your open-source projects?

2•sneas•6m ago•0 comments

Samsung Announces First 2nm Mobile Chip Ahead of Apple

https://www.macrumors.com/2025/12/19/samsung-exynos-2600-chip-2nm-process-apple/
1•tosh•7m ago•0 comments

China Boosts AI Chip Output by Upgrading Older ASML Machines

https://www.ft.com/content/d10398db-b8b4-40f3-8c6d-b340470f5f3c
2•karakoram•11m ago•1 comments

Lies, Damned Lies and Trump Speeches

https://paulkrugman.substack.com/p/lies-damned-lies-and-trump-speeches
2•rbanffy•11m ago•0 comments

Ask HN: Has putting yourself in the "right rooms" improved your life?

2•shannonalp•12m ago•0 comments

We built a universal installer for agent skills based on the new open standard

https://github.com/skillcreatorai/Ai-Agent-Skills
6•skillcreator•12m ago•3 comments

An unreasonable book (1976) [pdf]

http://jmc.stanford.edu/artificial-intelligence/reviews/weizenbaum.pdf
1•andsoitis•12m ago•0 comments

Ask HN: Is Google Search and Services down for anyone else?

1•exploraz•14m ago•2 comments

Registry You Can Actually Query

https://writethat.blog/reg.html
1•psarna•15m ago•0 comments

Efficient Attention Mechanisms for Large Language Models: A Survey

https://arxiv.org/abs/2507.19595
1•belter•16m ago•0 comments

An integrated view of the structure and function of the human 4D nucleome

https://www.nature.com/articles/s41586-025-09890-3
1•bookofjoe•17m ago•0 comments

YouTube Is Degraded

https://downdetector.co.uk/status/youtube/
12•alphawong•18m ago•9 comments

Timing 'Hello World'

https://antonz.org/timing-hello-world/
1•blenderob•18m ago•0 comments

YouTube Outage [video]

https://www.youtube.com/watch?v=3MG_Dm4kefc
4•vettyvignesh•19m ago•2 comments

It's the Great AGI Rebrand

https://www.theverge.com/ai-artificial-intelligence/845890/ai-companies-rebrand-agi-artificial-ge...
1•sandbach•19m ago•0 comments

Computer Power and Human Reason (1976) [pdf]

http://blogs.evergreen.edu/cpat/files/2013/05/Computer-Power-and-Human-Reason.pdf
1•andsoitis•19m ago•0 comments

JWT Playground for Developers

https://www.devglan.com/online-tools/jwt-decoder-validator
1•only2dhir•20m ago•0 comments

AI Can Write Your Code. It Can't Do Your Job

https://terriblesoftware.org/2025/12/11/ai-can-write-your-code-it-cant-do-your-job/
1•antfarm•21m ago•0 comments

Football Pools

https://en.wikipedia.org/wiki/Football_pools
2•zeristor•23m ago•0 comments

NASA's Webb Observes Exoplanet Whose Composition Defies Explanation

https://science.nasa.gov/missions/webb/nasas-webb-observes-exoplanet-whose-composition-defies-exp...
2•taubek•23m ago•0 comments

Pa Supreme Ct allows non-warranted access to your Google searches

https://reason.com/volokh/2025/12/16/are-there-fourth-amendment-rights-in-google-search-terms/
3•mwexler•24m ago•0 comments

Show HN: Liquid Glass dev and design resources for platforms beyond iOS&macOS

https://www.liquidglassresources.com/
2•andraskindler•24m ago•0 comments

Ask HN: Incorporate in Which Country?

4•finnvyrn•26m ago•1 comments

Carving Nature at Your Joints

https://planktonvalhalla.com/20251219-carving-nature-at-your-joints/
1•mrcgnc•29m ago•0 comments