frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•6mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

In the Beginning Was the Command Line

https://web.stanford.edu/class/cs81n/command.txt
1•wseqyrku•32s ago•0 comments

Portugal updates cybercrime law to exempt security researchers

https://www.bleepingcomputer.com/news/security/portugal-updates-cybercrime-law-to-exempt-security...
1•N19PEDL2•2m ago•0 comments

Optimizing Iceberg Compaction: Why We Built an Embedded Engine in Rust

https://risingwave.com/blog/implementing-iceberg-compaction-rust/
1•WavyPeng•3m ago•0 comments

Geoffrey Hinton says Google is 'beginning to overtake' OpenAI

https://www.businessinsider.com/ai-godfather-geoffrey-hinton-google-overtaking-openai-2025-12
1•djhu9•4m ago•0 comments

Zuck#: A programming language for connecting the world. And harvesting it

https://jayzalowitz.github.io/zucksharp/
1•kf•6m ago•0 comments

Show HN: Dograh – an OSS Vapi alternative to quickly build and test voice agents

https://github.com/dograh-hq/dograh
1•a6kme•9m ago•1 comments

Curated SRE/DevOps/Cloud Job Board

https://sshcareers.com/
1•sajokrit•13m ago•0 comments

You Gotta Push If You Wanna Pull

https://www.morling.dev/blog/you-gotta-push-if-you-wanna-pull/
1•ingve•14m ago•0 comments

Knuth 2025 Christmas lecture: Adventures with Knight's Tours [video]

https://www.youtube.com/watch?v=MKiRte-tnMY
2•fs123•18m ago•0 comments

Ask HN: Smart Glasses with a Decent Camera?

1•brcmthrowaway•18m ago•0 comments

Show HN: Detect employee skill gaps and develop training plans

https://semis.reispar.com
1•Mcjulie•25m ago•0 comments

Adding Unpack Syntax to RCL

https://ruudvanasseldonk.com/2025/adding-unpack-to-rcl
3•todsacerdoti•27m ago•0 comments

Applets Are Officially Gone, but Java in the Browser Is Better

https://frequal.com/java/AppletsGoneButJavaInTheBrowserBetterThanEver.html
15•pjmlp•28m ago•8 comments

Can an English Fish Merchant Turn a Profit in Asia's Largest Market? [video]

https://www.youtube.com/watch?v=PqAf1qRcixc
1•tosh•28m ago•0 comments

The AI-Fication of Cyberthreats

https://www.trendmicro.com/vinfo/gb/security/research-and-analysis/predictions/the-ai-fication-of...
2•runningmike•29m ago•0 comments

GitHub Actions Has a Package Manager, and It Might Be the Worst

https://nesbitt.io/2025/12/06/github-actions-package-manager.html
2•robin_reala•29m ago•0 comments

Why Startups Die

https://www.techfounderstack.com/p/why-startups-die
3•makle•39m ago•0 comments

Show HN: Weather mini – Trip forecasts powered by Apple Intelligence

https://weathermini.app
1•kailuo•42m ago•0 comments

From Azure Functions to FreeBSD

https://jmmv.dev/2025/12/from-azure-functions-to-freebsd.html
1•todsacerdoti•42m ago•0 comments

Guidance: A cheat code for diffusion models

https://sander.ai/2022/05/26/guidance.html
2•tesserato•42m ago•0 comments

App that turns sermon notes to daily devotionals

https://apps.apple.com/us/app/serma/id6745926259
1•roosells•45m ago•1 comments

Closer Look at the Birthday Paradox [video]

https://www.youtube.com/watch?v=OUQVhnuTMVU
1•duck•46m ago•0 comments

European commission X ad account has been terminated

https://twitter.com/nikitabier/status/1997450541275005041
1•teekert•46m ago•0 comments

Show HN: Bugmail – the easiest way to catch and fix production bugs

https://www.bugmail.site/
2•bumpymark•49m ago•0 comments

Show HN: Pocket PMO – Quick free PMO oversight

https://pocketpmo.com/
1•iamasuperuser•49m ago•0 comments

Starmer's Electoral Posturing

https://rodgercuddington.substack.com/p/starmers-electoral-posturing
2•freespirt•50m ago•1 comments

Show HN: GffutilsAI, an agent to analyze genomic files

https://www.biorxiv.org/content/10.64898/2025.12.02.690645v1
1•sbassi•50m ago•0 comments

Drunk Driving – Grand Rapids Dip

https://en.wikipedia.org/wiki/Drunk_driving
1•thunderbong•50m ago•1 comments

The Collapse of Trust in AI Assistants

https://zenodo.org/records/17837188
6•businessmate•53m ago•1 comments

When software becomes fast food

https://world.hey.com/joaoqalves/when-software-becomes-fast-food-23147c9b
2•kiyanwang•56m ago•0 comments