
Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•6mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extensible across domains (math, retrieval, reasoning, tool use, etc.).

What's in Zeno for now?
- Auditable, stateless reward functions for Python code: docstrings, ruff linting, type hints, recursion, and more.
- Works directly with Hugging Face's TRL or any RL loop: plug reward functions in as needed.
- MIT licensed and minimal.
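
To make "stateless reward function" concrete, here is a minimal sketch of a docstring reward. It is not taken from Zeno's codebase: the name `docstring_reward` and the scoring rule (fraction of functions that carry a docstring) are assumptions for illustration. The `(completions, **kwargs)` signature returning one float per completion follows the convention TRL uses for custom reward functions, and it assumes completions arrive as plain strings of Python source.

```python
import ast


def docstring_reward(completions, **kwargs):
    """Score each completion by the fraction of its functions that have a docstring.

    Hypothetical example, not Zeno's actual API. Assumes each completion is a
    plain string containing Python source code.
    """
    rewards = []
    for code in completions:
        try:
            tree = ast.parse(code)
        except (SyntaxError, ValueError):
            # Code that does not parse gets the minimum reward.
            rewards.append(0.0)
            continue

        # Collect every function definition, including nested and async ones.
        funcs = [
            node
            for node in ast.walk(tree)
            if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
        ]
        if not funcs:
            # Nothing to document, so nothing to reward.
            rewards.append(0.0)
            continue

        documented = sum(1 for func in funcs if ast.get_docstring(func))
        rewards.append(documented / len(funcs))
    return rewards
```

Because the function depends only on its inputs, the same completion always gets the same score, which is what makes the reward auditable. A function of this shape could be passed in a trainer's list of reward functions or called directly from a custom RL loop.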

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning, and agentic behaviors are on the to-do list.

Repo: https://github.com/think-a-tron/zeno

Docs and more details are in the README.

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.
