frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•6mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

The Oxford Word of the Year 2025 is "rage bait"

https://corp.oup.com/news/the-oxford-word-of-the-year-2025-is-rage-bait/
1•CuriousCorvid•1m ago•0 comments

For First Time in Decades, Child Deaths Will Rise This Year

https://www.wsj.com/health/for-first-time-in-decades-child-deaths-will-rise-this-year-92c67b51
1•petethomas•7m ago•0 comments

Median Filter over Arbitrary Datatypes

https://martianlantern.github.io//2025/09/median-filter-over-arbitrary-datatypes/
1•martianlantern•8m ago•0 comments

Strategy Letter IV: Bloatware and the 80/20 Myth (2001)

https://www.joelonsoftware.com/2001/03/23/strategy-letter-iv-bloatware-and-the-8020-myth/
1•NavinF•9m ago•0 comments

Open, Vendor-Neutral Framework for AI/ML Compute Optimization

https://outerbounds.com/blog/six-steps-to-cost-optimization
1•frktcpumu•11m ago•1 comments

Google's Android for desktops and laptops is called "Aluminium – OSnews

https://www.osnews.com/story/143907/googles-android-for-desktops-and-laptops-is-called-aluminium/
1•abdelhousni•11m ago•1 comments

[Free Lifetime] [Connect with Travelers in Every City]

1•aacishh•11m ago•0 comments

Climbing a different kind of tree [video]

https://www.youtube.com/shorts/cIQ8vbiL_pA
1•programmexxx•13m ago•0 comments

Show HN: Lynkr – Claude Code-Compatible Proxy for Databricks/Azure Anthropic

https://github.com/vishalveerareddy123/Lynkr/wiki/Emulating-the-Claude-Code-Backend-for-LLM%27s-h...
1•vishalveera•14m ago•0 comments

Ask HN: Do you still think public blockchains/stablecoins are useless/a scam?

1•spir•15m ago•2 comments

Arrested by Phone: A Graphic Novel About a Real-Life Nightmare

https://www.bloomberg.com/graphics/2025-india-digital-arrest-by-phone-graphic-novel/
1•petethomas•17m ago•0 comments

Perplexity leaked its system prompt by accident just because I asked in Hindi

https://old.reddit.com/r/PromptEngineering/comments/1pdd66c/perplexity_leaked_its_entire_system_p...
1•achow•21m ago•0 comments

React2Shell (CVE-2025-55182/CVE-2025-66478)

https://react2shell.com/
1•orkj•27m ago•1 comments

Show HN: Mirror_bridge – C++ Reflection powered Python binding generation

https://github.com/FranciscoThiesen/mirror_bridge
1•fthiesen•28m ago•0 comments

Ethereum's Fusaka upgrade today added sharding via data availability sampling

https://twitter.com/ethereum/status/1996226190399455358
1•spir•31m ago•1 comments

Little something to help third world countries candidates

https://cvai.dev/
1•pukarkhanal•36m ago•1 comments

China planted so many trees it's changed the country's water distribution

https://www.livescience.com/planet-earth/plants/china-has-planted-so-many-trees-its-changed-the-e...
1•achow•36m ago•0 comments

Show HN: Onetone – A full-stack framework with custom C interpreter

https://github.com/onetoneframework/framework
1•tactics6655•37m ago•0 comments

A Cosmic Offense: Elias Canetti's contest against death

https://www.commonwealmagazine.org/cosmic-offense
1•diodorus•37m ago•0 comments

Uncloud - Tool for deploying containerised apps across servers without k8s

https://uncloud.run/
4•rgun•38m ago•0 comments

Lego ZX Spectrum – Tribute to Sir Clive Sinclair

https://beta.ideas.lego.com/product-ideas/1113841c-596d-4f28-be4a-367cc83e8ed1
2•sohkamyung•39m ago•0 comments

Zero-Setup Java Build Tooling via Mill Bootstrap Scripts

https://mill-build.org/blog/16-zero-setup.html
1•lihaoyi•46m ago•0 comments

I Have No Identification Cards – Robin Greenfield

https://www.robingreenfield.org/identification/
1•pkaeding•53m ago•0 comments

Mirror_bridge – C++ reflection for generating Python/JS/Lua bindings

https://chico.dev/Mirror-Bridge/
2•fthiesen•53m ago•0 comments

A Protocol for Measuring Answer Space Occupancy in Large Language Models

https://zenodo.org/records/17810543
1•businessmate•54m ago•1 comments

Foreign-dlopen: call dlopen from static programs

https://github.com/pfalcon/foreign-dlopen
1•todsacerdoti•1h ago•0 comments

Why our AI future may look less like Skynet and more like Olympus

https://awesomeworld.substack.com/p/why-our-ai-future-may-look-less-like
2•dstavisky•1h ago•1 comments

Gex X Rocks but Whatever

https://medium.com/@jonathacz99/what-a-sex-worker-notices-about-gen-x-and-gen-z-men-fd0d13b6c203
1•karmaniverous•1h ago•0 comments

Show HN: A Minimal Monthly Task Planner (printable, offline, no signup)

https://printcalendar.top/
4•defcc•1h ago•2 comments

Google has released Android 16 QPR2's source code

https://www.androidauthority.com/android-16-qpr2-source-code-3621513/
1•thunderbong•1h ago•0 comments