frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•4mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Show HN: I made DressMate, an AI to decide what to wear from your own wardrobe

https://dressmate-ai.com/
1•novaTheMachine•50s ago•0 comments

Practical seed recovery for the PCG pseudo-random number generator

https://tosc.iacr.org/index.php/ToSC/article/view/8700
1•fanf2•52s ago•0 comments

Which Nested Data Format Do LLMs Understand Best? JSON vs. YAML vs. XML vs. Md

https://www.improvingagents.com/blog/best-nested-data-format/
1•mattcollins•1m ago•1 comments

Distil-PII: family of PII redaction SLMs

https://github.com/distil-labs/Distil-PII
1•party-horse123•2m ago•1 comments

How America got hooked on ultraprocessed foods

https://www.nytimes.com/interactive/2025/10/16/well/eat/ultraprocessed-food-junk-history.html
1•mykowebhn•3m ago•0 comments

AnyUp: Universal Feature Upsampling

https://wimmerth.github.io/anyup/
1•mariuz•4m ago•0 comments

Blaand – Seeing Whey in a New Old Way (2019)

https://medievalmeadandbeer.wordpress.com/2019/05/29/blaand-seeing-whey-in-a-new-old-way/
1•Kaibeezy•5m ago•0 comments

A Ukrainian F-16 Ace Crushed Six Russian Missiles with Just Four of His Own

https://united24media.com/war-in-ukraine/how-one-ukrainian-f-16-ace-pilot-crushed-six-russian-mis...
1•JumpCrisscross•5m ago•0 comments

Xi's Rare Earth Shock Gives Trump a Chance to Win over US Allies

https://www.bloomberg.com/news/articles/2025-10-16/xi-s-rare-earth-shock-gives-trump-a-chance-to-...
1•wslh•6m ago•1 comments

AI Photo Editor:Edit Your Photos in Seconds

https://www.aiphotoedit.pro/
1•Ante_max•6m ago•0 comments

Entoptic Phenomena (1995)

https://www.oubliette.org.uk/Intro.html
1•Kaibeezy•7m ago•0 comments

An Introduction to Event Theory

https://yonkeltron.com/posts/an-introduction-to-event-theory/
3•Bogdanp•8m ago•0 comments

Ask HN: What If AGI = Awareness Grown Internally?

2•f_of_t_•9m ago•2 comments

Tor browser removing various Firefox AI features

https://blog.torproject.org/new-alpha-release-tor-browser-150a4/
2•HelloUsername•9m ago•0 comments

Exasol's distributed MPP architecture vs. DuckDB

https://www.odbms.org/2025/10/on-exasols-distributed-mpp-architecture-vs-duckdb-qa-with-mathias-g...
3•astigsen•11m ago•0 comments

Run interactive commands in Gemini CLI

https://developers.googleblog.com/en/say-hello-to-a-new-level-of-interactivity-in-gemini-cli/
2•ridruejo•11m ago•0 comments

Even the Inventor of 'Vibe Coding' Says Vibe Coding Can't Cut It

https://gizmodo.com/even-the-inventor-of-vibe-coding-says-vibe-coding-cant-cut-it-2000672821
3•rbanffy•13m ago•1 comments

I Became a Police Abolitionist (2020)

https://www.theatlantic.com/ideas/archive/2020/07/how-i-became-police-abolitionist/613540/
2•robtherobber•14m ago•0 comments

Derek Sivers's database and web apps

https://github.com/sivers/sivers
3•surprisetalk•15m ago•0 comments

Microsoft identifies boardroom cyber awareness as a top priority

https://www.computerweekly.com/news/366632783/Microsoft-identifies-boardroom-cyber-awareness-as-a...
2•beardyw•15m ago•0 comments

The State of PHP 2025

https://blog.jetbrains.com/phpstorm/2025/10/state-of-php-2025/
4•mikece•18m ago•0 comments

A Pill That Prints

https://actu.epfl.ch/news/a-pill-that-prints-2/
2•geox•21m ago•0 comments

Finding and fixing software bugs automatically with SapFix and Sapienz

https://engineering.fb.com/2018/09/13/developer-tools/finding-and-fixing-software-bugs-automatica...
2•chw9e•21m ago•0 comments

NASA should go all-in on nuclear propulsion

https://bigthinkmedia.substack.com/p/why-nasa-should-go-all-in-on-nuclear
5•pseudolus•25m ago•0 comments

OpenAI hires black hole physicist in broader science push

https://www.axios.com/2025/10/16/openai-science-black-hole-physicist
5•mfiguiere•27m ago•0 comments

Color-changing organogel stretches 46 times its size and self-heals

https://phys.org/news/2025-09-organogel-size.html
2•PaulHoule•29m ago•0 comments

AI-Powered Stagehand + bisect:Finding and Fixing the Commit That Broke Your Code

https://qckfx.com/blog/ai-powered-stagehand-git-bisect-finding-and-fixing-the-commit-that-broke-y...
2•chw9e•29m ago•0 comments

Once unthinkable, NASA, Lockheed now consider launching Orion on other rockets

https://arstechnica.com/space/2025/10/once-unthinkable-nasa-and-lockheed-now-consider-launching-o...
5•rbanffy•29m ago•2 comments

Show HN: Binharic – A terminal-based AI coding agent

2•habedi0•29m ago•0 comments

Head of Apple's AI Search Project Leaves to Join Meta

https://www.macrumors.com/2025/10/16/apple-ai-search-head-leaves-for-meta/
3•thm•30m ago•0 comments