frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•8mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Show HN: Keepthat.link – rudimentary, no-frills bookmarks

https://www.keepthat.link/
1•e_xyz•1m ago•0 comments

Childhood Neighbors Influence Occupation Choice [pdf]

https://drive.google.com/file/d/17Pq41ZzfwEdm-YrmWCMkvU0E4T-SXzPp/view
1•elsewhen•2m ago•0 comments

Show HN: Zsweep – Play Minesweeper using only Vim motions

https://zsweep.com
1•oug-t•5m ago•1 comments

Nuclear Weapons Are Now ESG Compliant

https://news.slashdot.org/story/26/01/14/144240/nuclear-weapons-are-now-esg-compliant
1•7777777phil•6m ago•0 comments

The Truth Architecture – Why Web3 Is the Only Way Out

https://aegistrail.github.io/posts/Why-Web3-is-the-only-way-out/
2•patronage•7m ago•0 comments

Humans are taking our jobs!

https://humanthreat.xyz/
2•modinfo•8m ago•0 comments

Predator Spyware Turns Failed Attacks into Intelligence for Future Exploits

https://www.securityweek.com/predator-spywares-granular-anti-analysis-features-exposed/
1•smurda•9m ago•0 comments

Engineering a reusable insulin patch pump

2•u-pump•10m ago•0 comments

The Harvesting of Lettuce

https://sftw.substack.com/p/310-to-yuma
1•HR01•11m ago•0 comments

Seamless codebase-relevant context enrichment for prompts

https://github.com/arterialist/magic-prompt
1•Arterialist•11m ago•0 comments

Is Sienna Rose AI? All Signs Point to 'Yes'

https://www.rollingstone.com/music/music-news/sienna-rose-ai-artist-real-1235499068/
1•geox•12m ago•0 comments

With AI coding we can just make our own editors

https://github.com/posix4e/minivim
1•alexnewman•17m ago•1 comments

Show HN: StayUp – a background desktop app for activity-based time trackers

1•delusdev•18m ago•0 comments

How to Build an AI Agent Declaratively with Terraform

https://chatbotkit.com/tutorials/how-to-build-an-ai-agent-declaratively-with-terraform
1•_pdp_•19m ago•0 comments

Perelman's Proof of the Poincar E Conjecture: A Nonlinear PDE Perspective

https://arxiv.org/pdf/math/0610903
1•tzury•24m ago•0 comments

Show HN: SMath Units, RCPC Initiative

https://github.com/JTRSoftware/Project_RCPC/tree/main/ReadyToShare/sMath
1•jtr87•26m ago•0 comments

Blue on X: "unrot your brain"

https://twitter.com/bluewmist/status/2012755834636533893
2•bilsbie•27m ago•0 comments

Show HN: Open-source confusion matrix generator for ML models

1•pareshrnayak•27m ago•1 comments

Ljudmila

https://wiki.ljudmila.org/Main_Page
2•jruohonen•27m ago•0 comments

The real technical debt is semantic decay and only platforms can stop it

https://unvarnishedgrady.substack.com/p/on-platforms-iii-the-physics-of-meaning
2•ecurb•27m ago•0 comments

Show HN: 13MB full-text site search

https://www.asciimx.com/log/site-search/
1•kovac•28m ago•0 comments

Coding with LLMs can still be fun

https://www.codingwithjesse.com/blog/coding-with-llms-can-still-be-fun/
1•CodingWithJesse•29m ago•0 comments

Could Europe Defeat America in an All-Out War?

https://globalaffairsexplained.com/europe-defeat-america/
1•type0•31m ago•1 comments

Show HN: Moshi – Talk to Claude Code from your phone (zero desktop install)

https://getmoshi.app
1•rjyo•31m ago•0 comments

What is Plan 9?

https://fqa.9front.org/fqa0.html#0.1
26•AlexeyBrin•32m ago•3 comments

Show HN: Straw – HTTP Liquid template engine

https://github.com/moritzrinow/straw
1•devmojo•33m ago•0 comments

Stop Begging Prospects to Open Your Files

https://www.sendnow.live
2•sendnow•35m ago•0 comments

Show HN: I made a Tetris based block puzzle game

https://playdropstack.com/
3•lastodyssey•37m ago•0 comments

String Theory Can Now Describe a Universe That Has Dark Energy

https://www.quantamagazine.org/string-theory-can-now-describe-a-universe-that-has-dark-energy-202...
2•7777777phil•39m ago•0 comments

Self Driving Biology

https://mattshams.com/writings/self-driving-biology
1•hnpwd•39m ago•0 comments