frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•9mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Show HN: Say What You Should Have

https://mappymail.com
1•pruetj•2m ago•0 comments

Compulsively violent people might have lower IQs

https://www.psypost.org/people-who-engage-in-impulsive-violence-tend-to-have-lower-iq-scores/
3•karim79•6m ago•0 comments

Show HN: LucidExtractor – AI web scraper that understands plain English

https://lucidextractor.liceron.in
1•yukendiran_j•7m ago•0 comments

Ask HN: Does This Make Sense?

2•piratesAndSons•7m ago•1 comments

Show HN: ZuckerBot. API and MCP server for AI agents to run Meta/Facebook ads

https://zuckerbot.ai/
1•DavisGrainger•9m ago•0 comments

Weekly AI Pulse: Feb 23rd Edition

https://manojgopanapalli.substack.com/p/your-weekly-ai-pulse-india-ai-impact
1•thecontentboy•9m ago•0 comments

You Make Good Money...So Why Do You Still Feel Replaceable?

https://cpleveragingai.substack.com/p/you-make-good-money
1•cp18101985•10m ago•0 comments

Ask HN: Is there a workaround in OpenClaw for tab not found

1•jinen83•11m ago•0 comments

HN; Scheduled autonomous Claude agents using shell scripts and launchd

https://github.com/raulriera/MacPilot
1•raulriera•12m ago•1 comments

The Refrigerator Policy

https://alexover.dev/articles/the-refrigerator-policy/
2•alexoverdev•13m ago•1 comments

Show HN: A 4-tier self-healing system for local AI agents (was silently broken)

1•ramsbaby-dev•13m ago•1 comments

Tiny knife raised $450K in under a week

https://gearjunkie.com/knives/coin-tiny-knife-launch
1•teleforce•14m ago•0 comments

Show HN: 4-tier self-healing AI agent (was silently broken for weeks)

https://github.com/Ramsbaby/openclaw-self-healing
1•ramsbaby-dev•17m ago•0 comments

15K Labeled Enterprise Use Cases for Agent Routing (CC-by-4.0)

https://huggingface.co/datasets/LlewellynSystems/ode-enterprise-use-cases
1•LLSODE•18m ago•0 comments

LipoVive Launches Natural Supplement to Boost Metabolic Health

https://www.morningstar.com/news/accesswire/1138075msn/lipovive-reviews-shocking-2026-report-what...
1•janzlaps•18m ago•0 comments

WME Group

https://en.wikipedia.org/wiki/WME_Group
1•barrister•19m ago•0 comments

I Vibecoded a Tax App in a couple of weekends

https://github.com/ouais/opentax
3•ouaiso•23m ago•4 comments

What a viral monkey, his plushie, and a 70-year-old experiment tell us

https://theconversation.com/a-viral-monkey-his-plushie-and-a-70-year-old-experiment-what-punch-te...
3•defrost•28m ago•0 comments

Ask HN: What's your thought on Google Antigravity?

3•ms7892•33m ago•1 comments

Clawbridge Runner – CLI for nightly OpenClaw discovery and connection briefs

https://clawbridge.cloud/
1•lich2000117•33m ago•1 comments

MiniMax-M2.5: How to Run Guide

https://unsloth.ai/docs/models/minimax-m25
1•khimaros•34m ago•0 comments

A sneaky demonstration of the dangers of curl – bash

https://blog.k3can.us/posts/2026/feb/dontcurlbash/
2•novemp•34m ago•0 comments

Keeping the Pirates at Bay

https://www.gamedeveloper.com/business/keeping-the-pirates-at-bay
4•colinprince•36m ago•0 comments

Show HN: Panther – a cross-platform cybersecurity scripting language

1•CzaxTanmay•38m ago•1 comments

Why Kanji survived in Japan but not in Korea or Vietnam [video]

https://www.youtube.com/watch?v=61kf0AY5rzY
1•teleforce•40m ago•0 comments

Show HN: Convert Audio to Text – NeatScribe

https://neatscribe.com
3•kadeus•41m ago•0 comments

Show HN: Sayou: Open-source agent workspace with versioned files and MCP tools

https://github.com/pixell-global/sayou
2•syumpx•43m ago•1 comments

Elon Musk posted about race almost every day in January

https://www.theguardian.com/technology/2026/feb/12/elon-musk-posts-january-white-supremacists
16•tastyface•47m ago•1 comments

AI-recorded meetings can go horribly wrong

https://www.afr.com/world/europe/this-is-how-ai-recorded-meetings-can-go-horribly-wrong-20260223-...
3•KnuthIsGod•47m ago•0 comments

Bug Traceability: Translating Bugs to Business Impact

https://docs.testchimp.io/blog/ux_bug_traceability/
2•nsamarasekera•49m ago•0 comments