frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•10mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Zero-build privacy policies with Astro

https://www.openpolicy.sh/blog/no-build-astro
1•jamie_davenport•2m ago•0 comments

Show HN: List of Design Md Files

https://github.com/bergside/awesome-design-md-skills
1•elwingo1•4m ago•0 comments

Rage: A modern Ruby framework designed for non-blocking I/O

https://github.com/rage-rb/rage
1•thunderbong•6m ago•0 comments

Apple Home is expanding its energy management features (2025)

https://www.theverge.com/news/685733/apple-home-energykit-energy-management-ios26-wwdc
1•zeristor•11m ago•1 comments

Dubai considering canal to bypass Strait of Hormuz (2008)

https://www.freightwaves.com/news/dubai-considering-canal-to-bypass-strait-of-hormuz
1•JumpCrisscross•11m ago•0 comments

Seeduplex: ByteDance's full-duplex voice AI

https://seeduplex.io
1•caoq•11m ago•1 comments

CLI proxy that reduces LLM token consumption by 60-90% on common dev commands

https://github.com/rtk-ai/rtk
2•ahamez•12m ago•0 comments

I watched Claude Code read my AWS credentials on startup

https://github.com/diemoeve/forgeterm/releases/tag/v0.2.0
2•storm677•13m ago•0 comments

Ask HN: Are you encountering AI-related questions in the hiring market?

2•somthingwrong•14m ago•0 comments

My turns any API into an AI agent in 60 seconds

https://www.indiehackers.com/post/i-built-a-platform-that-turns-any-api-into-an-ai-agent-in-60-se...
2•AnimeMyPic•18m ago•0 comments

I Lost This Once

https://johnie.se/writing/i-already-lost-this-once
1•div3rs3•18m ago•0 comments

Show HN: I Built an LLM Harness for Language Learning

https://language.coursely.ai/begin
1•MO-379•19m ago•0 comments

Model Architecture Description Encyclopedia

https://madl.si5.pl/
1•izik•19m ago•0 comments

Track List – Track Your Favorite GitHub List Daily

https://www.trackawesomelist.com/
1•triilman•26m ago•0 comments

CVE-2026-23869: CVSS 7.5 in React Server Components lead to Denial of Service

https://vercel.com/changelog/summary-of-cve-2026-23869
1•altbdoor•30m ago•0 comments

AES Cipher and Dechiper

https://github.com/loperfido/aes256
1•loperfido•31m ago•0 comments

Lobsters Interview with Internet_Janitor

https://alexalejandre.com/programming/interview-with-john-earnest/
1•birdculture•33m ago•0 comments

Particles seen emerging from empty space for first time

https://www.newscientist.com/article/2522324-particles-seen-emerging-from-empty-space-for-first-t...
4•isaacfrond•33m ago•0 comments

Show HN: A security scanner for AI Agent Skills

https://github.com/Fangcun-AI/SkillWard/tree/main
3•mayziem•34m ago•0 comments

Show HN: Search cheap night train tickets in Europe

https://trainbot.eu/
1•druskacik•34m ago•0 comments

The Uncensored Library

https://www.uncensoredlibrary.com/en
1•28304283409234•35m ago•0 comments

Ask HN: How do you make paywalls in ATProtocol?

1•Gooblebrai•36m ago•0 comments

PostgresBench: A Reproducible Benchmark for Postgres Services

https://clickhouse.com/blog/postgresbench
2•tosh•37m ago•0 comments

Show HN: Run GUIs as Scripts

https://github.com/skinnyjames/hokusai-pocket
2•zero-st4rs•47m ago•0 comments

Agentic AI in 2026: Who's building real systems?

https://simplai.ai/blogs/what-is-enterprise-ai/
2•shanmugarajsk•50m ago•0 comments

The Music of the Spheres

https://www.smbc-comics.com/comic/spheres-part-1
2•robin_reala•51m ago•0 comments

Capture-Quiet Decomposition: A Verification Theorem for Chess Endgame Tablebases

https://arxiv.org/abs/2604.07907
1•RusDyn•58m ago•0 comments

iPhone Fold is 'on track' to launch this September, per Mark Gurman

https://9to5mac.com/2026/04/07/iphone-fold-is-on-track-to-launch-this-september-per-mark-gurman/
1•prawn•58m ago•0 comments

Why Apple Is Betting Against Native iOS Development

https://medium.com/@mrhotfix/why-apple-is-secretly-betting-against-native-ios-development-c67cb34...
1•antfarm•59m ago•0 comments

The Model Is Not the Product: Harnesses Will Define the Next Phase of AI

https://www.mountaineagle.net/articles/display/the-model-is-not-the-product-harnesses-not-intelli...
2•uswn•1h ago•0 comments