frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•11mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Show HN: Clawd Penguin – a virtual hangout for when Claude goes down

https://clawdpenguin.com
1•ossa-ma•1m ago•0 comments

Specsmaxxing

https://acai.sh/blog/specsmaxxing
1•brendanmc6•1m ago•0 comments

Microsoft Offers Voluntary Retirement to About 7% of US Workers

https://www.bloomberg.com/news/articles/2026-04-23/microsoft-offers-voluntary-retirement-to-about...
1•helsinkiandrew•2m ago•0 comments

Freak Heat Spikes Pay Big on Polymarket, Rousing Weather Nerds' Suspicion

https://www.wsj.com/business/unusual-weather-bets-on-polymarket-spur-french-investigation-b799bec8
1•julienchastang•2m ago•0 comments

Show HN: We're building Apache spark for agents with Rust and Datafusion

https://github.com/SkardiLabs/skardi
1•btnokami•3m ago•0 comments

Google TPU 8i for Inference and TPU 8T for Training Announced

https://www.servethehome.com/google-tpu-8i-for-inference-and-tpu-8t-for-training-announced/
1•teleforce•4m ago•0 comments

Show HN: Code garden deep-dive: my Forth C64 tetromino game

https://github.com/ekipan/sss/blob/share-hn/Design.md
1•ekipan•4m ago•0 comments

The Price of AI Is the Internet

https://vanilla.sh/blog/price-of-ai/
2•speckx•5m ago•0 comments

The NCSC's AI threat warning and the gap in AI agent security

https://agentshield.pro/blog/ncsc-perfect-storm
1•eigenart•6m ago•0 comments

Engineering Architecture: A Syllabus?

https://www.argmin.net/p/engineering-architecture-a-syllabus
1•sebg•6m ago•0 comments

America Cannot Lose the Robotics Race

https://a16z.com/america-cannot-lose-the-robotics-race/
1•nowflux•6m ago•0 comments

The Sleeper in the Payment Stack

https://franktyoung.substack.com/p/the-sleeper-in-the-payments-stack
1•manojr13•6m ago•0 comments

Atlassian to begin using customer metadata and and in-app data to train AI

https://www.atlassian.com/trust/ai/data-contribution/faqs
1•AaronM•8m ago•2 comments

Malicious Checkmarx Artifacts Found in Official KICS Docker Repository

https://socket.dev/blog/checkmarx-supply-chain-compromise
1•darkwater•9m ago•0 comments

Google Workspace Intelligence

https://workspace.google.com/blog/product-announcements/introducing-workspace-intelligence
1•oscarfr•9m ago•0 comments

Math Is Hard

http://miod.online.fr/software/openbsd/stories/vaxfp.html
1•signa11•9m ago•0 comments

Gemini Enterprise for the agentic task force

https://cloud.google.com/blog/products/ai-machine-learning/whats-new-in-gemini-enterprise
1•oscarfr•9m ago•0 comments

Gemini Enterprise Agent Platform

https://cloud.google.com/blog/products/ai-machine-learning/introducing-gemini-enterprise-agent-pl...
1•oscarfr•10m ago•0 comments

Why AI coding speed does not translate into engineering speed

https://blog.reqproof.com/p/ai-writes-your-code-nobody-verifies
1•LeonidBugaev•14m ago•0 comments

Web debugging proxy in your coding agent

https://www.telerik.com/blogs/when-your-coding-assistant-finally-got-x-ray-vision
1•zlatkov•14m ago•0 comments

Netflix Was Held Together with Duct Tape

https://marcrandolph.substack.com/p/netflix-was-held-together-with-duct
2•theorchid•15m ago•0 comments

The Declining Driver's License: Good, Bad, or Both?

https://maxmautner.com/2026/04/21/teen-drivers-license-decline.html
1•mslate•15m ago•0 comments

What Is AI Share of Voice? and Why You Should Care

https://liatbenzur.com/2026/04/22/what-is-ai-share-of-voice/
1•AISupportTeam•15m ago•0 comments

Is AI an Expensive Hobby?

https://adandai.wordpress.com/2026/04/23/is-ai-an-expensive-hobby/
2•allessa•16m ago•0 comments

Lossless Image Compression Architecture for Deep-Space CMOS Cameras

https://www.mdpi.com/2076-3417/16/6/2873
2•PaulHoule•16m ago•0 comments

Original Hello World in "B" Programming Language - Computerphile [video]

https://www.youtube.com/watch?v=cYS57xJuRP8
1•em-bee•16m ago•0 comments

High-Dose Flu Vaccine Linked to Lower Dementia Risk

https://www.medscape.com/viewarticle/high-dose-flu-vaccine-linked-lower-dementia-risk-2026a1000cjf
4•kieranmaine•17m ago•0 comments

Show HN: Sable Found a SQL Injection in a Legacy Financial Portal

https://blog.vulnetic.ai/how-sable-found-a-sql-injection-in-a-legacy-financial-portal-58a329b96b0b
1•danieltk76•18m ago•0 comments

Evolving Distributed Tracing at Uber Engineering

https://www.uber.com/in/en/blog/distributed-tracing/
1•sebg•19m ago•0 comments

Show HN: AgentBox – SDK to Run Claude Code, Codex, or OpenCode in Any Sandbox

https://github.com/TwillAI/agentbox-sdk
2•willydouhard•20m ago•0 comments