Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•1y ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now?
- Auditable, stateless reward functions for Python code: docstrings, ruff linting, type hints, recursion, and more.
- Works directly with Hugging Face's TRL or any RL loop; plug reward functions in as needed.
- MIT licensed and minimal.
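To make "stateless, deterministic reward function" concrete, here is a minimal sketch using only the standard library. It is not Zeno's actual API: the names `docstring_reward` and `type_hint_reward` and the scoring scheme (fraction of functions that pass the check) are assumptions for illustration, and completions are assumed to be plain Python source strings.

```python
import ast


def _functions(code):
    """Return the function nodes in `code`, or None if it does not parse."""
    try:
        tree = ast.parse(code)
    except (SyntaxError, ValueError):
        return None
    return [
        node
        for node in ast.walk(tree)
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
    ]


def _fully_annotated(func):
    """True if the return type and every named parameter are annotated.

    Simplification (assumption): *args/**kwargs are ignored, and an
    unannotated `self` counts as a missing annotation.
    """
    args = func.args
    params = args.posonlyargs + args.args + args.kwonlyargs
    return func.returns is not None and all(
        p.annotation is not None for p in params
    )


def _score(completions, check):
    """Score each completion as the fraction of its functions passing `check`."""
    rewards = []
    for code in completions:
        funcs = _functions(code)
        if not funcs:  # unparseable code or no functions at all
            rewards.append(0.0)
            continue
        passed = sum(1 for f in funcs if check(f))
        rewards.append(passed / len(funcs))
    return rewards


def docstring_reward(completions, **kwargs):
    """Hypothetical reward: fraction of functions that carry a docstring."""
    return _score(completions, lambda f: ast.get_docstring(f) is not None)


def type_hint_reward(completions, **kwargs):
    """Hypothetical reward: fraction of functions that are fully annotated."""
    return _score(completions, _fully_annotated)
```

Both functions depend only on their input, so the same completion always gets the same score and the result can be audited by reading the code. In TRL, callables of this shape (a list of completions in, a list of floats out) can be passed through the `reward_funcs` argument of `GRPOTrainer`; with conversational datasets the completions arrive as message lists rather than strings, so the text would need to be extracted first.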

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning, and agentic behaviors are on the to-do list.

Repo: https://github.com/think-a-tron/zeno

Docs and more details are in the README.

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.
