
Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•10mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extensible across domains (math, retrieval, reasoning, tool use, etc.).

What's in Zeno for now?
- Auditable, stateless reward functions for Python code: docstrings, ruff linting, type hints, recursion, and more.
- Works directly with Hugging Face's TRL or any RL loop; plug reward functions in as needed.
- MIT licensed and minimal.
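To make "stateless, verifiable reward function" concrete, here is a minimal sketch of what a docstring check could look like. It is not Zeno's actual code or API: the name `docstring_reward` is hypothetical, and it assumes the calling convention TRL uses for custom reward functions (a list of completions in, a list of floats out), with completions passed as plain strings.

```python
import ast


def docstring_reward(completions, **kwargs):
    """Return, per completion, the fraction of functions that have a docstring.

    Hypothetical example, not taken from Zeno. Assumes each completion is a
    plain string of Python source. Unparseable code, or code that defines no
    functions, scores 0.0.
    """
    rewards = []
    for code in completions:
        try:
            tree = ast.parse(code)
        except SyntaxError:
            # Code that does not parse cannot be verified, so it earns nothing.
            rewards.append(0.0)
            continue

        funcs = [
            node
            for node in ast.walk(tree)
            if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
        ]
        if not funcs:
            rewards.append(0.0)
            continue

        documented = sum(1 for f in funcs if ast.get_docstring(f))
        rewards.append(documented / len(funcs))
    return rewards
```

Because the function depends only on its input text and keeps no state, the same completion always gets the same score, which is what makes a reward like this auditable.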

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning, and agentic behaviors are on the to-do list.

Repo: https://github.com/think-a-tron/zeno

Docs and more details are in the README.

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.
