Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•11mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extensible across domains (math, retrieval, reasoning, tool use, etc.).

What's in Zeno for now?
- Auditable, stateless reward functions for Python code: docstrings, ruff linting, type hints, recursion, and more.
- Works directly with Hugging Face's TRL or any RL loop; plug reward functions in as needed.
- MIT licensed and minimal.
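To make "verifiable, deterministic reward" concrete, here is a minimal sketch of what a docstring reward could look like. It is not Zeno's actual API: the name `docstring_reward` and the scoring rule (fraction of functions that have a docstring) are assumptions for illustration. The signature, a list of completions in and one float per completion out, follows the shape TRL's GRPOTrainer expects for custom reward functions, assuming completions arrive as plain strings.

```python
import ast


def docstring_reward(completions, **kwargs):
    """Score each completion by the fraction of its functions that have a docstring.

    Hypothetical example, not taken from Zeno. Stateless and deterministic:
    the same completion always receives the same score.
    """
    rewards = []
    for code in completions:
        try:
            tree = ast.parse(code)
        except (SyntaxError, ValueError):
            # Code that does not parse earns no reward.
            rewards.append(0.0)
            continue

        # Collect every function definition, including nested and async ones.
        funcs = [
            node
            for node in ast.walk(tree)
            if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
        ]
        if not funcs:
            # Nothing to document, so nothing to reward.
            rewards.append(0.0)
            continue

        documented = sum(1 for func in funcs if ast.get_docstring(func))
        rewards.append(documented / len(funcs))
    return rewards
```

Because the function only parses the text and never executes it, the score can be audited by hand. A function like this could be passed in the `reward_funcs` list of a TRL trainer or called directly from a custom RL loop.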

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning, and agentic behaviors are on the to-do list.

Repo: https://github.com/think-a-tron/zeno

Docs and more details are in the README.

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.
