frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•6mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

The Bitter Lessons

https://www.hyperdimensional.co/p/the-bitter-lessons
1•gmays•1m ago•0 comments

Back to Basics: Let Denoising Generative Models Denoise

https://arxiv.org/abs/2511.13720
1•dvrp•1m ago•0 comments

Browser-based Lunatic Fringe Clone

https://jackinloadup.github.io/lunatic-fringe/
1•hoag•2m ago•0 comments

We Can Now Track Individual Monarch Butterflies

https://www.nytimes.com/2025/11/17/science/monarch-butterfly-migration-tracking-sensor.html
1•m463•2m ago•1 comments

The Fate of Data Model Dependency

https://medium.com/@HobokenDays/the-fate-of-shared-data-model-cf8a3dc88ac9
1•HideInNews•4m ago•0 comments

Show HN: Waitinglist.to – Let Founders Focus on Building

https://waitinglist.to/
1•ivanramos•5m ago•0 comments

The obvious economics of preserving the Amazon

https://www.economist.com/the-americas/2025/10/23/the-obvious-economics-of-preserving-the-amazon
1•gwintrob•9m ago•0 comments

Fuzzbox: Modern fuzzy finder for Vim with minimal dependencies

https://github.com/vim-fuzzbox/fuzzbox.vim
1•inatreecrown2•10m ago•1 comments

AA-Omniscience: Evaluating Cross-Domain Knowledge Reliability in Language Models

https://arxiv.org/abs/2511.13029
4•declanjackson•12m ago•0 comments

The End of Traditional Industrial Design and Transition into a New Frontier (2021) [video]

https://www.youtube.com/watch?v=fozZfnJCoaE
1•mgh2•12m ago•0 comments

Migrate from Ingress-Nginx Controller to Nginx Ingress Controller

https://docs.nginx.com/nginx-ingress-controller/install/migrate-ingress-nginx/
1•mooreds•13m ago•0 comments

The productivity impact of coding agents

https://cursor.com/blog/productivity
2•todsacerdoti•14m ago•0 comments

SimpleMMO – How I made a hole a home (2021)

https://blog.galahadcreative.com/simplemmo-how-i-made-a-hole-a-home/
1•bdlowery•19m ago•0 comments

Count Cachula – Local-first performance without the complexity

https://countcachula.spooky.click/
1•jakelazaroff•19m ago•0 comments

My Tesla Robotaxi "safety" driver fell asleep

https://old.reddit.com/r/sanfrancisco/comments/1p00wmx/my_tesla_robotaxi_safety_driver_fell_asleep/
1•leoh•25m ago•0 comments

Seekdb,unified search database for AI(relational, vector and full text)

https://github.com/oceanbase/seekdb
1•jinqueeny•26m ago•0 comments

So You Want To Look Rich? How to use museum archives to make large wall art

https://marykateandsmashley.substack.com/p/so-you-want-to-look-rich
2•novia•32m ago•1 comments

Microsoft's AI Strategy Deconstructed – From Energy to Tokens

https://newsletter.semianalysis.com/p/microsofts-ai-strategy-deconstructed
1•nsoonhui•33m ago•0 comments

Rank-balanced trees (2014) [pdf]

https://sidsen.azurewebsites.net/papers/rb-trees-talg.pdf
2•todsacerdoti•40m ago•0 comments

RDMA-Rust: Why another RDMA wrapper

https://rdma-rust.github.io/2025/11/16/why-another-rdma-wrapper/
2•wmf•41m ago•0 comments

PPP-over-HTTP/2: Having Fun with dumbproxy and pppd

https://snawoot.github.io/ppp-over-http2/
1•Snawoot•47m ago•0 comments

Salesforce no longer sells Slack directly in Hong Kong, Macau and Taiwan

3•dt060•48m ago•0 comments

CoreWeave, the AI industry's ticking time bomb

https://www.niemanlab.org/reading/meet-coreweave-the-ai-industrys-ticking-time-bomb/
9•zerosizedweasle•50m ago•3 comments

A Theory of Dumb: Why Are IQ Scores Suddenly Falling?

https://nymag.com/intelligencer/article/american-adult-lower-iq-scores-cognitive-decline-technolo...
4•tysone•57m ago•0 comments

C23: A Slightly Better C

https://lemire.me/blog/2024/01/21/c23-a-slightly-better-c/
2•signa11•57m ago•0 comments

New rare earth crisis is brewing as yttrium shortages spread

https://www.reuters.com/business/aerospace-defense/new-rare-earth-crisis-is-brewing-yttrium-short...
2•perihelions•59m ago•0 comments

The Life of a Packet in the Linux kernel: From write() to recv()

https://www.0xkato.xyz/life-of-a-packet-in-the-linux-kernel/
1•signa11•1h ago•0 comments

Show HN: A Claude Code plugin for build agent (dogfodding it now)

https://github.com/openonion/connectonion-claude-plugin
1•OpenOnion•1h ago•0 comments

Mexican government partially unblocks secure internet

https://blog.torproject.org/mexican-government-partially-unblocks-tor/
3•iamnothere•1h ago•0 comments

OpenZFS 2.4 Squeezes in Some Last Minute Improvements

https://www.phoronix.com/news/OpenZFS-2.4-rc4-Released
3•Bender•1h ago•0 comments