Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•6mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extensible across domains (math, retrieval, reasoning, tool use, etc.).

What's in Zeno for now?
- Auditable, stateless reward functions for Python code: docstrings, ruff linting, type hints, recursion, and more.
- Works directly with Hugging Face's TRL or any RL loop; plug reward functions in as needed.
- MIT licensed and minimal.
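To give a feel for what a stateless, deterministic reward of this kind might look like, here is a minimal sketch. It is not taken from Zeno's codebase: the function name `docstring_and_hints_reward` and the 50/50 weighting are made up for illustration. It assumes completions arrive as plain strings of Python source and follows the general shape TRL expects of a custom reward function (a list of completions in, a list of floats out).

```python
import ast


def docstring_and_hints_reward(completions, **kwargs):
    """Return one score in [0, 1] per completion.

    Hypothetical example, not Zeno's API. Each function in a completion
    earns 0.5 for having a docstring and 0.5 for being fully type-hinted
    (all parameters plus the return value). Scores are averaged over the
    functions found; unparsable code or code with no functions scores 0.
    """
    rewards = []
    for code in completions:
        try:
            tree = ast.parse(code)
        except (SyntaxError, ValueError):
            # Code that does not parse gets no reward.
            rewards.append(0.0)
            continue

        funcs = [
            node
            for node in ast.walk(tree)
            if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
        ]
        if not funcs:
            rewards.append(0.0)
            continue

        total = 0.0
        for fn in funcs:
            has_doc = ast.get_docstring(fn) is not None
            params = fn.args.posonlyargs + fn.args.args + fn.args.kwonlyargs
            has_hints = fn.returns is not None and all(
                p.annotation is not None for p in params
            )
            total += 0.5 * has_doc + 0.5 * has_hints
        rewards.append(total / len(funcs))
    return rewards
```

Because the function only inspects the syntax tree and keeps no state, the same completion always gets the same score, which is what makes the reward auditable. In a TRL setup it would be passed alongside any other reward functions; in a custom RL loop it can be called directly on a batch of generated strings.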

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning, and agentic behaviors are on the to-do list.

Repo: https://github.com/think-a-tron/zeno

Docs and more details are in the README.

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.
