frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•7mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

The Useless Web

https://theuselessweb.com/
1•nateb2022•2m ago•0 comments

What Is Plus Times Plus? (Lambda Calculus Pictorially) [video]

https://www.youtube.com/watch?v=RcVA8Nj6HEo
1•rramadass•4m ago•0 comments

The US's 2k-year-old mystery mounds

https://www.bbc.com/travel/article/20221204-the-us-2000-year-old-mystery-mounds
1•1659447091•5m ago•0 comments

The Monty Hall Problem, a side-by-side simulation

https://www.pcloadletter.dev/blog/monty/
2•ronbenton•13m ago•1 comments

The State of Agentic iOS Engineering in 2026

https://dimillian.medium.com/the-state-of-agentic-ios-engineering-in-2026-c5f0cbaa7b34
2•Anon84•14m ago•1 comments

On biological & artificial consciousness: A case for biological computationalism

https://www.sciencedirect.com/science/article/pii/S0149763425005251
2•bookofjoe•17m ago•0 comments

Show HN: Sentinel Shield – Pure C DMZ for AI Security (23K LOC, <1ms latency)

2•Chgdz•17m ago•0 comments

Ask HN: Favorite Articles in the ACM Digital Library

2•lioeters•20m ago•2 comments

Interpreter – Offline screen translator for Japanese retro games

https://github.com/bquenin/interpreter
3•bane•23m ago•0 comments

Making beautiful PDF documents from HTML and CSS

https://css4.pub/
2•jez•24m ago•0 comments

Ask HN: Which AI productivity tools are you using in 2026?

3•Vishal19111999•28m ago•0 comments

Ukraine enters EU's single mobile roaming zone

https://www.yahoo.com/news/articles/ukraine-enters-eus-single-mobile-164712435.html
4•gok•29m ago•0 comments

Steam On Linux Ends 2025 With 3.19% Marketshare

https://www.phoronix.com/news/Steam-December-2025-Survey
6•doener•31m ago•0 comments

Engineering Is Becoming Beekeeping

https://bits.logic.inc/p/engineering-is-becoming-beekeeping
3•highfrequency•31m ago•0 comments

Balsa M2-F3 Lifting Body

https://www.engineersneedart.com/blog/m2f32025/m2f32025.html
3•chmaynard•32m ago•0 comments

Outrage as X's Grok morphs photos of women, children into explicit content

https://www.cnbctv18.com/technology/global-outrage-as-xs-grok-morphs-photos-of-women-children-int...
10•anonymousab•32m ago•1 comments

China's BYD set to overtake Tesla as top EV seller

https://www.bbc.com/news/articles/cj9rjwpvmpzo
11•decimalenough•33m ago•1 comments

Show HN: VideoCalling.app – Free Video Calling Service

https://videocalling.app
2•Airyisland•35m ago•0 comments

Webmention is an open web standard (W3C Recommendation) for conversations

https://indieweb.org/Webmention
4•doener•36m ago•0 comments

Show HN: Turning 100-plus comments HN threads into readable discussions

4•freakynit•39m ago•1 comments

DENT: A network operating system (NOS) for everyone else

https://dent.dev/
3•teleforce•40m ago•0 comments

Ask HN: Best videos for learning Java concurrency?

2•michalgad•40m ago•1 comments

Delete Request and Opt-Out Platform (Drop)

https://consumer.drop.privacy.ca.gov/
3•doener•41m ago•1 comments

Simulating a negative tax city on Cities Skylines 2 [video]

https://www.youtube.com/watch?v=MK_0mQ7TLY0
2•MinimalAction•44m ago•0 comments

ReactOS Starts 2026 with a Major Step Toward Windows NT6 Compatibility

https://www.phoronix.com/news/ReactOS-Starts-2026
7•hackthemack•45m ago•0 comments

Ask HN: Building a tool to ensure things get done on time

3•Vishal19111999•46m ago•0 comments

I bootstrapped an AI OSINT search engine to 35k users. Trying $5 Day Pass Model

https://ai.cylect.io/
2•nuzzl•48m ago•1 comments

Cerelog ESP-EEG is a new 8-channel biosensing board at a hobbyist-friendly price

https://www.autodidacts.io/cerelog-esp-eeg-affordable-openbci-like-board/
3•Curiositry•56m ago•0 comments

Designing Predictable and Maintainable Forms in React

https://jsdev.space/react-form-primitives/
3•javatuts•57m ago•0 comments

Construction to begin on Florida expressway that will charge EVs while driving

https://www.nbcmiami.com/news/construction-to-begin-on-florida-expressway-that-will-charge-evs-wh...
5•geox•1h ago•3 comments