frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•7mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Rocket Engine Wasn't Designed by Humans

https://www.youtube.com/watch?v=6Xx1GXjRbMk
1•bane•2m ago•0 comments

Ask HN: How would you formalize a time-based macroeconomic model?

1•AGsist•5m ago•0 comments

Show HN: Snipscribe – a digital organizer for your pen and paper notes

https://snipscribe.com/
3•wisemonk314•10m ago•1 comments

Winter braid lecture notes [pdf]

https://www.numdam.org/item/10.5802/wbln.9.pdf
1•marysminefnuf•12m ago•0 comments

Show HN: Browser extension that brings Vim motions to Google Docs

https://github.com/tirthd16/dockeys
1•tirthd•12m ago•0 comments

Running Quantized MobileNetV2 on ESP32 for Railway Crack Detection (Under $10)

https://medium.com/python-in-plain-english/frugal-innovation-building-a-real-time-railway-safety-...
1•hemanthmuralik•16m ago•1 comments

Stereoscopic Technologies and Heart Rate Variability in Extreme VR Gaming

https://www.mdpi.com/2227-7080/13/12/545
1•PaulHoule•19m ago•0 comments

Toward Training Superintelligent Software Agents Through Self-Play SWE-RL (Meta)

https://arxiv.org/abs/2512.18552
1•xhevahir•22m ago•0 comments

Fake MAS Windows activation domain used to spread PowerShell malware

https://www.bleepingcomputer.com/news/security/fake-mas-windows-activation-domain-used-to-spread-...
2•smurda•23m ago•0 comments

Show HN: BrandRetina – screenshot similarity API for spear-phish detection

https://brandretina.ai/
1•malik_naji•26m ago•0 comments

Show HN: Agno best agnet Framework – I built a complex multi-agent image app

https://picxstudio.com
1•Yash16•27m ago•0 comments

Why ice skating is a miracle of physics

https://bigthink.com/starts-with-a-bang/ice-skating-miracle-physics/
3•bookofjoe•29m ago•0 comments

China to deploy humanoid robot for patrolling at the Vietnam border

https://www.wionews.com/world/china-humanoid-robots-walker-s2-vietnam-border-deployment-176666228...
1•toomuchtodo•34m ago•0 comments

AI Code Review Adoption Tracker

https://www.aitooltracker.dev
2•patrickdevivo•37m ago•0 comments

Meeting Seed7

https://genodians.org/nfeske/2025-12-22-meeting-seed7
1•t-3•44m ago•0 comments

Breaking Free from RGB

https://www.youtube.com/watch?v=HDGxXgaBAWE
2•lukeh•46m ago•0 comments

Managing Diabetes in Software Freedom

https://sfconservancy.org/blog/2025/nov/06/juggluco-foss-continuous-glucose-montior-diabetes/
2•pabs3•47m ago•0 comments

AI Deregulation and Corruption: Companies Now Have Too Many GPUs [video]

https://www.youtube.com/watch?v=FnlgwyVahCY
2•pabs3•50m ago•0 comments

EngineAI T800: Humanoid Robot Performs Martial Arts Moves

https://scienceclock.com/engineai-t800-humanoid-robot-martial-arts/
13•notthesay•57m ago•3 comments

If AI Becomes Conscious, We Need to Know

https://www.wsj.com/opinion/if-ai-becomes-conscious-we-need-to-know-83aa61d8
4•kvee•1h ago•2 comments

Gleam: The happy holidays release 2025

https://gleam.run/news/the-happy-holidays-2025-release/
8•nateb2022•1h ago•0 comments

Ask HN: Why Do You Blog?

15•onesandofgrain•1h ago•4 comments

ARK's Aggressive Pivot: Wood Doubles Down on Tesla and Re-Enters Big Tech AI

https://www.13radar.com/guru/catherine-wood
1•EvansWilson•1h ago•1 comments

Moving Images Related to the Apollo Missions, 1967–1969

https://catalog.archives.gov/id/133360601
4•handfuloflight•1h ago•0 comments

TurboDiffusion: 100–200× Acceleration for Video Diffusion Models

https://github.com/thu-ml/TurboDiffusion
1•meander_water•1h ago•0 comments

WiFi DensePose: WiFi-based dense human pose estimation system through walls

https://github.com/ruvnet/wifi-densepose
28•nateb2022•1h ago•11 comments

Show HN: A Claude Code plugin that catch destructive Git and filesystem commands

https://github.com/kenryu42/claude-code-safety-net
2•kenryu•1h ago•0 comments

Show HN: I built a tutor that teaches only by asking questions

https://sovyr-learn.vercel.app/
1•hugholousk•1h ago•1 comments

Show HN: Fun sketch – Bring your sketches to life

https://funsketch.kigun.org/
1•mishu2•1h ago•0 comments

HUML (Human-oriented Markup Language) [video]

https://www.youtube.com/watch?v=4M_tD1N14Ao
1•manlymuppet•1h ago•0 comments