frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•6mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Manage risk with drawdown, not hope

https://pyquantnews.substack.com/p/size-positions-by-drawdown-not-hope
1•strimp099•29s ago•0 comments

Decades-old study on common weed killer retracted

https://www.cbc.ca/news/health/glyphosate-retraction-9.7004363
1•geox•32s ago•0 comments

Ceremonial Bugle

https://ceremonialbugle.com/
1•mhb•7m ago•0 comments

Show HN: WebGPU back end for PyTorch sneak peek

https://github.com/jmaczan/torch-webgpu
2•yu3zhou4•11m ago•0 comments

What would someone like me do with a tiny modular synth? [pdf]

https://www.musicthing.co.uk/collateral/WhatWouldSomeoneLikeMeDoWithATinyModularSynth_book.pdf
1•mhb•12m ago•0 comments

Kernel Float: Unlocking Mixed-Precision GPU Programming

https://dl.acm.org/doi/pdf/10.1145/3779120
2•gpuhacker•12m ago•0 comments

Godfather of AI' Geoffrey Hinton says Google is 'beginning to overtake' OpenAI

https://www.businessinsider.com/ai-godfather-geoffrey-hinton-google-overtaking-openai-2025-12
2•ashishgupta2209•13m ago•0 comments

Resolution Dynamics: Deriving the Fine Structure Constant from Shannon Capacity

https://zenodo.org/records/17821936
2•Jascon71•14m ago•0 comments

Show HN: Seq2Seq ML Learns the Inverse of Manual Multiplication (Gelosia Method)

https://gitlab.com/9o1d/gelosia
1•9o1d•16m ago•0 comments

A Class of Models with the Potential to Represent Fundamental Physics

https://www.wolframphysics.org/technical-introduction/
1•frizlab•16m ago•0 comments

Iran files charges against organizers after women runners flout hijab law

https://www.timesofisrael.com/iran-files-charges-against-marathon-organizers-after-women-runners-...
1•mhb•18m ago•0 comments

Why "all-in-one" productivity tools confuse new users

3•suffei771•19m ago•0 comments

Check out these YouTube Slides generated by VeedoAI

https://veedo.ai/yt-slides/s/0VKmpTYVWV4FXBtm4
1•shortpodo•19m ago•0 comments

Tiny Core Linux: a 23 MB Linux distro with graphical desktop

http://www.tinycorelinux.net/
2•LorenDB•20m ago•0 comments

Show HN: WhispaQA, the intent-driven QA tool

https://whispaai.com
1•j_mao•20m ago•0 comments

Implementing AMD GPU debugger and user mode graphics drivers internals in Linux

https://thegeeko.me/blog/amd-gpu-debugging/
1•thegeeko•21m ago•0 comments

I asked AI researchers and economists about SWE career strategy and AI's future

https://chrisbarber.co/I+asked+AI+researchers+%26+economists+about+SWE+career+strategy+and+the+fu...
1•cjbarber•27m ago•0 comments

I Recently Upgraded the Travel Avatar Feature

https://smartavatar.net/travel-avatar
1•harryyansu4•29m ago•1 comments

Meta acquires AI device startup Limitless

https://techcrunch.com/2025/12/05/meta-acquires-ai-device-startup-limitless/
1•redohmy•29m ago•1 comments

C3 vs. Zig in 2025: Who's Fixing C? [video]

https://www.youtube.com/watch?v=y3tDZACGRAY
1•mpweiher•30m ago•0 comments

Super-Flat ASTs

https://jhwlr.io/super-flat-ast/
1•mpweiher•32m ago•0 comments

Kiss vs. DRY in Infrastructure as Code: Why Simple Often Beats Clever

https://rosesecurity.dev/2025/11/14/kiss-versus-dry-iac.html
1•prognostikos•32m ago•0 comments

A lost idea can be a lost universe

https://rishi.monster/posts/a-lost-idea-can-be-a-lost-universe/
1•wawhal•32m ago•0 comments

Things You Shouldn't Leave in the Shed This Winter

https://www.countryliving.com/gardening/garden-ideas/a69611445/things-to-not-put-garden-shed-winter/
2•mooreds•33m ago•0 comments

Show HN: Octopii – a distributed runtime written in Rust

https://github.com/octopii-rs/octopii
1•joeeverjk•34m ago•0 comments

Glitches on video calls linked to real-world decisions

https://www.theregister.com/2025/12/04/glitchy_video_calls_research/
1•radeeyate•34m ago•0 comments

What makes a Platform Team work [audio]

https://www.swarmia.com/podcast/mason-jones-zapier/
1•mooreds•35m ago•0 comments

Show HN: PyTorch-World v0.1.0: Build & Train World Models

https://github.com/ParamThakkar123/pytorch-world
1•paramthakkar•35m ago•0 comments

Memory Crunch Hits PCs: Dell Hikes Prices 20%, Lenovo from January 2026

https://www.trendforce.com/news/2025/12/05/exclusive-memory-crunch-hits-pcs-dell-hikes-prices-15-...
2•akyuu•36m ago•0 comments

New discovery: The 'sacred boundary' surrounding Stonehenge

https://www.dw.com/en/new-discovery-the-sacred-boundary-surrounding-stonehenge/a-75039682
1•politelemon•36m ago•0 comments