frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•7mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Former WA state auditor suggests review of DCYF child daycare funding has merit

https://komonews.com/news/local/former-washington-state-auditor-brian-sonntag-department-of-child...
1•silexia•2m ago•1 comments

Claude vs. Codex in the Messy Middle

https://benr.build/blog/claude-vs-codex-messy-middle
1•bisonbear•2m ago•0 comments

The Early History of Distributed Open-Source Development

https://twitter.com/esrtweet/status/2007666808929620413
1•tapanjk•2m ago•0 comments

Show HN: Pull, Replay or Inspect any webhook event

https://github.com/mehdigmira/basehook
1•mehdig10•3m ago•0 comments

AMD Contemplates and Engineers Yottascale AI Compute

https://www.nextplatform.com/2026/01/06/amd-contemplates-and-engineers-yottascale-ai-compute/
1•rbanffy•3m ago•0 comments

Implementing a (Vibed) LLM Coding Agent in Prolog

https://deepclause.substack.com/p/implementing-a-vibed-llm-coding-agent
1•schmuhblaster•3m ago•0 comments

Dell's CES 2026 chat was the most pleasingly un-AI briefing I've had in 5 years

https://www.pcgamer.com/hardware/dells-ces-2026-chat-was-the-most-pleasingly-un-ai-briefing-ive-h...
2•mossTechnician•4m ago•0 comments

Threadsafe_datastore, a simple, convenient thread-safe data store for Python

https://hlfshell.ai/feed/000034/
1•speckx•7m ago•0 comments

Ask HN: How do you do store-and-forward telemetry at the edge?

2•Aydarbek•9m ago•1 comments

Target has their own forensic lab to investigate shoplifters

https://thehorizonsun.com/features/2024/04/11/the-target-forensics-lab/
1•jeromechoo•10m ago•0 comments

Show HN: Tylax – A bidirectional LaTeX to Typst converter in Rust

https://github.com/scipenai/tylax
2•democat•10m ago•0 comments

Why Study CS? Thoughts on LLM-assisted software engineering

https://kmicinski.com/claude-code-and-why-study-cs
3•annjose•11m ago•0 comments

Commodore 64 floppy drive has the power to be a computer and runs BASIC

https://www.tomshardware.com/pc-components/cpus/commodore-64-floppy-drive-has-the-power-to-be-a-c...
5•rbanffy•13m ago•0 comments

Update: Finbley Adds an AI-Based Spending Analyst (Natural Language Queries)

https://www.finbley.com
1•mo_hackernews•14m ago•1 comments

LLM Problems Observed in Humans

https://embd.cc/llm-problems-observed-in-humans
2•js216•14m ago•0 comments

SanDisk terminates WD brands and introduces Optimus SSD range

https://www.igorslab.de/en/sandisk-ends-wd-brands-and-introduces-optimus-ssd-series/
2•speckx•15m ago•0 comments

Australia's Social Media Ban: Age Limits Won't Fix What's Wrong with Platforms

https://blog.mozilla.org/netpolicy/2025/12/19/australias-social-media-ban-why-age-limits-wont-fix...
2•PaulHoule•15m ago•0 comments

Show HN: Worldstream – Real-time stream of headlines from everywhere

https://worldstream.io
3•raj-shekhar•15m ago•0 comments

The launches and landings we're most excited about in 2026

https://arstechnica.com/space/2026/01/here-are-the-launches-and-landings-were-most-excited-about-...
1•rbanffy•16m ago•0 comments

New Year 2026: Fusion Updates from Helion and Commonwealth Fusion

2•ralfd•17m ago•0 comments

Ideas are cheap, execution is cheaper

https://davekiss.com/blog/ideas-are-cheap-execution-is-cheaper
1•vinhnx•17m ago•0 comments

Information Is Still Free

https://thehistoryoftheweb.com/information-is-still-free/
1•cdrnsf•17m ago•0 comments

US Job Openings Decline to Lowest Level in More Than a Year

https://www.bloomberg.com/news/articles/2026-01-07/us-job-openings-decline-to-lowest-level-in-mor...
43•toomuchtodo•18m ago•3 comments

AI pilots a free-flying robot inside the International Space Station

https://scienceclock.com/first-autonomous-ai-robot-flight-iss/
2•akg130522•18m ago•0 comments

Secure job offers through AI-powered interviews [video]

https://www.youtube.com/watch?v=IKUy-h9L5zg
1•snasan•18m ago•0 comments

AI means we don't have to deal with nerds dreaming up over-engineered solutions

https://twitter.com/garrett_makes/status/2008532223713022125
1•jonnycomputer•19m ago•3 comments

Wegmans grocery store uses biometric surveillance on shoppers

https://www.aol.com/articles/popular-grocery-store-chain-uses-130056099.html?_guc_consent_skip=17...
1•WaitWaitWha•19m ago•0 comments

Easily Accelerating Python with Rust via Claude Code

https://www.generativist.com/notes/2026/Jan/6/claude-code-python-and-rust-oh-my
2•generativist•19m ago•0 comments

Show HN: A Bento-based service for reliably streaming usage events into billing

https://twitter.com/tryflexprice/status/2008863789916274851
1•sudeepsd__•20m ago•0 comments

VR-based brain training for ADHD

https://medium.com/@6thMind/vr-based-brain-training-for-adhd-what-the-2025-research-reveals-about...
1•smanuel•20m ago•0 comments