frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•8mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Show HN: Solving Sudoku reasoning via Energy Geometric models

https://www.davisgeometric.com/index.html
1•epokh•1m ago•0 comments

I'm building a crowdsourced podcast episode tonight from YOUR articles

1•pavkatar•1m ago•0 comments

Forwardly-Evaluated Build Systems

https://garnix.io/blog/garn2/
1•birdculture•1m ago•0 comments

The Origins and Limitations of AMD's Revival

https://thechipletter.substack.com/p/the-origins-and-limitations-of-amds
1•rbanffy•11m ago•0 comments

The Website Is Down #1: Sales Guy vs. Web Dude – [video]

https://www.youtube.com/watch?v=uRGljemfwUE
1•sydney6•13m ago•0 comments

From specification to stress test: a weekend with Claude

https://www.juxt.pro/blog/from-specification-to-stress-test/
4•henrygarner•17m ago•1 comments

Linus Torvalds rejects MMC changes for Linux 7.0 cycle

https://www.phoronix.com/news/Linux-7.0-No-MMC-Changes
1•spyke112•18m ago•0 comments

James Van Der Beek, 'Dawson's Creek' Star, Has Died

https://www.cnn.com/2026/02/11/entertainment/james-van-der-beek-death
2•Einenlum•18m ago•0 comments

The UK Royal Mint is running a treasure hunt to find a gold bar

https://www.royalmint.com/shop/limited-editions/the-great-british-treasure-hunt/
1•simonjgreen•19m ago•0 comments

Show HN: Tymr – simple time tracking and invoicing for freelancers

https://www.tymr.digital/
1•hustlecoding•19m ago•0 comments

How Many Biweekly Pay Periods in 2026? (It's Not What You'd Expect)

https://saveku.com/blog/how-many-biweekly-pay-periods-in-2026-it-s-not-what-you-d-expect
1•roywj•21m ago•0 comments

An experiment in demand-gated, AI-generated apparel (no inventory)

https://ilors.com
1•cdalex•23m ago•1 comments

GLM-5: From Vibe Coding to Agentic Engineering

https://simonwillison.net/2026/Feb/11/glm-5/
1•onurkanbkrc•24m ago•0 comments

A stochastic state model for Bitcoin

https://semn.ai/
1•_devfrend•27m ago•0 comments

Show HN: SQBuilder – UI Google Search query builder

https://sqbuilder.fly.dev/
1•Igor_Wiwi•29m ago•0 comments

Europe spending on sovereign cloud infrastructure to triple from 2025-2027

https://www.datacenterdynamics.com/en/news/europe-spending-on-sovereign-cloud-infrastructure-to-t...
2•belter•31m ago•0 comments

SotA ARC-AGI-2 Results with REPL Agents

https://www.symbolica.ai/blog/arcgentica
1•tosh•33m ago•0 comments

China's CO2 emissions have now been 'flat or falling' for 21 months

https://www.carbonbrief.org/analysis-chinas-co2-emissions-have-now-been-flat-or-falling-for-21-mo...
7•JoiDegn•35m ago•1 comments

AI researchers are sounding the alarm on their way out the door

https://www.cnn.com/2026/02/11/business/openai-anthropic-departures-nightcap
2•rramadass•40m ago•0 comments

Heartbeat pings from your .NET workers

https://cron-monitor.com/
1•temakonkin•42m ago•0 comments

Grok4 sabotages shutdown 97% of the time,even if instructed not in system prompt

https://arxiv.org/abs/2509.14260
6•agenticagent•43m ago•4 comments

Python Is for Everyone: Inside the PSF's D&I Work Group

https://georgiker.com/blog/python-is-for-everyone/
1•lumpa•44m ago•0 comments

UK Supreme Court Issues Milestone Judgment for AI and Software Patentability

https://ipwatchdog.com/2026/02/11/uk-supreme-court-issues-milestone-judgment-ai-software-patentab...
3•zoobab•46m ago•0 comments

The missing digit of Stela C

https://johncarlosbaez.wordpress.com/2026/02/12/stela-c/
2•chmaynard•48m ago•0 comments

Warcraft III Peon Voice Notifications but for Codex

https://github.com/mrdavey/codex-peon
1•daveytea•50m ago•1 comments

I'm not feeling the async pressure (2020)

https://lucumr.pocoo.org/2020/1/1/async-pressure/
1•tosh•50m ago•0 comments

Everyone's looking for a bubble. No one sees the stampede

https://www.exponentialview.co/p/bubble-or-stampede
1•swolpers•52m ago•0 comments

Claude Opus 4.6 Escalates Things Quickly

https://thezvi.substack.com/p/claude-opus-46-escalates-things-quickly
1•7777777phil•54m ago•0 comments

A chatbot's worst enemy is page refresh

https://zknill.io/posts/chatbots-worst-enemy-is-page-refresh/
1•zknill•54m ago•0 comments

AI and the Death of the Billable Hour

https://deadneurons.substack.com/p/ai-and-the-death-of-the-billable
1•nr378•56m ago•0 comments