frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•1y ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

HN: Wispr Flow vs. Whisper by Remskill – voice typing app vs. full AI assistant

https://whisper.remskill.com/
1•denys12•1m ago•0 comments

Coalton is an efficient, statically typed Lisp with ideas from Haskell and OCaml

https://coalton-lang.github.io/
1•b-man•2m ago•0 comments

The End of the Digital Age

https://cacm.acm.org/opinion/the-end-of-the-digital-age/
1•berlianta•2m ago•0 comments

AI tools lead to 'clear racial disparities' in job hiring

https://www.ft.com/content/5c442b38-6989-461a-988e-653f7a275eee
1•1vuio0pswjnm7•2m ago•0 comments

Cloudflare Secrets Manager for Worker secrets and vars

https://github.com/nyigoro/cloudflare-secrets-manager
1•light_ideas•3m ago•0 comments

The longer you wait, the longer you should expect to wait

https://world.hey.com/apetrov/the-longer-you-wait-the-longer-you-should-expect-to-wait-30e2511a
1•apetrov•3m ago•0 comments

LLM Driven AutoForecasting with Sktime's `Craft()`

https://pub.towardsai.net/llm-driven-autoforecasting-with-sktimes-craft-0355f5c720e8
1•528491•5m ago•0 comments

So I'm making my own Spotify Wrapped this year but better

https://nickiedemakos.substack.com/p/so-im-making-my-own-spotify-wrapped
1•dzulp0d•5m ago•0 comments

Functional Programming in Lean

https://leanprover.github.io/functional_programming_in_lean/
1•tosh•5m ago•0 comments

Excerpts from Pope Leo XIV's manifesto about humanity in the AI era

https://apnews.com/article/vatican-ai-encyclical-pope-leo-excerpts-ee0de875adbdb3d599d4da2c597ff7bd
1•1vuio0pswjnm7•5m ago•0 comments

Ghost CMS SQL injection flaw exploited in large-scale ClickFix campaign

https://www.bleepingcomputer.com/news/security/ghost-cms-sql-injection-flaw-exploited-in-large-sc...
1•bushwart•6m ago•0 comments

GridOS – State Machine Challenges

https://everybody.codes/gridos/missions
1•vismit2000•6m ago•0 comments

PayPal's online checkout empire under siege as rivals squeeze its core business

https://apnews.com/article/paypal-apple-pay-payments-buy-now-50054e5db0c773c8fe9a437708e1d3a9
1•1vuio0pswjnm7•7m ago•0 comments

50 Years of Proof Assistants

https://lawrencecpaulson.github.io/2025/12/05/History_of_Proof_Assistants.html
1•tosh•7m ago•0 comments

Deterministic Automation for a Probabilistic System

https://stack72.dev/deterministic-automation-for-a-probabilistic-system/
1•nickstinemates•7m ago•0 comments

Tensor Cryptographic Behavioural Audit (TCBA)

https://physivitis.tech/services/tcba
1•kkjd1426•7m ago•1 comments

Show HN: Local-first PDF redaction for permanently removing data

1•daoxiaoyue2012•7m ago•0 comments

Interleaved Deltas

https://mmapped.blog/posts/51-interleaved-deltas
1•surprisetalk•8m ago•0 comments

Skills vs. MCP vs. prompts: which agent setup works best?

https://www.agentvoyagerproject.com/captains-log/1
1•pmkelly4444•9m ago•0 comments

For the average price of a car in the US, you could buy 5 new Chinese EVs

https://www.reuters.com/business/autos-transportation/average-price-car-us-you-could-buy-5-new-ch...
2•bushwart•11m ago•0 comments

Model is currently experiencing high demand

https://mayberay.bearblog.dev/this-model-is-currently-experiencing-high-demand/
1•mugamuga•13m ago•0 comments

Domestic Transport Usage by Mode

https://www.gov.uk/government/statistics/daily-domestic-transport-use-by-mode/domestic-transport-...
1•bookofjoe•14m ago•0 comments

The Challenge of Cross-language Interoperability (2013)

https://queue.acm.org/detail.cfm?id=2543971
1•downbad_•15m ago•0 comments

Nvidia Vera CPU seems to beat AMD and Intel on server workloads

https://www.phoronix.com/review/nvidia-vera-benchmarks/11
2•paoliniluis•15m ago•0 comments

GTA V – Graphics Study (2015)

https://www.adriancourreges.com/blog/2015/11/02/gta-v-graphics-study/
1•downbad_•16m ago•0 comments

A Wetland Without Water

https://www.theguardian.com/global-development/2026/may/26/chile-datacentres-water-tech-companies...
1•tosh•16m ago•0 comments

Show HN: Kakeibo – a simple budget tracking app for simple people

https://getkakeibo.com/en/
1•palpfiction•16m ago•0 comments

Show HN: Compile-time model-id validation with declared capability

https://github.com/yujonglee/openrouter-toolkit
1•yujonglee•18m ago•1 comments

Why Dags Are Taking over Auto-Research (With the Founders of Paradigma)

https://www.youtube.com/watch?v=zBlu6j5ryo0
1•research_pie•18m ago•0 comments

Erich's Packing Center

https://erich-friedman.github.io/packing/
1•yzydserd•19m ago•0 comments