Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•1y ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool use, etc.).

What's in Zeno for now?
- Auditable, stateless reward functions for Python code: docstrings, ruff linting, type hints, recursion, and more.
- Works directly with Hugging Face's TRL or any RL loop: plug reward functions in as needed.
- MIT licensed and minimal.
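To make "stateless, deterministic reward" concrete, here is a minimal sketch of a docstring-coverage reward written with only the standard library. It is not taken from Zeno's code; the names `docstring_reward` and `batch_docstring_reward` are hypothetical, and the `(completions, **kwargs)` shape of the batch wrapper is an assumption modeled on how TRL's GRPO trainer is documented to call custom reward functions on plain-text completions.

```python
import ast


def docstring_reward(code: str) -> float:
    """Return the fraction of functions and classes in `code` that have a docstring.

    Hypothetical helper for illustration; not part of Zeno's API.
    """
    try:
        tree = ast.parse(code)
    except SyntaxError:
        return 0.0  # code that does not parse earns no reward

    # Collect every function, async function, and class definition.
    nodes = [
        node
        for node in ast.walk(tree)
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef, ast.ClassDef))
    ]
    if not nodes:
        return 0.0  # nothing to document, so nothing to reward

    documented = sum(1 for node in nodes if ast.get_docstring(node) is not None)
    return documented / len(nodes)


def batch_docstring_reward(completions: list[str], **kwargs) -> list[float]:
    """Score a batch of completions; extra keyword arguments are ignored.

    The signature is an assumption about the calling convention of the RL loop.
    """
    return [docstring_reward(completion) for completion in completions]
```

Because the score depends only on the completion text, the same input always produces the same reward, and the function can be audited by reading it. In any other RL loop it can be called directly on a list of generated strings.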

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning, and agentic behaviors are on the to-do list.

Repo: https://github.com/think-a-tron/zeno

Docs and more details are in the README.

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.
