frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•1y ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Teaching an LLM to Speak Vestaboard Note: Building Vestaboard AI

https://corti.com/teaching-an-llm-to-speak-vestaboard-note-building-vestaboard-ai/
1•TechPreacher•21s ago•0 comments

kivo: A lightweight desktop teleprompter

https://github.com/rajtilakjee/kivo
1•ilreb•58s ago•0 comments

Extracting sound effects from a Switch game

https://blog.alexbeals.com/posts/extracting-sound-effects-from-a-switch-game
1•dado3212•2m ago•0 comments

Herdr: Agent multiplexer that lives in your terminal

https://github.com/ogulcancelik/herdr
1•mzehrer•2m ago•0 comments

Thai family mourns teen girl found dead in suitcase as Australian arrested

https://www.reuters.com/world/asia-pacific/thai-family-mourns-teen-girl-found-dead-suitcase-austr...
1•petethomas•2m ago•0 comments

Optimizing LLVM's Bump Allocator

https://maskray.me/blog/2026-06-28-optimizing-llvm-bump-allocator
1•jandeboevrie•4m ago•0 comments

Basecoat 1.0

https://github.com/hunvreus/basecoat/releases/tag/1.0.0
1•dabinat•5m ago•0 comments

Trillion-Dollar Borrowing Binge Lifting the Stock Market to Risky Heights

https://www.wsj.com/finance/stocks/the-trillion-dollar-borrowing-binge-lifting-the-stock-market-t...
1•petethomas•9m ago•0 comments

Australia investigating five social media giants for not enforcing ban on kids

https://www.theregister.com/public-sector/2026/06/29/australia-investigating-five-social-media-gi...
2•defrost•13m ago•0 comments

Amazon seller reveals shadow bribery market within Amazon

https://www.mercurynews.com/2026/06/24/amazon-seller-reveals-rare-glimpse-of-shadow-bribery-market/
1•Gaishan•13m ago•0 comments

'Superallowed' alpha decay seen for the first time

https://physicsworld.com/a/superallowed-alpha-decay-seen-for-the-first-time/
2•visha1v•15m ago•0 comments

New model of ocean waves sheds light on the spread of microplastic pollution

https://physicsworld.com/a/new-model-of-ocean-waves-sheds-fresh-light-on-the-spread-of-microplast...
2•visha1v•16m ago•0 comments

PCB-QA: Evaluating LLMs over the First PCB Design Question-Answer Dataset

https://arxiv.org/abs/2606.23704
2•teleforce•24m ago•0 comments

The 1000-mile handshake from Aden to Mangalore

https://drbhaskardasgupta1.substack.com/p/the-1000-mile-handshake
2•trojanalert•24m ago•0 comments

From Prompts to Loops: Building Autonomous Coding Agents

https://animeshgaitonde.medium.com/from-prompts-to-loops-building-autonomous-coding-agents-6135bf...
2•animesh371g•29m ago•0 comments

"Warming Hole" Heat Content Variations Are Caused by Ocean Heat Transport

https://agupubs.onlinelibrary.wiley.com/doi/full/10.1029/2025GL118383
5•baxtr•36m ago•0 comments

392-Year-Old Bonsai Tree That Survived the Hiroshima Atomic Blast (2024)

https://www.openculture.com/2024/05/this-392-year-old-bonsai-tree-survived-the-hiroshima-atomic-b...
4•vednig•40m ago•0 comments

'Down from Londoners' Are Transforming England's Seaside Towns

https://www.bloomberg.com/news/articles/2026-06-26/londoners-escape-to-england-s-seaside-raises-h...
2•petethomas•43m ago•0 comments

We Built Osmium for Scale

https://osmium.chat/blog/how-we-built-osmium-for-scale/
2•ateesdalejr•44m ago•0 comments

Remember SCANTRON? How did that work? [video]

https://www.youtube.com/watch?v=x2RvPFvR-CI
4•fortran77•45m ago•0 comments

My New Life with the Palantir Chore Coat

https://www.theatlantic.com/technology/2026/06/palantir-chore-coat/687686/
3•colinprince•45m ago•0 comments

PCB-Bench: Benchmarking LLMs for PCB Placement and Routing (ICLR 2026)

https://github.com/digailab/PCB-Bench
3•teleforce•46m ago•0 comments

Age verification is just a precursor to automated attribution of speech

https://nonogra.ph/age-verification-is-just-a-precursor-to-attribution-of-speech-06-29-2026
104•arkhiver•47m ago•19 comments

MFM: PINN based Motion Foundation Model

https://huggingface.co/JuSeongvin/pinn
2•urgentINC•51m ago•0 comments

Breaking the Tokenizer Barrier: On-Policy Distillation Across Model Families

https://arxiv.org/abs/2606.09456
2•Jimmc414•51m ago•0 comments

What Should We Optimize Away?

https://www.autodidacts.io/holistic-optimization/
3•Tomte•53m ago•0 comments

Collider: A meson package manager – A hash proves the bytes, not the source

https://collider.ee/blog/2026-06-28-1500_a_hash_proves_the_bytes_not_the_source/#a-hash-proves-th...
2•mog_dev•55m ago•0 comments

An Open Letter to Pete Buttigieg

https://richprocida.substack.com/p/an-open-letter-to-pete-buttigieg
2•RichProcida•1h ago•0 comments

SpaceX just landed in 401(k)s due to key index rule changes

https://moneywise.com/news/top-stories/spacex-401k-anthropic-openai-ipo-index-fund-rules
4•voxadam•1h ago•0 comments

GraphQL MCP Server and GraphiQL Plugins

https://graphql-mcp.com/
3•robjampar•1h ago•1 comments