
Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•12mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extensible across domains (math, retrieval, reasoning, tool use, etc.).

What's in Zeno for now?
- Auditable, stateless reward functions for Python code: docstrings, ruff linting, type hints, recursion, and more.
- Works directly with Hugging Face's TRL or any RL loop: plug reward functions in as needed.
- MIT licensed and minimal.
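
To give a feel for what a stateless, deterministic reward for Python code can look like, here is a minimal sketch. It is not taken from the Zeno codebase: the function names `docstring_reward` and `type_hint_reward` are hypothetical, and the `(completions, **kwargs) -> list[float]` shape is assumed because it matches how custom reward callables are commonly passed to TRL's GRPO trainer. Check the README for Zeno's actual API.

```python
import ast


def _functions(code):
    """Parse code and return its function definitions, or None if it does not parse."""
    try:
        tree = ast.parse(code)
    except SyntaxError:
        return None
    return [
        node
        for node in ast.walk(tree)
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
    ]


def docstring_reward(completions, **kwargs):
    """Hypothetical reward: fraction of functions in each completion with a docstring."""
    scores = []
    for code in completions:
        funcs = _functions(code)
        if not funcs:  # unparseable code or no functions at all
            scores.append(0.0)
            continue
        documented = sum(1 for f in funcs if ast.get_docstring(f))
        scores.append(documented / len(funcs))
    return scores


def type_hint_reward(completions, **kwargs):
    """Hypothetical reward: fraction of functions with every parameter and the return annotated."""
    scores = []
    for code in completions:
        funcs = _functions(code)
        if not funcs:
            scores.append(0.0)
            continue
        fully_typed = 0
        for f in funcs:
            params = f.args.posonlyargs + f.args.args + f.args.kwonlyargs
            if f.returns is not None and all(p.annotation is not None for p in params):
                fully_typed += 1
        scores.append(fully_typed / len(funcs))
    return scores


if __name__ == "__main__":
    samples = [
        'def f(x: int) -> int:\n    """Double x."""\n    return 2 * x\n',
        "def g(x):\n    return x\n",
    ]
    print(docstring_reward(samples))
    print(type_hint_reward(samples))
```

Both functions depend only on the completion text, so the same input always yields the same score and each score can be audited by reading the code. Under the assumption above, they could be handed to an RL loop as a list of reward callables.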

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning, and agentic behaviors are on the to-do list.

Repo: https://github.com/think-a-tron/zeno

Docs and more details are in the README.

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.
