frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•1y ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Marfa Public Radio Puts You to Sleep

https://www.marfapublicradio.org/podcast/marfa-public-radio-puts-you-to-sleep
1•reaperducer•3m ago•0 comments

A Man Who Invented a Surgery to Cure Himself

https://medium.com/swlh/doug-lindsay-the-man-who-cure-himself-12d40d3f643e
1•raynchad•3m ago•0 comments

Direct observation of the superallowed α-decay of 104Te

https://www.nature.com/articles/s41586-026-10581-w
1•thunderbong•4m ago•0 comments

What Barbarians Like to Take Private

https://www.gmo.com/americas/research-library/part-1-what-barbarians-like-to-take-private_gmoquar...
1•andsoitis•9m ago•0 comments

US Layoffs Skyrocket to Highest Level Since Pandemic AI Blamed for 40% of Cuts

https://www.ibtimes.co.uk/us-layoffs-skyrocket-highest-level-since-pandemic-tech-giants-blame-ai-...
3•yogthos•10m ago•0 comments

Almost always look on the bright side of life

https://www.economist.com/business/2026/05/21/why-you-should-almost-always-look-on-the-bright-sid...
1•andsoitis•16m ago•0 comments

After 80 Years, Mathematicians Give Famed 'Erdős Method' an Upgrade

https://www.quantamagazine.org/after-80-years-mathematicians-give-famed-erdos-method-an-upgrade-2...
1•signa11•19m ago•0 comments

Grantham Warns U.S. Stocks Could Plunge 70% / Most Expensive Market in History

https://247wallst.com/investing/2026/06/26/jeremy-grantham-warns-u-s-stocks-could-plunge-70-in-th...
3•andsoitis•25m ago•0 comments

Feds Killed Polestar and Spared Volvo. That Should Terrify You

https://www.thedrive.com/news/feds-killed-polestar-and-spared-volvo-that-should-terrify-you
3•mraniki•30m ago•5 comments

I built a 100% local network privacy appliance to stop smart home spying

https://www.edgedefenseai.com/
1•arundass•32m ago•1 comments

China Has Matched Anthropic in Cybersecurity, Resetting AI Race

https://www.wsj.com/tech/ai/chinese-ai-anthropic-mythos-cybersecurity-574b02c2
4•madars•34m ago•3 comments

What Happens When You Run 10k Concurrent Lambda Functions Against DynamoDB

https://medium.com/@yalovoy/what-happens-when-you-run-10-000-concurrent-lambda-functions-against-...
1•zero-ground-445•35m ago•0 comments

Amble One

https://driveamble.com/pages/amble-one
2•dnw•38m ago•0 comments

Show HN: FSM – an advanced system monitor for Linux

https://github.com/mskrasnov/FSM
1•mskrasnov•39m ago•0 comments

The AI "Super Bubble" Warning Is a Filter, Not a Funeral

https://www.pentesty.co/blog/ai-super-bubble-cybersecurity-filter-2026
2•johnzoro107•43m ago•0 comments

Show HN: SpinnerRecruit – targeted job ads in CLI for AI wait states

https://www.spinnerrecruit.dev/
1•jamessmu•45m ago•3 comments

Response to AI slop is from Robin Williams

https://jayacunzo.com/blog/your-move-chief
20•herbertl•58m ago•3 comments

Chrome Extension to Bypass Paywalls

https://gitflic.ru/project/magnolia1234/bypass-paywalls-chrome-clean
1•thunderbong•58m ago•1 comments

Turning music into a chore is how I became a musician

https://the.scapegoat.dev/turning-music-into-a-chore-is-what-made-me-an-artist/
2•herbertl•1h ago•0 comments

Microchip June 2026: AVR LA Family [pdf]

https://ww1.microchip.com/downloads/aemDocuments/documents/MCU08/ProductDocuments/Brochures/AVR-L...
2•dragontamer•1h ago•1 comments

Show HN: Decomp Academy – Learn to decompile GameCube games into matching C

https://decomp-academy.dev
24•jackpriceburns•1h ago•7 comments

Show HN: Shopify UCP is insanely powerful

https://stack412.com/
2•westche2222•1h ago•3 comments

I designed and synthesized PAC-832 in a chemistry lab I built in my garage

https://twitter.com/DouglasYaoDY/status/2070904914050797582
3•gasull•1h ago•1 comments

People and Blogs Interview: David Cain, Raptitude

https://manuelmoreale.com/interview/david-cain
3•Curiositry•1h ago•0 comments

From Prompting Agents to Loop Engineering

https://twitter.com/omarsar0/status/2068008743153832264
5•gmays•1h ago•2 comments

Slop, trust, and a three-line patch

https://klez.me/2026/06/28/slop-trust-and-a-three-line-patch/
2•the_kLeZ•1h ago•1 comments

Google Patent Reveals Satellite Messages May Carry Device Tracking Data

https://patentlyze.com/patent/google-stuffing-device-data-satellite-messages/
3•Dfol•1h ago•2 comments

Show HN: Moumantai – self-hosted, agent-driven apps you can use on any device

https://github.com/xiang-deng/moumantai
2•no_0044•1h ago•0 comments

AMD Strix Halo RDMA Cluster Setup Guide

https://github.com/kyuz0/amd-strix-halo-vllm-toolboxes/blob/main/rdma_cluster/setup_guide.md
32•jakogut•1h ago•1 comments

GTA 3 on a Volumetric Display (2025) [video]

https://www.youtube.com/watch?v=onYH5gvlnzE
2•Tiberium•1h ago•0 comments