frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•11mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Three reasons why DeepSeek’s new model matters

https://www.technologyreview.com/2026/04/24/1136422/why-deepseeks-v4-matters/
1•thunderbong•1m ago•0 comments

Show HN: Cask.news – discover and track new homebrew Mac apps

https://cask.news/
1•to•1m ago•0 comments

I built a benchmark for testing LLMs playing Gomoku

https://github.com/homerquan/GomokuBench
1•homerquan•5m ago•0 comments

British cyclist takes KOM on San Francisco's steepest street with 41% gradient

https://www.bikeradar.com/news/harry-macfarlane-san-francisco-kom
1•littlexsparkee•5m ago•0 comments

Cheapest GPUs in the World

https://timlig.com/posts/cheapest-gpus-in-the-world/
1•anujsharmax•6m ago•0 comments

Meta Is Preparing to Have to Undo Its Manus Acquisition After China Ban

https://www.wsj.com/tech/ai/meta-is-preparing-to-have-to-undo-its-manus-acquisition-after-china-b...
2•thm•8m ago•0 comments

The AI Rug Pull

https://www.warman.life/blog/2026-04-27-the-apprenticeship/
3•shaunistyping•16m ago•0 comments

How to Start Journaling

https://www.theguardian.com/wellness/2026/apr/27/how-to-start-journaling
4•devonnull•19m ago•0 comments

ATS Resume Forge – an ATS-focused resume builder for job seekers

https://www.atsresumeforge.com/
1•kinrell•21m ago•1 comments

Why your 'Private Google Access enabled' subnet still bills Cloud NAT

https://github.com/FootprintAI/Containarium
1•hsin003•22m ago•1 comments

LingBot-Map: Streaming 3D reconstruction with geometric context transformer

https://technology.robbyant.com/lingbot-map
2•nateb2022•27m ago•0 comments

Florida AG probes ChatGPT's role in USF student killings

https://www.axios.com/local/tampa-bay/2026/04/27/florida-ag-openai-chatgpt-usf-murders-ai-account...
1•1vuio0pswjnm7•27m ago•1 comments

Claire's closes all 154 stores in UK and Ireland with loss of 1,300 jobs

https://www.bbc.com/news/articles/cg4047qnpk2o
4•stevekemp•31m ago•0 comments

Why Spotify has no button to filter out AI music

https://www.bbc.co.uk/news/articles/cd7jpg4w181o
2•dijksterhuis•34m ago•1 comments

The Hold

https://www.subbu.org/essays/2026/the-hold/
1•freediver•35m ago•0 comments

Cisco Introduces Universal Quantum Switch

https://newsroom.cisco.com/c/r/newsroom/en/us/a/y2026/m04/cisco-introduces-universal-quantum-swit...
1•kousthub•35m ago•1 comments

Conus Electrical Resistivity at 35km

https://www.usgs.gov/media/images/conus-electrical-resistivity-35km
2•testingonetwo34•35m ago•1 comments

Anthropic's Argument for Mythos SWE-bench improvement contains a fatal error

https://www.philosophicalhacker.com/post/anthropic-error/
2•kmdupree•37m ago•0 comments

Devin for Terminal

https://devin.ai/terminal
1•qainsights•41m ago•1 comments

Stripe: Radar Technical Guide

https://stripe.com/in/guides/primer-on-machine-learning-for-fraud-protection
1•jonnonz•41m ago•0 comments

OpenJDK 21 April 2026 CVEs Explain

https://tux.re/forum/viewtopic.php?t=175
2•Neteam•42m ago•0 comments

The Conspiracy Against High Temperature Sampling

https://gist.github.com/Hellisotherpeople/71ba712f9f899adcb08b94bce20d5397
2•Der_Einzige•44m ago•0 comments

TeamPCP Supply Chain Campaign: Update 008

https://isc.sans.edu/diary/32926
1•jruohonen•46m ago•0 comments

Grocyy – AI receipt scanner that tracks grocery spending by item, not just total

https://grocyy.com/
1•Devanship1•49m ago•0 comments

Video Upscaler with Temporal Smoothing

https://github.com/freeaigit/video-upscaler
1•nadermx•55m ago•0 comments

Try Contra Dancing

https://www.benkuhn.net/contra/
2•jefftk•57m ago•0 comments

Consequences of passing too few register parameters to a C function

https://devblogs.microsoft.com/oldnewthing/20260427-00/?p=112271
1•aragonite•58m ago•0 comments

China's push to commercialize research: match 680k innovators with companies

https://www.nature.com/articles/d41586-026-01202-7
2•manvel_hn•1h ago•0 comments

Show HN: See your computer's audio output on a real-time piano

https://github.com/ecstrema/overchords
1•ecstrema•1h ago•0 comments

Show HN: PrePrompt – rewrites vague prompts before they reach the LLM

https://preprompt.org/
2•yashdeeptehlan•1h ago•1 comments