frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•10mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

The Origin of AI's 'Reasoning' Abilities

https://www.theatlantic.com/technology/2026/04/4chan-ai-dungeon-thinking-reasoning/686794/
1•a_w•52s ago•0 comments

Traffic fatalities in US increased 15% on same days big albums were released

https://www.nytimes.com/2026/04/10/well/car-crashes-streaming-friday-harvard.html
1•bookofjoe•7m ago•1 comments

Iran's nuke confession blows reasons for America's war wide open

https://www.dailymail.co.uk/sciencetech/article-15732615/iran-nuclear-bomb-confession-ali-motahar...
1•Bender•15m ago•1 comments

Show HN: Idea File for LLM Cycling Coach

https://gist.github.com/leourbina/4db27d9a0a86b9e1551bf9d4b3fd6dad
1•leourbina•16m ago•0 comments

An open source template for building cloud agents

https://github.com/vercel-labs/open-agents
1•obilgic•18m ago•0 comments

Open Agents by Vercel

https://open-agents.dev/
1•obilgic•19m ago•0 comments

Grok Imagine 2.0 – AI-Powered Image Generation

https://grokimagine2.io
2•danielmateo773•21m ago•0 comments

Voxtral TTS – High-Quality Text-to-Speech API

https://voxtral-tts.com
1•danielmateo773•22m ago•0 comments

I built a local real estate site for Ottawa with neighborhood market data

https://maisonpropertygroup.ca
1•thugdrama•27m ago•0 comments

Scandinavian Governance: A Story of Trust and Shared Power

https://polycentricleadership.com/casestudies/scandinavian-governance-a-story-of-trust-and-shared...
1•thunderbong•27m ago•0 comments

The Accursèd Alphabetical Clock

https://boat.horse/clock/index.html
3•ohjeez•37m ago•0 comments

Not Even Noise-Cancelling Headphones Can Block This Bicycle Bell

https://www.carscoops.com/2026/04/skoda-duobell-anc/
1•ohjeez•41m ago•0 comments

Ask HN: What's with the Wargames-like UX lately?

3•beatthatflight•42m ago•2 comments

Why QA and Cyber Security Matter More Than Ever [video]

https://www.youtube.com/watch?v=4K2p7eXAYTM
1•taleodor•42m ago•0 comments

Woman with three deadly diseases has 'remarkable' recovery after cell therapy

https://www.theguardian.com/science/2026/apr/09/autoimmune-diseases-cell-therapy-immune-reset
5•gmays•45m ago•0 comments

Sheaf, a minimal custom 65% keyboard

https://github.com/nxrmqlly/sheaf65
1•sadeshmukh•45m ago•0 comments

Show HN: Memwright – Self-hosted memory for multi-agent teams, no LLM in path

https://github.com/bolnet/agent-memory
1•Bolnet•45m ago•0 comments

Understanding the FFT Algorithm (2013)

https://jakevdp.github.io/blog/2013/08/28/understanding-the-fft/
1•peter_d_sherman•47m ago•0 comments

FL man arrested for running multi-state Ponzi scheme, defrauding victims in MA

https://www.boston25news.com/news/local/florida-man-arrested-running-multi-state-ponzi-scheme-def...
1•1vuio0pswjnm7•55m ago•0 comments

Ask HN: Apple force-updated me to Tahoe. Worth fighting?

2•strogonoff•1h ago•2 comments

Show HN: Keynot – Kill PowerPoint with HTML

https://github.com/shawnzam/keynot
2•shawnzam•1h ago•0 comments

Dependency cooldowns turn you into a free-rider

https://calpaterson.com/deps.html
2•pabs3•1h ago•0 comments

One size fits none: let communities build for themselves

https://werd.io/one-size-fits-none-let-communities-build-for-themselves/
1•benwerd•1h ago•0 comments

Glyphosate resistance: a driver for multidrug-resistant clinical strains?

https://www.frontiersin.org/journals/microbiology/articles/10.3389/fmicb.2026.1740431/full
1•PaulHoule•1h ago•0 comments

Gauss' Secret Way to Calculate π Faster [video]

https://www.youtube.com/watch?v=7qiDDhIYx48
1•peter_d_sherman•1h ago•1 comments

Not all elementary functions can be expressed with exp-minus-log

https://www.stylewarning.com/posts/not-all-elementary/
4•mmastrac•1h ago•0 comments

Show HN: StockFit API – structured SEC EDGAR data with a free tier

https://developer.stockfit.io
1•areimann•1h ago•1 comments

The GNU libc atanh is correctly rounded

https://inria.hal.science/hal-05591661
2•matt_d•1h ago•0 comments

Google Arts and Culture

https://artsandculture.google.com/
2•satvikpendem•1h ago•0 comments

How to recover from a Git force push

https://gist.github.com/tomj/758d16b7f8e474035db72688663bb3cb
2•nstj•1h ago•0 comments