frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•11mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Monad Tutorials Timeline

https://wiki.haskell.org/Monad_tutorials_timeline
2•brudgers•8m ago•1 comments

Lessons from Building an OTel Normalizer for GenAI

https://www.groundcover.com/blog/otel-normalizer-genai-part-1
1•thebitofmyheart•12m ago•0 comments

Physicists reveal universal speed limit on quantum information scrambling

https://phys.org/news/2026-04-physicists-reveal-universal-limit-quantum.html
1•thunderbong•18m ago•1 comments

Talking About Dark Matter – Sixty Symbols [video]

https://www.youtube.com/watch?v=qxuMRaUbWow
1•celias•19m ago•1 comments

Show HN: Vela Coach – an open-source coach that reads your Granola transcripts

https://github.com/Vela-Engineering/coach
1•yihlamur•19m ago•0 comments

Ashby's Law of Requisite Variety (Cybernetics)

https://en.wikipedia.org/wiki/Variety_(cybernetics)
1•nomilk•21m ago•0 comments

Humanoid Robots Are Still a Body Problem with Jerry Pratt [video]

https://www.youtube.com/watch?v=fsPDZjWMKMs
2•thunderbong•27m ago•0 comments

Meta stock drops on quarterly results 'internet disruptions' user numbers down

https://www.cnbc.com/2026/04/29/meta-q1-earnings-report-2026.html
1•1vuio0pswjnm7•31m ago•0 comments

In Backlash Against Tech in Schools, Parents Are Winning Rollbacks

https://www.nytimes.com/2026/04/29/technology/parents-school-tech-backlash.html
2•1vuio0pswjnm7•33m ago•0 comments

Tourist dies after being bitten at snake show while on vacation in Egypt

https://www.cnn.com/2026/04/28/africa/tourist-dies-snake-bite-egypt-scli-intl
1•fortran77•34m ago•0 comments

OpenAI, Sam Altman Hit with Slate of Lawsuits over Mass Shooting Canadian School

https://www.law.com/therecorder/2026/04/29/openai-sam-altman-hit-with-slate-of-lawsuits-over-mass...
2•1vuio0pswjnm7•36m ago•0 comments

Infrastructure Spend in the AI Era

https://redmonk.com/sogrady/2026/04/29/infrastructure-spend-in-the-ai-era/
1•mooreds•36m ago•0 comments

The Harmful Consequences of the Robustness Principle (2018)

https://datatracker.ietf.org/doc/html/draft-thomson-postel-was-wrong-03
1•Tomte•37m ago•0 comments

AMS – Detect unsafe LLMs in 30 seconds via activation analysis

https://github.com/GoogleCloudPlatform/activation-model-scanner
1•gmessenger•39m ago•0 comments

Fanspeak (1999)

https://www.fantasymaps.com/stuff/fanspeak.html
1•ipnon•40m ago•0 comments

Operation Sundevil (1990)

https://en.wikipedia.org/wiki/Operation_Sundevil
1•rickcarlino•46m ago•0 comments

Stablecoin Reserve Race: Brazil Pix, Morgan Stanley, OCC Charters

https://stablecoinbrief.substack.com/p/stablecoin-reserve-race-brazil-pix
1•knivef•51m ago•0 comments

Building agents that reach production systems with MCP

https://claude.com/blog/building-agents-that-reach-production-systems-with-mcp
1•gmays•57m ago•0 comments

Biology is a Burrito: A text- and visual-based journey through a living cell

https://burrito.bio/essays/biology-is-a-burrito
26•the-mitr•1h ago•3 comments

Can LLMs create lasting flashcards from readers' highlights?

https://memory-machines.com/report/
1•jryio•1h ago•0 comments

The Arpanet Reconstruction Project

https://obsolescence.dev/arpanet_home.html
2•ecliptik•1h ago•0 comments

Where the goblins came from

https://openai.com/index/where-the-goblins-came-from/
155•ilreb•1h ago•68 comments

Demonstrating the idea of gamma camera imaging [video]

https://www.youtube.com/watch?v=PyGlHtvihXA
4•num42•1h ago•0 comments

Ernie 5.1 Preview

https://ernie.baidu.com/
3•qainsights•1h ago•0 comments

Strait of Hormuz Daily Intelligence

https://insights.windward.ai/
3•jnord•1h ago•0 comments

LFM2-24B-A2B: Scaling Up the LFM2 Architecture

https://www.liquid.ai/blog/lfm2-24b-a2b
2•nateb2022•1h ago•0 comments

Finetuning Activates Verbatim Recall of Copyrighted Books in LLMs

https://github.com/cauchy221/Alignment-Whack-a-Mole-Code
29•reconnecting•1h ago•8 comments

Functional Programmers need to take a look at Zig

https://pure-systems.org/posts/2026-04-29-functional-programmers-need-to-take-a-look-at-zig.html
26•xngbuilds•1h ago•5 comments

Transponders to be installed on New York area airport ground vehicles

https://apnews.com/article/laguardia-plane-crash-air-canada-transponders-18578e94f2f7ec67b5ed45e9...
2•geox•1h ago•0 comments

GitHub Is Sinking

https://dbushell.com/2026/04/29/github-is-sinking/
5•xngbuilds•1h ago•1 comments