frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•11mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Intercom Becomes Fin

https://www.intercom.com/blog/today-intercom-becomes-fin/
1•tjwds•54s ago•0 comments

Meta employees protest against mouse tracking tech at US offices

https://www.reuters.com/sustainability/society-equity/meta-us-employees-organize-protest-against-...
5•delichon•3m ago•0 comments

How a 150-year-old Japanese workshop survived the age of slop and distraction

https://bigthinkmedia.substack.com/p/how-a-small-shop-in-kyoto-connects
1•Duanemclemore•4m ago•0 comments

Snitches Get 202'd

https://github.com/agudulin/simple-proxy
1•sagod•4m ago•0 comments

CS Trivia: computer science crosswords

https://cstrivia.com/
1•bluejulius•5m ago•0 comments

Show HN: Fast, reliable MCP for LinkedIn, Uber, Venmo (r/w)

https://candle.fi
1•liambl•5m ago•0 comments

Android Intrusion Logging for consensual forensic analysis

https://securitylab.amnesty.org/latest/2026/05/android-intrusion-logging-as-a-new-source-of-data-...
1•ledoge•8m ago•0 comments

Show HN: GitGlimpse – CLI for understanding AI-generated Git diffs

https://gitglimpse.com
1•dinoze•8m ago•0 comments

Swatch Internet Time

https://en.wikipedia.org/wiki/Swatch_Internet_Time
2•florianmari•8m ago•0 comments

Gea: A Compile-Time Reactive UI Framework That's Just JavaScript

https://www.coyotiv.com/blog/posts/introducing-gea-compile-time-reactive-ui-framework/
1•arbayi•10m ago•0 comments

Railway which inspired Thomas the Tank Engine marks 75 years (of Presevation)

https://www.bbc.co.uk/news/articles/cz02347714mo
1•rb2e•12m ago•0 comments

Scrcpy v4.0

https://github.com/Genymobile/scrcpy/releases/tag/v4.0
2•xnx•12m ago•1 comments

MergeBrake – catch DB-breaking Prisma/Drizzle PRs before merge

https://github.com/mergebrake/mergebrake
1•mergebrake•12m ago•0 comments

Free llms.txt generator (browser-side, no signup)

https://geotrackerai.com/tools/llms-txt-generator
1•geotrackerai•14m ago•0 comments

World Models Can Change Everything

https://weightythoughts.com/p/world-models-can-change-everything
1•gmays•14m ago•0 comments

Prowl: Native macOS codings agent orchestrator

https://tangled.org/onev.cat/3mktize72cy22
1•nerdypepper•15m ago•0 comments

SpaceX backs Anthropic with data centre deal amidst Musk's OpenAI lawsuit

https://www.aljazeera.com/economy/2026/5/6/spacex-backs-anthropic-with-data-centre-deal-amidst-mu...
3•billybuckwheat•17m ago•0 comments

Any app on recent Android versions can leak certain traffic

https://mullvad.net/en/blog/any-app-on-recent-android-versions-can-leak-certain-traffic
1•jonah-archive•17m ago•0 comments

Show HN: Ranking every disease (the unmet needs index)

https://insights.convoke.bio/unmet-needs
3•snats•18m ago•0 comments

Smart Goals Are Overrated

https://arrowcoaching.net/blog/post.html?slug=smart-goals
1•Joboman555•19m ago•0 comments

My implementation of CVE-2026-31431(CopyFail) in C++, no dependency needed

https://github.com/gbonacini/CVE-2026-31431
1•bg_bg•20m ago•1 comments

Three things in AI to watch, according to a Nobel-winning economist

https://www.technologyreview.com/2026/05/11/1137090/three-things-in-ai-to-watch-according-to-a-no...
2•Brajeshwar•21m ago•0 comments

One engine, many tools – Introducing Rubydex

https://railsatscale.com/2026-05-12-one-engine-many-tools/
1•ufuk•23m ago•0 comments

Ploopy Bean: a trackpoint for every computer

https://ploopy.co/shop/bean-pointing-stick/
2•jibcage•23m ago•1 comments

3D renderings of 142 significant objects at the Metropolitan Museum of Art

https://www.metmuseum.org/art/collection/search?showOnly=has3d
4•bookofjoe•23m ago•0 comments

Android Auto home screen widgets look nearly ready

https://www.androidauthority.com/android-auto-home-widgets-3662452/
1•1970-01-01•25m ago•0 comments

Paper introduces Positive Alignment framework for AI

https://twitter.com/RubenLaukkonen/status/2054215967584944599
1•momentmaker•26m ago•0 comments

US Government Concedes First Amendment Violation in Berenson Settlement

https://foundationforfreedomonline.com/us-government-concedes-first-amendment-violation-in-berens...
2•iamnothere•27m ago•0 comments

Spar [KILL]: AI distribution agent for startups that identifies warm intro paths

https://sparit.vc/spar/d/BPCf3D6XsY
1•endofcoding•33m ago•0 comments

Markdown, the WD-40 of Digital Information

https://hoeijmakers.net/markdown-the-wd-40-of-digital-information/
3•speckx•34m ago•0 comments