Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•12mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extensible across domains (math, retrieval, reasoning, tool use, etc.).

What's in Zeno for now?
- Auditable, stateless reward functions for Python code: docstrings, ruff linting, type hints, recursion, and more
- Works directly with Hugging Face's TRL or any RL loop: plug reward functions in as needed
- MIT licensed and minimal
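To make "verifiable, stateless reward function" concrete, here is a minimal sketch of a docstring reward. It is not Zeno's actual API: the name `docstring_reward` and the scoring rule (fraction of functions that carry a docstring) are assumptions made for illustration, and it uses only the Python standard library.

```python
import ast


def docstring_reward(completions, **kwargs):
    """Score each completion by the fraction of its functions that have a docstring.

    Hypothetical example, not taken from Zeno. `completions` is assumed to be
    a list of Python source strings; extra keyword arguments are ignored.
    """
    rewards = []
    for code in completions:
        try:
            tree = ast.parse(code)
        except (SyntaxError, ValueError):
            # Code that does not parse gets the minimum reward.
            rewards.append(0.0)
            continue

        # Collect every function definition, including nested and async ones.
        funcs = [
            node
            for node in ast.walk(tree)
            if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
        ]
        if not funcs:
            # Nothing to document, so nothing to reward.
            rewards.append(0.0)
            continue

        documented = sum(1 for f in funcs if ast.get_docstring(f))
        rewards.append(documented / len(funcs))
    return rewards
```

The function is deterministic and keeps no state between calls, so the same completion always gets the same score and the score can be audited by reading the code. TRL's GRPOTrainer documents a `reward_funcs` argument for custom callables of roughly this shape; check the current TRL docs for the exact signature it expects before wiring this in.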

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning, and agentic behaviors are on the to-do list.

Repo: https://github.com/think-a-tron/zeno

Docs and more details are in the README.

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.
