Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•1y ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

The initial release focuses on Python code generation, but the goal is broader: make RL reward design for LLMs transparent, modular, and extensible across domains (math, retrieval, reasoning, tool use, etc.).

What's in Zeno for now?
- Auditable, stateless reward functions for Python code: docstrings, ruff linting, type hints, recursion, and more.
- Works directly with Hugging Face's TRL or any RL loop: plug reward functions in as needed.
- MIT licensed and minimal.
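
To give a feel for what a stateless, deterministic reward for Python code can look like, here is a minimal sketch. It is not Zeno's actual API: the name `docstring_reward` and the scoring rule (the fraction of functions that carry a docstring) are assumptions made for illustration. The signature follows the general shape TRL's GRPO trainer accepts for custom reward functions, a callable that takes `completions` plus keyword arguments and returns one float per completion, and it assumes completions arrive as plain strings.

```python
import ast


def docstring_reward(completions, **kwargs):
    """Score each completion by the fraction of its functions that have a docstring.

    Hypothetical example, not taken from Zeno. Returns one float in [0, 1]
    per completion; unparsable code or code without functions scores 0.0.
    """
    rewards = []
    for code in completions:
        try:
            tree = ast.parse(code)
        except (SyntaxError, ValueError):
            # Code that does not parse earns no reward.
            rewards.append(0.0)
            continue

        # Collect every (sync or async) function definition in the module.
        funcs = [
            node
            for node in ast.walk(tree)
            if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
        ]
        if not funcs:
            rewards.append(0.0)
            continue

        documented = sum(1 for f in funcs if ast.get_docstring(f))
        rewards.append(documented / len(funcs))
    return rewards
```

Because the function depends only on its input text, the same completion always gets the same score, which is what makes this kind of reward easy to audit and to unit-test outside a training loop.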

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning, and agentic behaviors are on the to-do list.

Repo: https://github.com/think-a-tron/zeno

Docs and more details are in the README.

Comments, critiques, and real-world use cases are welcome, especially if you want to push beyond code.
