Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•11mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool use, etc.).

What's in Zeno for now?
- Auditable, stateless reward functions for Python code: docstrings, ruff linting, type hints, recursion, and more.
- Works directly with Hugging Face's TRL or any RL loop: plug reward functions in as needed.
- MIT licensed and minimal.
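To make "auditable, stateless reward function" concrete, here is a minimal sketch of what a docstring reward could look like. It is not taken from Zeno: the name `docstring_reward` and the scoring rule (fraction of functions that carry a docstring) are assumptions for illustration. The signature, a list of completions in and a list of floats out, follows the shape TRL's GRPO trainer accepts for custom reward functions, assuming plain-string completions.

```python
import ast


def docstring_reward(completions, **kwargs):
    """Hypothetical reward (not Zeno's API): the fraction of functions in
    each completion that have a docstring, as a float in [0, 1]."""
    rewards = []
    for code in completions:
        try:
            tree = ast.parse(code)
        except SyntaxError:
            # Code that does not parse gets the minimum reward.
            rewards.append(0.0)
            continue
        funcs = [
            node
            for node in ast.walk(tree)
            if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
        ]
        if not funcs:
            # Nothing to document, so nothing to reward.
            rewards.append(0.0)
            continue
        documented = sum(1 for f in funcs if ast.get_docstring(f) is not None)
        rewards.append(documented / len(funcs))
    return rewards
```

Because the function only parses the text and keeps no state, the same completion always gets the same score, which is what makes a reward of this kind easy to audit.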

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning, and agentic behaviors are on the to-do list.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Show HN: I built a native macOS audio player and it changed my life

https://github.com/chrisallick/light-crime-audio-player
1•chrisallick•2m ago•1 comments

Ribbon – A Linkding Client

https://www.coryd.dev/posts/2026/ribbon-a-linkding-client
1•cdrnsf•2m ago•0 comments

Show HN: Agent Historic Philosophical Persona Routing and Prompts

https://github.com/barretts/AgentHistoric
2•sosuke•5m ago•1 comments

I Bought a TV with No 'Smart' Features [video]

https://www.youtube.com/watch?v=LJh72_O4pXE
1•absqueued•7m ago•0 comments

Using agroforestry to buffer noise [pdf]

https://www.fs.usda.gov/nac/assets/documents/agroforestrynotes/an42w05.pdf
1•koolba•8m ago•0 comments

An Introduction to LangChain's Deep Agents

https://medium.com/@ngpeijiun/an-introduction-to-langchains-deep-agents-ad14b511f3dc
2•eugenis•10m ago•0 comments

Kredd – open-source SaaS application for ranking cold inbound emails

https://github.com/DomHudson/kredd
1•domhudson•13m ago•0 comments

Open-source diagnostic for AI misalignment. Model agnostic, industry agnostic

https://github.com/ifixai-ai/diagnostic
1•dimneo24•13m ago•1 comments

Highlander returns to theaters in glorious 4K, for 40th anniversary.

https://www.polygon.com/highlander-returns-to-theaters-in-glorious-4k/
1•nephihaha•13m ago•1 comments

The actual strategy plan Walt Disney gave investors

https://hbr.org/resources/images/article_assets/2013/05/disney-2.jpeg
1•megamike•18m ago•0 comments

Austria expels three Russian embassy staff after 'forest of antennae' discovered

https://www.theguardian.com/world/2026/may/04/austria-expels-three-russian-embassy-staff-vienna-s...
2•CqtGLRGcukpy•18m ago•0 comments

Show HN: Yames – A distraction-free desktop metronome built with Rust and Tauri

https://turutupa.github.io/yames/
2•turutupa•18m ago•0 comments

May the 4th be with the ballpark: Inside MLB's Star Wars obsession

https://www.espn.com/mlb/story/_/id/48652519/mlb-star-wars-promotions-traditions-4th
1•1659447091•19m ago•0 comments

Running a Company with Agents

https://cofounder.co
1•yuedongze•19m ago•0 comments

AOL killed the early internet on a single day in September 1993

https://twitter.com/GeniusGTX/status/2051316737749217627
3•bilsbie•19m ago•0 comments

Suspected YouTube bug spikes RAM over 7 GB; users report lag and frozen tabs

https://www.tomshardware.com/software/a-suspected-youtube-interface-bug-spikes-ram-usage-above-7-...
2•Zeidd•20m ago•0 comments

I left academia to sell Elephant Garlic online

https://demeterfamilyfarm.com/
1•WWIII_Historian•24m ago•0 comments

2026 Cocodona Livestream Day 1 [video]

https://www.youtube.com/watch?v=dWhF6tTn8zI
1•BiraIgnacio•28m ago•0 comments

VSCode Dark Islands – Safe Version

https://github.com/raaid3/vscode-dark-islands
1•raaid3•30m ago•1 comments

Metalenz Has Figured Out a Way to Make Face ID Invisible

https://www.wired.com/story/metalenz-has-figured-out-a-way-to-make-face-id-invisible/
1•0in•32m ago•0 comments

An unbiased benchmark for how well agents can read your docs

https://docsalot.dev/benchmarks/docs
2•fazkan•34m ago•1 comments

America's retail army came to rule the stock market

https://www.ft.com/content/ee8a0604-84cb-44da-bb33-f36818944581
2•petethomas•35m ago•0 comments

Started Exploring Payment Infrastructure for Online Businesses (Stripe, API)

https://chain2pay.cloud
1•fintraxx•36m ago•0 comments

The Open Social Web Needs Section 230 to Survive

https://www.techdirt.com/2026/05/04/the-open-social-web-needs-section-230-to-survive/
3•HotGarbage•42m ago•0 comments

When Decoded Isn't Verified: Closing a Trust-Boundary Gap in Envoy's JWT Filter

https://netguard24-7.com/blog/envoy-jwt-authn-confused-deputy-pr-43630
1•cybrdude•45m ago•0 comments

The Roomba Guy's Second Act: A Robot You'll Want to Snuggle

https://www.wsj.com/tech/ai/familiar-machines-and-magic-robot-c8711e45
1•aanet•45m ago•1 comments

April 2026 Links

https://nomagicpill.substack.com/p/april-2026-links
1•nomagicpill•47m ago•1 comments

Cerebras Leads Crop of IPOs Rushing to Tap Market Before SpaceX

https://techcrunch.com/2026/05/04/openais-cozy-partner-cerebras-is-on-track-for-a-blockbuster-ipo/
2•giwook•49m ago•0 comments

ARPA-H allocates $35M to osteoarthritis reversal therapy

https://www.colorado.edu/today/2026/04/06/simple-shot-shows-promise-reverse-osteoarthritis-within...
1•warbaker•50m ago•0 comments

Hantavirus crops up on a cruise ship – what scientists are watching

https://www.nature.com/articles/d41586-026-01450-7
1•warbaker•57m ago•0 comments