Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•1y ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extensible across domains (math, retrieval, reasoning, tool use, etc.).

What's in Zeno for now?
- Auditable, stateless reward functions for Python code: docstrings, ruff linting, type hints, recursion, and more.
- Works directly with Hugging Face's TRL or any RL loop: plug reward functions in as needed.
- MIT licensed and minimal.
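To make "stateless, deterministic reward function" concrete, here is a minimal sketch of what a docstring and type-hint check could look like. It is not taken from Zeno: the function name `docstring_and_hints_reward` and the 50/50 weighting are illustrative assumptions, and it assumes each completion is a plain string of Python source. The `(completions, **kwargs) -> list of floats` shape follows the convention TRL's trainers use for custom reward callables, as far as I understand it.

```python
import ast


def docstring_and_hints_reward(completions, **kwargs):
    """Score each completion in [0, 1] by docstring and type-hint coverage.

    Hypothetical helper for illustration only; not part of Zeno's API.
    """
    rewards = []
    for code in completions:
        try:
            tree = ast.parse(code)
        except (SyntaxError, ValueError):
            rewards.append(0.0)  # code that does not parse earns nothing
            continue

        funcs = [
            node
            for node in ast.walk(tree)
            if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
        ]
        if not funcs:
            rewards.append(0.0)  # nothing to check
            continue

        # Fraction of functions that carry a docstring.
        documented = sum(ast.get_docstring(f) is not None for f in funcs)
        # Fraction of functions with a return annotation and fully
        # annotated positional parameters.
        hinted = sum(
            f.returns is not None
            and all(arg.annotation is not None for arg in f.args.args)
            for f in funcs
        )
        rewards.append(0.5 * documented / len(funcs) + 0.5 * hinted / len(funcs))
    return rewards
```

Because the score depends only on the parsed source, the same completion always gets the same reward, and no generated code is executed while scoring.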

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning, and agentic behaviors are on the to-do list.

Repo: https://github.com/think-a-tron/zeno

Docs and more details are in the README.

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.
