Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•6mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extensible across domains (math, retrieval, reasoning, tool use, etc.).

What's in Zeno for now?

- Auditable, stateless reward functions for Python code: docstrings, ruff linting, type hints, recursion, and more.
- Works directly with Hugging Face's TRL or any RL loop: plug reward functions in as needed.
- MIT licensed and minimal.
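To make "stateless, deterministic reward function" concrete, here is a minimal sketch of what such checks could look like. It is not Zeno's actual API: the names `docstring_reward` and `type_hint_reward` are hypothetical, and the `(completions, **kwargs) -> list[float]` shape is an assumption modeled on the callable reward functions TRL's GRPO trainer accepts, with completions assumed to be plain strings.

```python
import ast


def _function_defs(source: str) -> list[ast.FunctionDef]:
    """Return the function definitions in `source`, or [] if it does not parse."""
    try:
        tree = ast.parse(source)
    except SyntaxError:
        return []
    return [node for node in ast.walk(tree) if isinstance(node, ast.FunctionDef)]


def docstring_reward(completions: list[str], **kwargs) -> list[float]:
    """Hypothetical reward: fraction of functions that carry a docstring."""
    rewards = []
    for text in completions:
        funcs = _function_defs(text)
        if not funcs:
            # Unparseable code or no functions at all earns nothing.
            rewards.append(0.0)
            continue
        documented = sum(1 for f in funcs if ast.get_docstring(f) is not None)
        rewards.append(documented / len(funcs))
    return rewards


def type_hint_reward(completions: list[str], **kwargs) -> list[float]:
    """Hypothetical reward: 1.0 if every function is fully annotated, else 0.0.

    Only plain positional parameters and the return annotation are checked.
    """
    rewards = []
    for text in completions:
        funcs = _function_defs(text)
        fully_typed = bool(funcs) and all(
            f.returns is not None
            and all(arg.annotation is not None for arg in f.args.args)
            for f in funcs
        )
        rewards.append(1.0 if fully_typed else 0.0)
    return rewards
```

Both functions depend only on their input text, so the same completion always gets the same score and each score can be audited by reading a few lines of code. In a TRL setup they would presumably be passed together as the trainer's list of reward functions.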

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning, and agentic behaviors are on the to-do list.

Repo: https://github.com/think-a-tron/zeno

Docs and more details are in the README.

Comments, critiques, and real-world use cases are encouraged, especially if you want to push beyond code.
