
Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•6mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now?
- Auditable, stateless reward functions for Python code: docstrings, ruff linting, type hints, recursion, and more.
- Works directly with Hugging Face's TRL or any RL loop; plug reward functions in as needed.
- MIT licensed and minimal.
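
To make "stateless, deterministic reward function" concrete, here is a minimal sketch of a docstring check. It is not taken from Zeno; the name `docstring_reward` and the scoring rule (fraction of functions that carry a docstring) are assumptions for illustration. The signature assumes the common convention of receiving a list of plain-string completions plus extra keyword arguments and returning one float per completion.

```python
import ast


def docstring_reward(completions, **kwargs):
    """Score each completion by the fraction of its functions that have a docstring.

    Hypothetical example, not Zeno's API. Assumes each completion is a plain
    string of Python source. Unparseable code and code with no functions get 0.0.
    """
    rewards = []
    for code in completions:
        try:
            tree = ast.parse(code)
        except (SyntaxError, ValueError):
            # Code that does not parse earns no reward.
            rewards.append(0.0)
            continue

        # Collect every function definition, including nested and async ones.
        funcs = [
            node
            for node in ast.walk(tree)
            if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
        ]
        if not funcs:
            rewards.append(0.0)
            continue

        documented = sum(1 for f in funcs if ast.get_docstring(f) is not None)
        rewards.append(documented / len(funcs))
    return rewards
```

The function depends only on its inputs and uses no randomness or external state, so the same completion always gets the same score, which is what makes a reward like this easy to audit.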

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning, and agentic behaviors are on the to-do list.

Repo: https://github.com/think-a-tron/zeno

Docs and more details are in the README.

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.
