frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•6mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Apple's chief chip architect has reportedly talked to Tim Cook about leaving

https://www.tomshardware.com/tech-industry/apples-chief-chip-architect-for-the-last-decade-has-re...
1•pseudolus•4m ago•0 comments

Asynchronous Circuit

https://en.wikipedia.org/wiki/Asynchronous_circuit
1•DustinEchoes•8m ago•0 comments

Programming Party Tricks [video]

https://www.youtube.com/watch?v=4KdvcQKNfbQ
1•marvinborner•8m ago•0 comments

Lessons Learned After Trying MeshCore for Off-Grid Text Messaging

https://hackaday.com/2025/12/06/lessons-learned-after-trying-meshcore-for-off-grid-text-messaging/
2•lxm•15m ago•0 comments

The Authentication Rabbit Hole: What I Learned from Vibe-Coding Auth with AI

https://fusionauth.io/blog/vibe-coding-authentication
2•mooreds•17m ago•0 comments

Show HN: Enterprise ad-blocker and privacy guard

https://zen.irbis.sh/enterprise
2•anfragment•27m ago•0 comments

Claude Diary

https://rlancemartin.github.io/2025/12/01/claude_diary/
2•aratahikaru5•32m ago•1 comments

HiRTOS: A high-integrity multi-core RTOS kernel written in SPARK Ada

https://github.com/jgrivera67/HiRTOS
2•jacques_chester•37m ago•0 comments

Multiplying our way out of division

https://xania.org/202512/07-division-again
2•ibobev•41m ago•0 comments

Show HN: I replaced my premium workout app with vibecode

https://strengthquest.lovable.app/
2•maddmann•41m ago•0 comments

NY judge orders ChatGPT conversation handover in newspaper copyright win

https://www.nydailynews.com/2025/12/03/ny-judge-orders-openai-to-hand-over-chatgpt-conversations-...
4•gnabgib•50m ago•0 comments

Oath of the Horatii

https://en.wikipedia.org/wiki/Oath_of_the_Horatii
1•andsoitis•51m ago•0 comments

AI chatbots can sway voters better than political advertisements

https://www.technologyreview.com/2025/12/04/1128824/ai-chatbots-can-sway-voters-better-than-polit...
1•gnabgib•54m ago•1 comments

Linux GPIB Drivers Declared Stable 53 Years After HP Introduced the Bus

https://www.phoronix.com/news/GPIB-De-Staged-Linux-6.19
3•LorenDB•54m ago•1 comments

Spinlocks vs. Mutexes: When to Spin and When to Sleep

https://howtech.substack.com/p/spinlocks-vs-mutexes-when-to-spin
21•birdculture•58m ago•1 comments

What Folk Can Do

https://folk.computer/guides/what-folk-can-do
3•luu•59m ago•1 comments

List of Common Misconceptions (Wikipedia)

https://en.wikipedia.org/wiki/List_of_common_misconceptions
3•greazy•1h ago•0 comments

Energy efficiency task scheduling algorithm for multi-core embedded platforms

https://www.sciencedirect.com/science/article/abs/pii/S0045790625008298
1•stevenjgarner•1h ago•1 comments

A Look into NASA's Coding Philosophy (2017)

https://observer.com/2017/07/a-look-into-nasa-coding-philosophy-kennedy-space-center-programming/
2•kristianp•1h ago•0 comments

Toyota Unintended Acceleration and the Big Bowl of "Spaghetti" Code(2013)

https://www.safetyresearch.net/toyota-unintended-acceleration-and-the-big-bowl-of-spaghetti-code/
3•SoKamil•1h ago•1 comments

The Ilya Sutskever interview – my key takeaways

https://quickchat.ai/post/ilya-sutskever-interview
2•piotrgrudzien•1h ago•0 comments

Show HN: Cdecl-dump - represent C declarations visually

https://github.com/bbu/cdecl-dump
4•bluetomcat•1h ago•0 comments

An Attempt at a Compelling Articulation of Forth's Practical Strengths and Eter

https://im-just-lee.ing/forth-why-cb234c03.txt
2•todsacerdoti•1h ago•0 comments

Algebraic Constraints [pdf]

http://reports-archive.adm.cs.cmu.edu/anon/scan/CMU-CS-83-132.pdf
1•andsoitis•1h ago•0 comments

A Grand Social Media Experiment Begins in Australia

https://www.nytimes.com/2025/12/07/world/asia/australia-social-media-ban-under-16.html
1•apparent•1h ago•0 comments

The era of jobs is ending

https://www.thepavement.xyz/p/the-era-of-jobs-is-ending
4•SturgeonsLaw•1h ago•4 comments

India's request for satellite-aided iPhone location data is a privacy nightmare

https://appleinsider.com/articles/25/12/06/indias-request-for-satellite-aided-iphone-location-dat...
3•walterbell•1h ago•0 comments

Network extensible Window System (1986)

https://en.wikipedia.org/wiki/NeWS
1•hbbio•1h ago•1 comments

Show HN : WealthYogi - Net worth Tracker

https://apps.apple.com/gb/app/wealthyogi-net-worth-tracker/id6753881658
3•aalbatross•1h ago•0 comments

Show HN: LLM-Powered Log Analysis Wrapper (Python)

https://github.com/IncidentAI-Dev/incident-summarizer-wrapper/blob/main/README.md
1•joe117•1h ago•1 comments