
Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•8mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool use, etc.).

What's in Zeno for now?
- Auditable, stateless reward functions for Python code: docstrings, ruff linting, type hints, recursion, and more.
- Works directly with Hugging Face's TRL or any RL loop: plug reward functions in as needed.
- MIT licensed and minimal.
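
To make "stateless, deterministic reward function" concrete, here is a minimal sketch of a docstring reward. It is not Zeno's actual API: the name `docstring_reward` and the scoring rule (fraction of functions with a docstring) are assumptions made for illustration, and it assumes each completion is a plain string of Python source.

```python
import ast


def docstring_reward(completions, **kwargs):
    """Score each completion by the fraction of its functions that have a docstring.

    Hypothetical example, not taken from Zeno. Assumes every completion is a
    plain string of Python source. The same input always yields the same score.
    """
    rewards = []
    for source in completions:
        try:
            tree = ast.parse(source)
        except (SyntaxError, ValueError):
            # Code that does not parse earns no reward.
            rewards.append(0.0)
            continue

        # Collect every function definition, including nested and async ones.
        functions = [
            node
            for node in ast.walk(tree)
            if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
        ]
        if not functions:
            rewards.append(0.0)
            continue

        documented = sum(1 for fn in functions if ast.get_docstring(fn))
        rewards.append(documented / len(functions))
    return rewards
```

The `(completions, **kwargs)` signature returning one float per completion follows the shape TRL's `GRPOTrainer` documents for custom callables passed via `reward_funcs`, so a function like this should slot in there, though that wiring is untested here. Because it only parses the source and never executes it, the score is cheap to compute and easy to audit.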

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning, and agentic behaviors are on the to-do list.

Repo: https://github.com/think-a-tron/zeno

Docs and more details are in the README.

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.
