frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•8mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Unsloth: GLM-4.7-Flash

https://unsloth.ai/docs/models/glm-4.7-flash
1•tosh•2m ago•0 comments

IP Addresses Through 2025

https://www.potaroo.net/ispcol/2026-01/addr2025.html
2•petercooper•3m ago•0 comments

You Only Have 47 Seconds

https://thinkingrock.substack.com/p/you-only-have-47-seconds
1•7777777phil•3m ago•0 comments

What is product development in 2026?

https://cory.news/posts/2026-01-20-development/
1•callumprentice•3m ago•0 comments

Telegram counts EU users under the Digital Services Act

https://www.ftm.eu/articles/telegram-plays-down-user-figures-to-avoid-stricter-eu-rules-documents...
1•evevandermeer•4m ago•0 comments

Show HN: BlueMouse – AI Code Generator with 17-Layer Validation

https://github.com/peijun1700/bluemouse
1•bluemouse_ai•4m ago•0 comments

Why Linear Chat is failing us

https://obliqueangles.substack.com/p/why-linear-chat-is-failing-us
1•TomBers•6m ago•0 comments

Sutskever 30 Essential Papers:Complete NumPy Implementations with Visualizations

https://github.com/pageman/sutskever-30-implementations
2•pajop•6m ago•0 comments

Show HN: Havyl OS – Decide what to do based on your energy

https://havyl.com
2•limionaider•6m ago•1 comments

Polar weather on Jupiter and Saturn hints at the planets' interior details

https://news.mit.edu/2026/polar-weather-jupiter-saturn-hints-planets-interior-details-0119
1•el_duderino•9m ago•0 comments

Show HN: Unfault – A CLI and LSP for code orientation

https://unfault.dev
1•sylvain-h•10m ago•1 comments

How to Build the Life You Want: 3 Takeaways

https://www.mindbodydad.com/mind/build-the-life-you-want
1•Olshansky•10m ago•0 comments

Apple vs. the AI Hype Cycle

https://ericlamb.substack.com/p/apple-vs-the-ai-hype-cycle
1•ericlamb89•13m ago•0 comments

Amazon Ion

https://amazon-ion.github.io/ion-docs/
2•tosh•14m ago•0 comments

You shouldn't trust data collected on MTurk

https://osf.io/preprints/psyarxiv/zs6pk_v1
1•speckx•15m ago•0 comments

Banana Pro – Nano Banana Pro 4K AI Image Generator

https://www.banana-pro.com
1•amierhan•16m ago•0 comments

Show HN: I created Wiz, personal AI agent with Claude Code

https://thoughts.jock.pl/p/wiz-personal-ai-agent-claude-code-2026
1•joozio•17m ago•0 comments

The Zen of Reticulum

https://github.com/markqvist/Reticulum/blob/master/Zen%20of%20Reticulum.md
4•mikece•20m ago•1 comments

Trump Shares Map of US Including Greenland, Canada, Venezuela

https://www.newsweek.com/trump-shares-map-of-us-including-greenland-canada-venezuela-11384438
6•djkivi•20m ago•2 comments

Huge amounts of extra land needed for RFK Jr's meat-heavy diet guidelines

https://www.theguardian.com/environment/2026/jan/20/rfk-jr-trump-meat-diet-guidelines-land
3•ndsipa_pomu•24m ago•1 comments

Show HN: Tycostream – turn Materialize views into real-time GraphQL APIs

https://github.com/tycoworks/tycostream
2•chrisanderson85•24m ago•0 comments

Going to write 1.000.000 lines of code for community projects

https://onemillionlines.com/
2•websku•24m ago•1 comments

Why Your European Business Is Probably Breaking GDPR Law

https://blog.please-open.it/posts/cloud-act-gdpr/
3•mathieupassenau•24m ago•1 comments

How Greenland keeps its eye on independence [pdf]

https://isonomiaquarterly.com/wp-content/uploads/2025/11/iq-3.4-zellen-greenland.pdf
1•brandonlc•24m ago•0 comments

Special Address by President von Der Leyen at the World Economic Forum

https://ec.europa.eu/commission/presscorner/detail/en/speech_26_150
2•armcat•30m ago•0 comments

Concurrent Validity of 16 Commercial Photoplethysmographic Heart Rate Monitors

https://www.mdpi.com/2076-3417/16/1/126
2•PaulHoule•32m ago•0 comments

Creatures in Higher Dimensions [video]

https://www.youtube.com/watch?v=349r0xJFGNw
1•surprisetalk•32m ago•0 comments

Snow Simulation Toy

https://potch.me/2026/snow-simulation-toy.html
2•surprisetalk•32m ago•0 comments

Show HN: Coni – Trust-first Claude Cowork-style agent with permission prompts

https://github.com/coni-ai/coni
1•lime66•33m ago•2 comments

Uca High School Nationals

https://x.com/ucanhscc
1•notgoodme•33m ago•0 comments