frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•8mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Anna's Archive perd son domaine en .org mais reste debout

https://korben.info/annas-archive-domaine-org-suspendu.html
1•frunkp•4m ago•0 comments

Show HN: FlowWatch – Decorator-first file watcher for Python workflows

https://github.com/MichielMe/flowwatch
1•michielme•4m ago•0 comments

Anthropic writes Constitution for Claude it thinks will soon be proven misguided

https://www.theregister.com/2026/01/22/anthropic_claude_constitution/
1•beardyw•4m ago•0 comments

The Data Box; Why "Smarter" AI Feels Dumber

https://blog.nimbial.com/pages/the_data_box
1•ajayarama•6m ago•0 comments

Erdős Problem #347 Solved (AI assisted math)

https://www.erdosproblems.com/forum/thread/347
1•tzury•8m ago•0 comments

Designing an Authentication System: A Dialogue in Four Scenes (1997)

https://web.mit.edu/kerberos/www/dialogue.html
1•vismit2000•15m ago•0 comments

Oldest cave painting could rewrite human creativity timeline

https://www.bbc.com/news/articles/czx1pnlzer5o
1•griffzhowl•21m ago•0 comments

Anthropic's new Claude 'constitution': be helpful, and don't destroy humanity

https://www.theverge.com/ai-artificial-intelligence/865185/anthropic-claude-constitution-soul-doc
1•xparadigm•22m ago•0 comments

Starlink in Iran: How the regime jams the service and what helps against it

https://www.heise.de/en/background/Starlink-in-Iran-How-the-regime-jams-the-service-and-what-help...
2•DeathArrow•30m ago•0 comments

Semantica: Open-source semantic layers, knowledge graphs, and GraphRAG

https://github.com/Hawksight-AI/semantica
2•kaifahmad1•34m ago•1 comments

New Security Vulnerability Database Launches in the EU

https://www.forbes.com/sites/kateoflahertyuk/2026/01/20/new-security-vulnerability-database-launc...
2•cedricbonhomme•36m ago•1 comments

Why Greenland Looks (It's Not) [video]

https://www.youtube.com/watch?v=tK7yTJ8Mk7A
1•handfuloflight•40m ago•0 comments

Graph of All Human Languages

https://dr.eamer.dev/datavis/poems/language/network.html
3•samwho•41m ago•0 comments

Mixing incentives and penalties found key to cutting carbon emissions long term

https://phys.org/news/2025-12-incentives-penalties-key-carbon-emissions.html
1•PaulHoule•41m ago•0 comments

With this tool, you can enjoy NAS functionality even without a NAS

https://quicksend.chat/
1•foodhome•43m ago•0 comments

The Tighter Weave: On Editing and Not Editing

https://hedgehogreview.com/issues/place-and-revolution/articles/the-tighter-weave
1•samclemens•44m ago•0 comments

OpenSkills – Stop bloating your LLM context with unused agent instructions

1•twwch•44m ago•0 comments

Rare Data Hunters [video]

https://www.youtube.com/watch?v=IU4ByUbDKNc
1•DiscourseFan•47m ago•0 comments

Video for ROS2

https://github.com/stryngs/rosVid
1•stryngs42•50m ago•1 comments

We are updating Dokploy's Open Source license

https://dokploy.com/blog/we-are-updating-dokploys-open-source-license
1•raybb•59m ago•1 comments

Show HN: Scribefully is a portfolio/HN-style community for academics & pros

https://scribefully.com/
1•hoag•1h ago•0 comments

CAP theorem: Why Pick Two Misses the Point

https://www.blog.ahmazin.dev/p/cap-theorem-explained
1•artmonk•1h ago•0 comments

US science after a year of Trump

https://www.nature.com/immersive/d41586-026-00088-9/index.html
6•newman314•1h ago•0 comments

Blue4est Paper – BPA-Free Thermal Print Camera Compendium

https://thermalprintcameras.wordpress.com/blue4est-paper/
1•walterbell•1h ago•0 comments

Ask HN: Why does Google Maps still use mercator projection?

2•hbarka•1h ago•1 comments

Show HN: Aident, agentic automations as plain-English playbooks

https://aident.ai/
4•ljhskyso7•1h ago•0 comments

Why AGI Would Shape Humanity in the Shadows the Revelation Trap

1•unspokenlayer•1h ago•0 comments

Governance in the Age of AI, Nuclear Threats, and Geopolitical Brinkmanship [video]

https://www.youtube.com/watch?v=XACETcmQAeM
1•measurablefunc•1h ago•0 comments

Ask HN: Is there any good open source model with reliable agentic capabilities?

1•baalimago•1h ago•0 comments

Show HN: MCP server for searching and retrieving 200k icons

https://github.com/better-auth/better-icons
2•bekacru•1h ago•0 comments