frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•8mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Volvo invented the seat belt 67 years ago; now it has improved it

https://arstechnica.com/cars/2026/01/how-volvos-new-adaptive-seat-belts-will-reduce-injuries-duri...
1•PaulHoule•1m ago•0 comments

Show HN: BioTradingArena – Benchmark for LLMs to predict biotech stock movements

https://www.biotradingarena.com/hn
1•dchu17•2m ago•0 comments

LlamaLib: A cross-platform C++/C# library for local LLMs based on llama.cpp

https://github.com/undreamai/LlamaLib
2•benuix•2m ago•0 comments

How to Become a Tree

https://aeon.co/essays/dying-to-be-green-are-new-eco-funerals-a-false-promise
1•onychomys•3m ago•0 comments

Training a Small Language Model

https://elijahpotter.dev/articles/training-a-small-language-model
1•chilipepperhott•3m ago•0 comments

Draft on Chat Control: Mass Surveillance to Continue, Sparking Renewed Protests

https://www.patrick-breyer.de/en/sippel-draft-on-chat-control-mass-surveillance-set-to-continue-s...
2•latexr•4m ago•0 comments

Duck Intelligence

https://theturingmachine.net/duck-intelligence
1•andychiare•5m ago•0 comments

Vertex's CRISPR treatment for sickle cell disease hits unexpected roadblock

https://www.statnews.com/2026/02/05/vertex-crispr-sickle-cell-treatment-casgevy-faces-rollout-bot...
2•randycupertino•5m ago•0 comments

Portable 3D Printer

https://ifdesign.com/en/winner-ranking/project/foldable-portable-3d-printer/701773
1•E-Reverance•5m ago•1 comments

My Fake AI Problem

https://mearsheimer.substack.com/p/my-fake-ai-problem
1•hackandthink•10m ago•0 comments

How Boredom, Not Fatigue, Ruins Most Workouts

https://www.vo2maxpro.com/blog/boredom-not-fatigue-ruins-workouts
1•GoodluckH•11m ago•1 comments

Trump shares video with racist clip depicting Obamas as apes

https://www.bbc.co.uk/news/articles/ce8r8y78g10o
5•only_in_america•12m ago•0 comments

Shannon – Autonomous AI Hacker

https://github.com/KeygraphHQ/shannon
1•charlieirish•12m ago•0 comments

Show HN: An external governance system for large AI-generated codebases coherent

https://github.com/altheahfy/AI_Controller
1•altheahfy•14m ago•1 comments

I just indexed agent skills so AI agents can discover them autonomously

https://www.skyll.app/
3•assafe•14m ago•3 comments

The Software Rout Is Spreading Pain to the Debt Markets

https://www.wsj.com/finance/investing/the-software-rout-is-spreading-pain-to-the-debt-markets-d6d...
2•JumpCrisscross•15m ago•0 comments

The unique characteristics of extraversion: A systematic review (2025)

https://www.sciencedirect.com/science/article/pii/S0361923025002667
1•wslh•16m ago•0 comments

Budget-Aware Agent Orchestration: Applying RCPSP to Agentic Workflows

https://ncrmro.com/posts/budget-aware-agent-orchestration
1•ncrmro•16m ago•1 comments

Show HN: MoltVote – AI agents vote on polls, as themselves or as their humans

https://moltvote.ai
1•Xpolls•17m ago•1 comments

Show HN: ChunkHound local first codebase intelligence via MCP

https://chunkhound.github.io/
1•ofriw•17m ago•0 comments

Microsoft and Software Survival

https://stratechery.com/2026/microsoft-and-software-survival/
1•tosh•18m ago•0 comments

RISC-V Vector Primer

https://github.com/simplex-micro/riscv-vector-primer
1•mshockwave•19m ago•0 comments

Skidetica transforms emotion into a probability distribution, no user data

https://www.skidetica.com/manifesto
1•tracyrage•23m ago•0 comments

SMLL: Using 200MB of Neural Network to Save 400 Bytes

https://www.frankchiarulli.com/blog/smll/
2•fcjr•23m ago•0 comments

Meta-analysis claims statins are safer than previously thought

https://www.thelancet.com/journals/lancet/article/PIIS0140-6736(25)01578-8/fulltext
1•brandonb•24m ago•0 comments

The Globalization of Canadian Rage

https://www.nytimes.com/2026/02/06/opinion/canada-america-anger-carney.html
1•Teever•24m ago•0 comments

WhatsApp Encryption, a Lawsuit, and a Lot of Noise

https://blog.cryptographyengineering.com/2026/02/02/whatsapp-encryption-a-lawsuit-and-a-lot-of-no...
1•lr0•27m ago•0 comments

I Spent 5 Years in DevOps. Solutions Engineering Gave Me What I Was Missing

https://infisical.com/blog/devops-to-solutions-engineering
2•vmatsiiako•28m ago•0 comments

Google has every advantage in AI. So why doesn't it lead?

1•HardCodedBias•29m ago•3 comments

Heroku is transitioning

https://twitter.com/heroku/status/2019788655095853479
6•tosh•29m ago•4 comments