Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•8mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extensible across domains (math, retrieval, reasoning, tool use, etc.).

What's in Zeno for now?

- Auditable, stateless reward functions for Python code: docstrings, ruff linting, type hints, recursion, and more.
- Works directly with Hugging Face's TRL or any RL loop: plug reward functions in as needed.
- MIT licensed and minimal.
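To make "stateless, deterministic reward function" concrete, here is a minimal sketch of a docstring reward. It is not taken from Zeno's code: the name `docstring_reward` and the scoring rule (fraction of functions that have a docstring) are assumptions for illustration. The signature follows the convention TRL's GRPOTrainer uses for custom reward functions (a list of completions in, a list of floats out), and it assumes completions arrive as plain strings rather than chat messages.

```python
import ast


def docstring_reward(completions, **kwargs):
    """Score each completion by the fraction of its functions that have a docstring.

    Hypothetical example, not Zeno's API. Stateless and deterministic:
    the same completion always gets the same score.
    """
    rewards = []
    for text in completions:
        try:
            tree = ast.parse(text)
        except (SyntaxError, ValueError):
            # Code that does not parse earns no reward.
            rewards.append(0.0)
            continue

        # Collect every function definition, including nested and async ones.
        funcs = [
            node
            for node in ast.walk(tree)
            if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
        ]
        if not funcs:
            # Nothing to document, so nothing to reward.
            rewards.append(0.0)
            continue

        documented = sum(1 for f in funcs if ast.get_docstring(f))
        rewards.append(documented / len(funcs))
    return rewards
```

A function like this could be passed to a TRL trainer through its `reward_funcs` argument, or called directly from a custom RL loop, since it only needs the generated text.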

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning, and agentic behaviors are on the to-do list.

Repo: https://github.com/think-a-tron/zeno

Docs and more details are in the README.

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.
