frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•7mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Temperature Effects in Watches

https://www.vintagewatchstraps.com/temperatureeffects.php
1•pillars•29s ago•0 comments

VLLM Large Scale Serving: DeepSeek 2.2k Tok/S/H200 with Wide-EP

https://blog.vllm.ai/2025/12/17/large-scale-serving.html
1•robertnishihara•36s ago•0 comments

Show HN: I built Gridfy – live website widgets from Airtable, Notion and Sheets

https://gridfy.io
1•jumagrande•1m ago•0 comments

Show HN: I Will Do Whatever to Get Primeagen to My Hackathon Stream

https://vibe.devpost.com/
1•abdibrokhim•2m ago•0 comments

Show HN: Term.stream – Stream your terminal to any device via URL

https://term.stream
1•zero_dev•4m ago•0 comments

Show HN: RSS Reader using the browser's local storage

https://github.com/travisred/rss-local-storage
1•travisr•5m ago•0 comments

Show HN: WordsUnite – Synchronized Crowd Chants at Scale

https://wordsunite.us/
1•wordsunite•5m ago•0 comments

Tools for AI Collaboration Are a Different Design Problem

https://michaelhegner.com/blog/tools-for-ai-collaboration-are-a-different-design-problem
1•shellDev•5m ago•0 comments

Apple Creator Studio

https://www.apple.com/apple-creator-studio/
1•davidbarker•7m ago•0 comments

Every GitHub Object Has Two IDs

https://www.greptile.com/blog/github-ids
1•dakshgupta•8m ago•0 comments

Show HN: 26x speedup on BitNet sparse ops with AVX-512 and 2-bit encoding

https://github.com/microsoft/BitNet/pull/365
1•HyperFoldUK•8m ago•0 comments

Nuclear startups are back in vogue with small reactors, and big challenges

https://techcrunch.com/2026/01/11/nuclear-startups-are-back-in-vogue-with-small-reactors-and-big-...
1•rbanffy•9m ago•0 comments

Virtual fireside chat with OllyGarden co-founder and CEO Juraci PaixãO Kröhling

https://chinstrap.community/fireside-chats/juraci-krohling/
1•reedciccio•10m ago•0 comments

Stack Overflow's AI Assist Powered by OpenAI

https://stackoverflow.com/ai-assist
2•Abimelex•10m ago•0 comments

Dilbert Principle

https://en.wikipedia.org/wiki/Dilbert_principle
1•tosh•10m ago•0 comments

What a year of solar and batteries saved us in 2025

https://scotthelme.co.uk/what-a-year-of-solar-and-batteries-really-saved-us-in-2025/
6•MattSayar•11m ago•0 comments

Target's Internal GitHub Repositories Exposed

https://www.bleepingcomputer.com/news/security/targets-dev-server-offline-after-hackers-claim-to-...
1•andiareso•11m ago•0 comments

Show HN: LintPage – Catches SEO issues on staging sites before you deploy

1•orzmar•12m ago•0 comments

Redesign Our Site Identity

https://www.ruby-lang.org/en/news/2025/12/22/redesign-site-identity/
1•amalinovic•12m ago•0 comments

What If Your AI Never Forgot? The Claude 4 Memory Experiment

https://www.gptfrontier.com/what-if-your-ai-never-forgot-the-claude-4-memory-experiment/
1•ssengupta3•12m ago•0 comments

Google Chrome Will Drop macOS Monterey Support with Version 150

https://www.macobserver.com/news/google-chrome-will-drop-macos-monterey-support-with-version-150/
1•bookofjoe•13m ago•0 comments

Stop Gatekeeping Referrals

https://af-dev.com/blog/stop-gatekeeping-referrals
1•_adev•13m ago•0 comments

Best Practices for Coding with Agents

https://cursor.com/blog/agent-best-practices
1•gmays•14m ago•0 comments

Jed Baker's podcast media co acquired by SuperAwesome

https://deadline.com/2026/01/jed-baker-podcast-starglow-media-acquired-superawesome-1236677686/
1•dylancollins•16m ago•0 comments

Southern states hate Leo: 2024 naming trends by state, region, and politics

https://three-things.medium.com/three-southern-states-really-hate-the-name-leo-9c97a093022a
1•murph314•16m ago•0 comments

Worktrunk, autoclaude and AskUserQuestion – Claude Code workflow

https://henryaj.substack.com/p/my-claude-code-workflow
1•henryaj•17m ago•0 comments

Show HN: Meter – web scraper that syncs changes & bypasses antibot

https://www.meter.sh/
2•hankwilliamsjr•18m ago•0 comments

Elsevier threatens others for linking to Sci-Hub but does so itself

https://eve.gd/2019/08/03/elsevier-threatens-others-for-linking-to-sci-hub-but-does-it-itself/
2•fanf2•18m ago•0 comments

I'm Betting That OpenAI Will Go Broke

https://www.nytimes.com/2026/01/13/opinion/openai-ai-bubble-financing.html
3•oppodeldoc•18m ago•0 comments

You Need a Kitchen Slide Rule

https://entropicthoughts.com/kitchen-slide-rule
2•aebtebeten•19m ago•0 comments