frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•8mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Bypassing VSCode Copilot's Premium Requests

https://dganev.com/posts/2026-01-24-bypassing-copilot-requests/
1•syl5x•40s ago•1 comments

The Guardian view on Europe's payments problem: sovereignty starts at the till

https://www.theguardian.com/commentisfree/2026/jan/25/the-guardian-view-on-europes-payments-probl...
1•mcc1ane•1m ago•0 comments

Show HN: I used my book generator to generate a catalog of books it can generate

https://www.ebook-forge.com/Omni
1•lywald•1m ago•1 comments

Show HN: Forward My Inbox – IMAP‑to‑Gmail Forwarding After Gmail Kills POP3

https://forwardmyinbox.com/
1•moshetanzer•4m ago•0 comments

Hypergrowth Isn't Always Easy

https://tailscale.com/blog/hypergrowth-isnt-always-easy
1•tosh•6m ago•0 comments

New study disrupts the narrative that ChatGPT's launch triggered a job decline

https://the-decoder.com/new-study-disrupts-the-narrative-that-chatgpts-launch-triggered-a-job-dec...
1•Vaslo•6m ago•0 comments

Life Map

https://lifemap.mattrighetti.com/
1•akktor•7m ago•0 comments

Agent Index: Building a "Tiobe Index" for AI Coding Agents (January Survey)

https://agentic-coding-survey.pages.dev/
1•7777777phil•7m ago•1 comments

The behavioral cost of personalized pricing

https://digitalseams.com/blog/the-behavioral-cost-of-personalized-pricing
3•bobbiechen•9m ago•0 comments

Steinway Spirio: The most famous concert pianos got a major tech upgrade (2024)

https://www.technologyreview.com/2024/02/28/1088268/steinway-spirio-concert-pianos-performance-up...
1•xeonmc•10m ago•0 comments

The Most Extreme CSS Reset Ever Created: 10k Lines of Failure

https://dustin.boston/css-reset/
2•clairvoyant_cod•11m ago•0 comments

Incidental Complexity

https://blog.kasperhermansen.com/posts/incidental-complexity/
1•kjuulh•14m ago•0 comments

Spanish track was fractured before high-speed train disaster, report finds

https://www.bbc.com/news/articles/c1m77dmxlvlo
2•Rygian•15m ago•0 comments

The AI Revolution in Coding: Why I'm Ignoring the Prophets of Doom

https://codingismycraft.blog/index.php/2026/01/23/the-ai-revolution-in-coding-why-im-ignoring-the...
11•mmphosis•18m ago•1 comments

A Metabolic Workspace

https://www.joanwestenberg.com/a-metabolic-workspace/
1•andsoitis•23m ago•0 comments

First, Make Me Care

https://gwern.net/blog/2026/make-me-care
2•andsoitis•24m ago•0 comments

National poll: Less than half of parents say swearing is never OK for kids

https://www.michiganmedicine.org/health-lab/less-half-parents-say-swearing-never-ok-kids
3•PaulHoule•27m ago•0 comments

Frozen Insight in a Moving World

https://jdu.github.io/2026-01-25-frozen-insights-in-a-moving-world.html
1•todsacerdoti•27m ago•0 comments

Strategies and lessons from partitioning a 17TB table in PostgreSQL

https://www.tines.com/blog/futureproofing-tines-partitioning-a-17tb-table-in-postgresql/
1•shayonj•28m ago•0 comments

List of Engineering Blunders

https://en.wikipedia.org/wiki/List_of_engineering_blunders
4•erhuve•28m ago•1 comments

The API Authorization Hierarchy of Needs: Why You Aren't Ready for AI Agents

https://auth0.com/blog/api-authorization-hierarchy-needs/
1•aaguiarz•30m ago•0 comments

Show HN: HyprKCS – A fast, native GTK4/Adwaita keybind manager for Hyprland

https://github.com/kosa12/hyprKCS
1•kosa12•30m ago•0 comments

Show HN: Decompile and deminify Bun using an LLM

https://www.npmjs.com/package/@shepherdjerred/bun-decompile
1•shepherdjerred•31m ago•0 comments

Show HN: Fdir – find and organize anything on your system

https://github.com/VG-dev1/fdir
1•Orbyss_Studio•32m ago•0 comments

Forza's Game Studio Rejects No-AI Clause, French VA Localization Canceled

https://twitter.com/MathieuTouquet/status/2015425148237533311
2•WhereIsTheTruth•32m ago•0 comments

Show HN: Uv-pack – Pack a uv environment for later portable (offline) install

https://github.com/davnn/uv-pack
2•davnn•32m ago•0 comments

Animals Build a Sense of Direction

https://www.quantamagazine.org/how-animals-build-a-sense-of-direction-20260121/
1•tzury•32m ago•0 comments

ACM Conference on Reproducibility and Replicability

https://acmrep.github.io
2•jruohonen•33m ago•1 comments

Generative AI is not trained on "data"

https://deniz.aksimsek.tr/2026/training-data/
1•speckx•33m ago•0 comments

PkgFed: ActivityPub for Package Releases

https://nesbitt.io/2026/01/25/pkgfed-activitypub-for-package-releases.html
2•8organicbits•33m ago•0 comments