Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•9mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extensible across domains (math, retrieval, reasoning, tool use, etc.).

What's in Zeno for now?
- Auditable, stateless reward functions for Python code: docstrings, ruff linting, type hints, recursion, and more.
- Works directly with Hugging Face's TRL or any RL loop; plug reward functions in as needed.
- MIT licensed and minimal.
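To make the idea concrete, here is a minimal sketch of what a stateless, deterministic reward for docstrings and type hints could look like. It is not Zeno's actual API: the names `docstring_reward` and `type_hint_reward` are hypothetical, and it uses only the standard-library `ast` module. It assumes completions arrive as plain Python source strings and follows the general shape TRL expects from a custom reward function (a `completions` argument plus extra keyword arguments, returning one float per completion).

```python
import ast


def _functions(source: str) -> list:
    """Return all function definitions in `source`, or [] if it does not parse."""
    try:
        tree = ast.parse(source)
    except SyntaxError:
        return []
    return [
        node
        for node in ast.walk(tree)
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
    ]


def _fully_annotated(func) -> bool:
    """True if every parameter and the return value carry an annotation."""
    params = func.args.posonlyargs + func.args.args + func.args.kwonlyargs
    return func.returns is not None and all(p.annotation is not None for p in params)


def docstring_reward(completions: list, **kwargs) -> list:
    """Hypothetical reward: fraction of functions that have a docstring."""
    rewards = []
    for source in completions:
        funcs = _functions(source)
        if not funcs:
            # Unparseable code, or code with no functions, earns nothing.
            rewards.append(0.0)
            continue
        documented = sum(1 for f in funcs if ast.get_docstring(f) is not None)
        rewards.append(documented / len(funcs))
    return rewards


def type_hint_reward(completions: list, **kwargs) -> list:
    """Hypothetical reward: fraction of functions that are fully annotated."""
    rewards = []
    for source in completions:
        funcs = _functions(source)
        if not funcs:
            rewards.append(0.0)
            continue
        annotated = sum(1 for f in funcs if _fully_annotated(f))
        rewards.append(annotated / len(funcs))
    return rewards
```

Because each function depends only on the text of the completion, the same input always yields the same score, which is what makes the reward auditable. Functions of this shape could be passed to an RL trainer that accepts callable rewards, or called directly in a custom loop.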

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning, and agentic behaviors are on the to-do list.

Repo: https://github.com/think-a-tron/zeno

Docs and more details are in the README.

Comments, critiques, and real-world use cases are welcome, especially if you want to push beyond code.
