frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•9mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Human Brain Cells on a Chip Learned to Play Doom in a Week

https://m.slashdot.org/story/452896
1•computersuck•1m ago•0 comments

Show HN: CanaryAI – Claude Code Security Monitoring Tool

https://github.com/jx887/homebrew-canaryai
1•jx887•4m ago•0 comments

Pure Rust, zero dependencies AI models, runs locally, free forever

https://huggingface.co/qoranet
1•blockmandev•6m ago•1 comments

Show HN: Circuitchat, a Tor-first encrypted messaging program using Noise

https://github.com/uncognic/circuitchat
1•uncognic•7m ago•0 comments

Samsung Galaxy update removes Android recovery menu tools, including sideloading

https://9to5google.com/2026/02/27/samsung-galaxy-update-android-recovery-menu-removed/
1•josephcsible•15m ago•0 comments

OpenAI Reaches A.I. Agreement With Defense Dept. After Anthropic Clash

https://www.nytimes.com/2026/02/27/technology/openai-reaches-ai-agreement-with-defense-dept-after...
3•jbegley•19m ago•0 comments

Bird Losses Are Accelerating

https://www.nytimes.com/2026/02/26/climate/bird-declines.html
4•lxm•24m ago•0 comments

India disrupts access to popular developer platform Supabase with blocking order

https://techcrunch.com/2026/02/27/india-disrupts-access-to-popular-developer-platform-supabase-wi...
1•pouwerkerk•25m ago•0 comments

A Day in the Life of an Enshittificator [video]

https://www.youtube.com/watch?v=T4Upf_B9RLQ
1•ianrahman•31m ago•0 comments

We may Soon have City-Spanning 900 MHz Mesh Networks (2021)

https://cheapskatesguide.org/articles/900mhz-mesh.html
1•ColinWright•34m ago•0 comments

System prompt change Claude's reasoning depth – side-by-side comparison tool

https://claude.ai/public/artifacts/eba2a270-dd61-4f0c-a276-34a53e604f13
2•Yuudaiikoma•34m ago•1 comments

Build your own Command Line with ANSI escape codes (2016)

https://www.lihaoyi.com/post/BuildyourownCommandLinewithANSIescapecodes.html
1•vinhnx•34m ago•0 comments

The Enshittificator [video]

https://vimeo.com/1168468796
2•gurjeet•35m ago•0 comments

SUNN O))) HalfLife Fer Mmxxv

https://sunn.southernlord.com/sunn-o-halflife-fer-mmxxv/
2•rglover•35m ago•0 comments

YouTube now determines your watch list [video]

https://www.youtube.com/watch?v=7U_LhzgwJ4U&list=RD7U_LhzgwJ4U
2•bilekas•39m ago•0 comments

Binance's MAGA-Branding Strategy

https://www.thenation.com/article/economy/binance-crypto-trump/
2•petethomas•39m ago•0 comments

As We May Think (1945)

https://en.wikipedia.org/wiki/As_We_May_Think
2•ColinWright•42m ago•0 comments

Estimating π with a Coin

https://arxiv.org/abs/2602.14487
2•vismit2000•43m ago•0 comments

The Mountain Eagle Is Now Online

https://www.mountaineagle.net/articles/display/?entry_short=the-mountain-eagle-is-now-online
2•retrocog•48m ago•1 comments

German Tank Problem

https://en.wikipedia.org/wiki/German_tank_problem
1•ColinWright•50m ago•0 comments

Latency

https://cheat.sh/latency
1•vismit2000•51m ago•0 comments

Show HN: Agents-lint – detect stale paths and context rot in AGENTS.md files

https://github.com/giacomo/agents-lint
1•devGiacomo•51m ago•1 comments

Show HN: Recall – Persistent Memory for Claude Code via MCP Hooks

https://recallmcp.com
1•elfenleid•52m ago•0 comments

The Reason Anthropic Wants Guardrails

https://www.theatlantic.com/ideas/2026/02/anthropic-pentagon-ai/686172/
1•Stratoscope•53m ago•1 comments

Ask HN: How do products get priced after the bubble bursts?

1•AbstractH24•53m ago•1 comments

Joint Statement from OpenAI and Microsoft

https://openai.com/index/continuing-microsoft-partnership/
3•alex_young•58m ago•0 comments

OpenAI reaches deal to deploy AI models on U.S. DoW classified network

https://www.reuters.com/business/openai-reaches-deal-deploy-ai-models-us-department-war-classifie...
25•erhuve•59m ago•9 comments

Six Simple Machines: Lever, Wheel, Pulley, Inclined Plane, Wedge, and Screw

https://en.wikipedia.org/wiki/Simple_machine
1•gurjeet•1h ago•0 comments

Drug-resistant strain of deadly 'ancient fever' spreading to US

https://www.dailymail.co.uk/health/article-15598967/typhoid-fever-surging-drug-resistant-US-UK.html
1•Bender•1h ago•0 comments

OpenAI Onboards Department of War

https://twitter.com/i/status/2027578652477821175
2•dinosor•1h ago•1 comments