frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•9mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Show HN: A visual sitemap editor that forces you to design structure before UI

2•epic_ai•2m ago•1 comments

Show HN: Memctl v0.1.0 Open source shared persistent memory for AI coding agents

https://memctl.com
1•meszmate•3m ago•0 comments

HeadElf-Mvidia: Executive Intelligence Template

https://github.com/pauljbernard/HeadElf-MVIDIA
2•paulbernard•6m ago•1 comments

Agents are not thinking: Science of agent behavior

https://technoyoda.github.io/agent-science.html
3•chse_cake•10m ago•0 comments

Sam Altman Answers Questions on X.com About Pentagon Deal, Threats to Anthropic

https://news.slashdot.org/story/26/03/01/0233230/sam-altman-answers-questions-on-xcom-about-penta...
1•MilnerRoute•11m ago•0 comments

Church of the SubGenius

https://en.wikipedia.org/wiki/Church_of_the_SubGenius
1•thomassmith65•11m ago•0 comments

Show HN: MCP server that strips injection vectors and cuts token costs by 93%

https://github.com/timstarkk/mcp-safe-fetch
1•timstark•12m ago•0 comments

Shipping Traffic Through Strait of Hormuz Plummets After Attacks on Iran

https://www.nytimes.com/2026/02/28/world/middleeast/strait-of-hormuz-ship-traffic.html
1•ParentiSoundSys•15m ago•0 comments

Show HN: ChoresMates – Splitwise, but for Household Chores

https://apps.apple.com/us/app/choresmates/id6757452488
1•bittujoju•19m ago•0 comments

Iran's Supreme Leader Ali Khamenei Killed

https://www.reuters.com/world/middle-east/irans-supreme-leader-ali-khamenei-killed-senior-israeli...
6•codethief•26m ago•2 comments

Ask HN: What did you find out or explore today?

3•blahaj•34m ago•4 comments

Ercot Max Solar Record 31 GW

https://www.gridstatus.io/records/ercot?record=Maximum%20Solar
1•chris222•35m ago•1 comments

Dating Apps, Data Structures, and Dopamine

https://www.errorcodezero.dev/blog/dating-apps-dsa-and-dopamine/
2•errorcodezero•36m ago•1 comments

The Science of Detecting LLM-Generated Text

https://dl.acm.org/doi/10.1145/3624725
1•vinhnx•42m ago•0 comments

Niche Developer Tooling for WordPress

https://coderjerk.com/blog/icenberg
1•ddevine•43m ago•0 comments

In puzzling outbreak, officials look to cold beer, gross ice, and ChatGPT

https://arstechnica.com/health/2026/02/did-chatgpt-help-health-officials-solve-a-weird-outbreak-m...
1•Bender•43m ago•0 comments

The Double Standard of Carbon: Why we grant souls to meat but not silicon

https://natansessays.com/posts/the-myth-of-carbon-narcissism/
1•JhonOliver•45m ago•3 comments

History Rhymes: Large Language Models Off to a Bad Start?

https://michaeljburry.substack.com/p/history-rhymes-large-language-models
1•drob518•47m ago•0 comments

What Was Software Programmer Contribution in the Human Technology Timeline?

https://medium.com/@ggonweb/what-was-the-software-programmer-generations-contribution-in-the-huma...
1•ggonweb•49m ago•1 comments

Stem cells provide a potent treatment for frailty

https://www.nature.com/articles/d41586-026-00584-y
1•bilsbie•50m ago•0 comments

Strike in the Middle East use Anthropic even after the Trump ban

https://www.wsj.com/livecoverage/iran-strikes-2026/card/u-s-strikes-in-middle-east-use-anthropic-...
2•johncole•52m ago•0 comments

Samsung Galaxy update removes Android recovery menu tools, including sideloading

https://9to5google.com/2026/02/27/samsung-galaxy-update-android-recovery-menu-removed/
19•pabs3•52m ago•1 comments

Ask HN: How would you know if an AI model has been nerfed?

2•gitgud•54m ago•1 comments

The trap Anthropic built for itself

https://techcrunch.com/2026/02/28/the-trap-anthropic-built-for-itself/
1•pseudolus•55m ago•0 comments

Sites with a /Now Page

https://nownownow.com
2•zdw•59m ago•0 comments

Happy Map

https://pudding.cool/2026/02/happy-map/
1•latexr•1h ago•0 comments

Just two days of oatmeal cut bad cholesterol by 10%

https://www.sciencedaily.com/releases/2026/02/260225081217.htm
28•gradus_ad•1h ago•21 comments

Microgpt

http://karpathy.github.io/2026/02/12/microgpt/
75•tambourine_man•1h ago•8 comments

Blender iPad App Development Halted as Android Tablets Get Priority

https://www.macrumors.com/2026/02/27/blender-ipad-pro-app-development-halted/
3•mrkpdl•1h ago•0 comments

Reconstructing OPL: Joseph Weizenbaum's Online Programming Language

https://timereshared.com/reconstructing-joseph-weizenbaums-opl/
2•abrax3141•1h ago•0 comments