frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•9mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Bet on German Train Delays

https://bahn.bet
1•indiantinker•3m ago•0 comments

Agentmap – Like SKILL spec but for code. A frontmatter for source code files

https://github.com/remorses/agentmap
1•xmorse•3m ago•1 comments

Iran Strikes U.S. Military Communication Infrastructure in Mideast

https://www.nytimes.com/2026/03/03/world/middleeast/iran-strikes-us-military-communication-infras...
2•TheAlchemist•3m ago•0 comments

What 127.5M forms can tell you about the state of front-end regex input v

https://amandastjerna.se/blog/127-million-forms/
1•fanf2•3m ago•0 comments

The next era of social media: built and run in Europe, ruled by our laws

https://www.eurosky.tech
1•doener•5m ago•0 comments

American Snacking Habits Are Transforming the Restaurant Industry

https://www.theatlantic.com/health/2026/03/restaurant-snack-meal-menu/686220/
1•fortran77•9m ago•1 comments

Solidjs releases 2.0 beta – The <Suspense> is Over

https://github.com/solidjs/solid/releases/tag/v2.0.0-beta.0
1•evertheylen•10m ago•1 comments

The Corporate Bullshit Receptivity Scale

https://www.sciencedirect.com/science/article/abs/pii/S0191886926000620
1•robtherobber•11m ago•0 comments

Bugsight – CLI tool that analyzes errors and suggests fixes

https://github.com/Arnel-rah/bugsight
1•nel123•11m ago•0 comments

The Context Optimization Layer for LLM Applications

https://github.com/chopratejas/headroom
1•selvan•12m ago•0 comments

Show HN: Ungrind – the solopreneur CRM that updates itself

https://ungrind.ai
1•magnumpowerz•13m ago•0 comments

I built Formguard – form back end without DB or APIs

https://formguard.strivio.world
1•sh20raj•14m ago•1 comments

Show HN: CUP – MCP but for desktop UI (open spec for computer use agents)

https://github.com/computeruseprotocol/computeruseprotocol
2•k4cper-g•14m ago•2 comments

Google Analytics Made Beautiful

https://analyticsma.de/
1•daniloao•15m ago•0 comments

Ask HN: How do you catch OpenAPI drift before the UI breaks?

1•losalah•16m ago•1 comments

ClawOS:Linux Panel for OpenClaw,nanobot,picoclaw,nullclaw

https://github.com/mrytsr/clawos
1•mrytsr•17m ago•0 comments

LocalStack: Community edition abandoned, users will need to create an account

https://blog.localstack.cloud/the-road-ahead-for-localstack/
1•greatgib•19m ago•0 comments

Show HN: You don't forget password, You just forget pattern

https://drp.kingname.info/
1•kingname•20m ago•1 comments

CIA working to arm Kurdish forces to spark uprising in Iran, sources say

https://www.cnn.com/2026/03/03/politics/cia-arming-kurds-iran
3•Imustaskforhelp•20m ago•2 comments

Which LLMs fold under pressure? We made 6 LLMs argue 300 hard cases to find out

https://servanda.ai/benchmarks/the-post-training-stress-test
1•luke14free•21m ago•1 comments

From RGB to L*a*b* color space (2024)

https://kaizoudou.com/from-rgb-to-lab-color-space/
1•kqr•21m ago•0 comments

Show HN: SFT to convert a base language model into a conversational chat model

https://github.com/onurkanbakirci/Llama-2-7b-oasst-sft
1•onurkanbkrc•21m ago•0 comments

OpenAI in talks to deploy AI across NATO classified networks

https://www.marketscreener.com/news/openai-in-talks-to-deploy-ai-across-nato-classified-networks-...
1•_____k•22m ago•0 comments

Donx64mcp-dbg – an injected DLL debugger toolkit with an MCP server for x64 apps

https://github.com/d0nk3yhm/donx64mcp-dbg
1•d0nk3yhm•22m ago•1 comments

Molmo 2: video understanding, pointing, and tracking

https://github.com/allenai/molmo2
1•tamnd•22m ago•0 comments

Erratic ILS Signal Causes a Missed Approach

https://www.boldmethod.com/learn-to-fly/safety/erratic-ils-signal-causes-a-missed-approach/
1•kqr•22m ago•0 comments

Show HN: Dirsv – live reload server for dir browsing, GFM, and more filetypes

https://github.com/letientai299/dirsv
1•letientai299•23m ago•0 comments

Wikipedia articles on the Iran war are being rewritten in real time

https://medium.com/@chris_50496/the-world-is-burning-wikipedia-is-being-rewritten-in-real-time-5c...
2•membrshiperfect•24m ago•1 comments

Toyota and Stellantis exit Tesla's EU regulatory pool for 2026 – Ford remains

https://www.schmidtmatthias.de/post/toyota-and-stellantis-exit-tesla-s-eu-regulatory-pool-for-202...
1•doener•24m ago•0 comments

Slack bot coding agent built on pi (mom)

https://github.com/badlogic/pi-mono/tree/main/packages/mom
1•rmhsilva•25m ago•0 comments