Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•1y ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extensible across domains (math, retrieval, reasoning, tool use, etc.).

What's in Zeno for now?
- Auditable, stateless reward functions for Python code: docstrings, ruff linting, type hints, recursion, and more.
- Works directly with Hugging Face's TRL or any RL loop: plug reward functions in as needed.
- MIT licensed and minimal.
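To make "stateless, deterministic reward function" concrete, here is a minimal sketch of what a docstring reward could look like. This is not Zeno's actual API: the name `docstring_reward` is hypothetical, and the signature assumes the TRL convention where a custom reward function receives `completions` as a list of strings (plus extra keyword arguments) and returns one float per completion.

```python
import ast


def docstring_reward(completions, **kwargs):
    """Score each completion by the fraction of its functions that have a docstring.

    Hypothetical example, not taken from Zeno. Assumes each completion is a
    plain string of Python source. Unparseable code and code that defines no
    functions both score 0.0.
    """
    rewards = []
    for source in completions:
        try:
            tree = ast.parse(source)
        except (SyntaxError, ValueError):
            # Code that does not parse gets the minimum reward.
            rewards.append(0.0)
            continue

        functions = [
            node
            for node in ast.walk(tree)
            if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
        ]
        if not functions:
            rewards.append(0.0)
            continue

        documented = sum(
            1 for fn in functions if ast.get_docstring(fn) is not None
        )
        rewards.append(documented / len(functions))
    return rewards
```

Because the function only parses the text it is given and keeps no state between calls, the same completion always receives the same score, which is what makes the reward auditable. Under the same assumption about TRL's interface, a function of this shape could be passed in a trainer's `reward_funcs` list or called directly from a custom RL loop.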

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning, and agentic behaviors are on the to-do list.

Repo: https://github.com/think-a-tron/zeno

Docs and more details are in the README.

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.
