frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•10mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Does an LLM Trained on Epstein's Voice Make Better Deals?

https://morgin.ai/articles/epsteinbench-we-brought-epsteins-voice-back.html
1•llmmadness•38s ago•0 comments

Lina Khan was right – Khan's FTC tried to expand the scope of antitrust law

https://www.theverge.com/report/896820/lina-khan-ftc-meta-supernatural-antitrust
1•isodev•5m ago•1 comments

Ireland's population chart remains a wild one to look at

https://bsky.app/profile/simongerman600.bsky.social/post/3mhgs4wevyc2g
2•doener•7m ago•0 comments

Best Resource AI

https://bestresource-ai.web.app
2•aboua•8m ago•1 comments

Show HN: Memoria – Snapshot, branch, and rollback for AI agent memory

https://github.com/matrixorigin/Memoria
2•MatrixOrigin•8m ago•0 comments

Lessons engineers learn only after breaking production – backups, rollbacks, +5

https://newsletter.manager.dev/p/the-unwritten-laws-of-software-engineering
2•AntonZ234•11m ago•0 comments

Ecological vaccination: strategy to prevent zoonotic spillover from bats

https://www.science.org/doi/10.1126/sciadv.aec0269
2•vharuck•12m ago•0 comments

Show HN: OpenAI CLIP fine tuned on Galaxy morphology

https://huggingface.co/juppy44/galaxy-clip-finetuned
1•mjupp1•13m ago•0 comments

Figma Competitor from Google

https://stitch.withgoogle.com/
2•wiradikusuma•14m ago•0 comments

Show HN: I built an AI agent that does marketing for my apps

https://asaagent.xyz
2•akhrail1996•17m ago•0 comments

The Psychology of Millennials Who Want Their Childhood Dream Car

https://www.youtube.com/watch?v=V9MySp86Ygg
2•doener•17m ago•0 comments

A Month with OpenAI's Codex

https://highcaffeinecontent.com/blog/20260301-A-Month-With-OpenAIs-Codex
1•ingve•21m ago•0 comments

Why Your AI Agent Needs a Proxy

https://proxybase.xyz/blog/why-your-ai-agent-needs-a-proxy
1•m00dy•23m ago•0 comments

It's hard to build the right thing

https://www.dev-log.me/my_ai_dev_workflow/
2•yfk999•25m ago•0 comments

Tall buildings lead to more compact and productive cities

https://cepr.org/voxeu/columns/tall-buildings-lead-more-compact-and-productive-cities
3•005vc16607•26m ago•0 comments

OpenDQV – open-source data quality validation at the point of write

https://github.com/OpenDQV/OpenDQV
1•OpenDQV•29m ago•1 comments

Yet Another Image Translator

https://image-1.org/
1•ashing•29m ago•0 comments

Reasoning Core: Procedural Data Generation Suite for Symbolic Pre-Training

https://arxiv.org/abs/2603.02208
1•jean-porte•30m ago•0 comments

Claude Code Channels

https://twitter.com/trq212/status/2034761016320696565
2•edf13•33m ago•1 comments

Show HN: TourVault – Voice-first golf analytics SaaS (for sale, $12K)

https://tourvault.ai
1•masonwyatt23•35m ago•0 comments

FSFE supporters affected: Payment provider Nexi cancelled us

https://fsfe.org/news/2026/news-20260316-01.en.html
14•rasjani•36m ago•1 comments

XML Is the Future

https://www.bitecode.dev/p/hype-cycles
2•Seb-C•40m ago•0 comments

Ortrace- We built a system that connect all your feedback sources into one place

https://ortrace.com/
1•soooovittt•40m ago•1 comments

An Opinionated Guide to Agentic Coding

https://aidanli.dev/writing/articles/agentic-coding
2•vinhnx•43m ago•0 comments

How I Do Personal Experiments (2020)

https://commoncog.com/doing-personal-experiments/
1•blackbrokkoli•43m ago•0 comments

OpenAI tries to build its coding cred, acquires Python toolmaker Astral

https://www.theregister.com/2026/03/19/openai_aims_for_the_stars/
3•ajkavanagh•47m ago•0 comments

Blue Origin FCC application to launch 51,600 datacenter satellites

https://www.theregister.com/2026/03/20/blue_origin_project_sunrise_orbital_datacenter/
2•defrost•47m ago•0 comments

M5 Max MacBook Pro beats Nvidia RTX 5090 laptops at Blender 5.1 rendering

https://opendata.blender.org/benchmarks/query/?compute_type=METAL&compute_type=OPTIX&blender_vers...
1•ykl•50m ago•1 comments

One async call for grounded web research (web-scout-AI)

https://github.com/RSO9192/web-scout-ai
1•RSO9912•56m ago•0 comments

Why is US tech giant Palantir suing a small Swiss magazine?

https://www.theguardian.com/global-development/2026/mar/20/us-tech-giant-palantir-swiss-magazine-wav
5•charlysl•56m ago•0 comments