Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•11mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extensible across domains (math, retrieval, reasoning, tool use, etc.).

What's in Zeno for now?
- Auditable, stateless reward functions for Python code: docstrings, ruff linting, type hints, recursion, and more.
- Works directly with Hugging Face's TRL or any RL loop; plug reward functions in as needed.
- MIT licensed and minimal.
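To make "verifiable, deterministic reward" concrete, here is a minimal sketch of what such a function could look like. It is not taken from Zeno; the function name and the 50/50 weighting are hypothetical, and it uses only the standard library. It assumes the `reward_func(completions, **kwargs) -> list[float]` shape that TRL's GRPOTrainer accepts for custom rewards, with completions passed as plain strings rather than chat messages.

```python
import ast


def docstring_and_hints_reward(completions, **kwargs):
    """Return one score in [0, 1] per completion.

    Hypothetical example, not Zeno's API. Each function in a completion
    earns 0.5 for having a docstring and 0.5 for being fully annotated
    (every parameter plus the return type). The completion's score is
    the mean over its functions.
    """
    rewards = []
    for text in completions:
        try:
            tree = ast.parse(text)
        except (SyntaxError, ValueError):
            # Code that does not parse gets no reward.
            rewards.append(0.0)
            continue

        funcs = [
            node
            for node in ast.walk(tree)
            if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
        ]
        if not funcs:
            # Nothing to score, so no reward.
            rewards.append(0.0)
            continue

        total = 0.0
        for fn in funcs:
            has_doc = ast.get_docstring(fn) is not None
            params = fn.args.posonlyargs + fn.args.args + fn.args.kwonlyargs
            has_hints = fn.returns is not None and all(
                p.annotation is not None for p in params
            )
            total += 0.5 * has_doc + 0.5 * has_hints
        rewards.append(total / len(funcs))
    return rewards
```

The function keeps no state and only parses the text, never executing it, so the same completion always receives the same score and the score can be audited by reading the code. Under the assumptions above it could be passed to a trainer in a `reward_funcs` list, or called directly from any custom RL loop.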

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning, and agentic behaviors are on the to-do list.

Repo: https://github.com/think-a-tron/zeno

Docs and more details are in the README.

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.
