frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•9mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Anthropic's AI Coding Study: How You Use AI Matters More Than Whether You Use It

https://luther.io/articles/personal-growth-in-the-age-of-ai/
1•vidluther•6m ago•0 comments

Is traditional ML relevant anymore? Any active research going on in ML methods?

1•mitml•7m ago•0 comments

Hyping an Editor in the Age of AI

https://tildehacker.com/hyping-an-editor-in-the-age-of-ai
1•tildehacker•13m ago•0 comments

Intel Foundry boss leaves for Qualcomm

https://www.tomshardware.com/tech-industry/semiconductors/intel-foundry-boss-leaves-for-qualcomm-...
2•teleforce•14m ago•0 comments

Show HN: Leviathan, A world where AI agents write the laws and govern themselves

https://chenyu-li.info/leviathan
1•chenyusu•23m ago•1 comments

Swift System Metrics 1.0

https://www.swift.org/blog/swift-system-metrics-1.0-released/
1•peterspath•24m ago•0 comments

The normalization of corruption in organizations (2003) [pdf]

https://gwern.net/doc/sociology/2003-ashforth.pdf
1•rendx•24m ago•0 comments

Leaving AI to Code 24/7 Doesn't Work

https://twitter.com/victortaelin/status/2027214947193679932
1•jamesy0ung•24m ago•0 comments

Build dynamic agentic workflows in Opal

https://blog.google/innovation-and-ai/models-and-research/google-labs/opal-agent/
1•gmays•26m ago•0 comments

Show HN: PipelineIQ – AI diagnoses CI/CD failures and sends fixes to Slack

https://pipelineiq.dev
1•Raja_Dev•26m ago•0 comments

Techniques of Neutralization

https://en.wikipedia.org/wiki/Techniques_of_neutralization
1•rendx•28m ago•0 comments

All the troubles of the world by Isaac Asimov [pdf]

https://schools.ednet.ns.ca/avrsb/070/rsbennett/HORTON/shortstories/All%20the%20troubles%20of%20t...
1•thunfischtoast•29m ago•1 comments

Agentic Engineering Patterns

https://simonwillison.net/guides/agentic-engineering-patterns/
1•admp•33m ago•0 comments

Measuring CPU Load

https://en.wikipedia.org/wiki/Load_(computing)
1•pacct2025•36m ago•0 comments

Boston Review: A Brief History of AI Psychosis

https://www.bostonreview.net/articles/a-brief-history-of-ai-psychosis/
1•t0lo•44m ago•1 comments

Show HN: Arrival Radar

https://entropicthoughts.com/arrival-radar
2•kqr•47m ago•1 comments

How AI will change GTM?

1•imwoody•48m ago•0 comments

Show HN: Export Your YouTube Subscriptions to OPML (and Use Any RSS Reader)

https://gist.github.com/jillesvangurp/b43cc5bbdbc0a9a29c7f0944d6cc5854
1•jillesvangurp•49m ago•0 comments

When the Bubble Bursts

http://theprogrammersparadox.blogspot.com/2026/02/when-bubble-bursts.html
3•r4um•50m ago•1 comments

Spatial AI-native graph workspace

1•aXlireza•51m ago•0 comments

Show HN: MotionDesk: Physics-driven, Metal-accelerated wallpapers for macOS

3•motiondeskapp•51m ago•0 comments

Show HN: Nano Banana 2 – Sub-second AI image gen via Gemini 3.1 Flash

https://nano-banana2.me/
1•naxtsass•55m ago•0 comments

Show HN: Conduit – Automatic Port Forwarding for Docker Containers

https://github.com/Oranda-IO/Conduit
1•orandaio•55m ago•0 comments

RFC 9925: Unsigned X.509 Certificates

https://datatracker.ietf.org/doc/rfc9925/
1•raquuk•58m ago•0 comments

I used Claude AI to build this website that shows upcoming indie game festivals

https://festival-watch.vercel.app/
2•rotub•58m ago•1 comments

Chivalry Test

https://chivalryscore.com
1•onSmallMessage•59m ago•1 comments

We found 118 performance bugs across 2 PRs written with Claude Code

https://www.codeflash.ai/blog-posts/hidden-cost-of-coding-agents
5•misrasaurabh1•1h ago•1 comments

Vegetarians have 'substantially lower risk' of five types of cancer

https://www.theguardian.com/society/2026/feb/27/vegetarians-have-substantially-lower-risk-of-five...
1•plaguna•1h ago•0 comments

Man jailed after selling £7M of fake plane parts

https://www.bbc.com/news/articles/c78xz5j848vo
2•dataflow•1h ago•0 comments

Pplx-Embed: Embedding Models for Web-Scale Retrieval

https://research.perplexity.ai/articles/pplx-embed-state-of-the-art-embedding-models-for-web-scal...
1•jxmorris12•1h ago•0 comments