Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•9mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now?
- Auditable, stateless reward functions for Python code: docstrings, ruff linting, type hints, recursion, and more
- Works directly with Huggingface's TRL or any RL loop: plug reward functions in as needed.
- MIT licensed and minimal.
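To make "auditable, stateless reward function" concrete, here is a minimal sketch of what such a function could look like. It is not taken from Zeno's code: the names `docstring_reward`, `type_hint_reward`, and `_functions` are hypothetical, and the sketch assumes plain-string completions and the `(completions, **kwargs) -> list[float]` shape that TRL's GRPOTrainer accepts for custom reward functions.

```python
import ast


def _functions(source: str) -> list:
    """Return every function definition in `source`, or [] if it does not parse."""
    try:
        tree = ast.parse(source)
    except SyntaxError:
        return []
    return [
        node
        for node in ast.walk(tree)
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
    ]


def docstring_reward(completions: list[str], **kwargs) -> list[float]:
    """Score each completion by the fraction of its functions that have a docstring."""
    rewards = []
    for text in completions:
        funcs = _functions(text)
        if not funcs:
            # Unparseable code, or code with no functions, earns nothing.
            rewards.append(0.0)
            continue
        documented = sum(1 for f in funcs if ast.get_docstring(f))
        rewards.append(documented / len(funcs))
    return rewards


def type_hint_reward(completions: list[str], **kwargs) -> list[float]:
    """Score each completion by the fraction of its functions that are fully annotated."""
    rewards = []
    for text in completions:
        funcs = _functions(text)
        if not funcs:
            rewards.append(0.0)
            continue
        annotated = 0
        for f in funcs:
            params = f.args.posonlyargs + f.args.args + f.args.kwonlyargs
            # "Fully annotated" here means a return annotation plus one per parameter.
            if f.returns is not None and all(p.annotation is not None for p in params):
                annotated += 1
        rewards.append(annotated / len(funcs))
    return rewards
```

Both functions depend only on the completion text, so the same input always yields the same score and any reward can be re-checked by hand. Under the assumption above, they could be passed together as a list of reward functions to a TRL trainer, or called directly from a custom RL loop.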

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning, and agentic behaviors are on the to-do list.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.
