Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•9mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extensible across domains (math, retrieval, reasoning, tool use, etc.).

What's in Zeno for now?

- Auditable, stateless reward functions for Python code: docstrings, ruff linting, type hints, recursion, and more.
- Works directly with Huggingface's TRL or any RL loop; plug reward functions in as needed.
- MIT licensed and minimal.
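To make "auditable, stateless reward function" concrete, here is a minimal sketch of a docstring reward. It is not taken from Zeno; the name `docstring_reward` and the scoring rule (fraction of functions that carry a docstring) are assumptions made for illustration. The `(completions, **kwargs) -> list[float]` shape follows the convention TRL's GRPOTrainer uses for custom reward functions, assuming each completion is a plain string.

```python
import ast


def docstring_reward(completions, **kwargs):
    """Score each completion by the fraction of its functions that have a docstring.

    Hypothetical example, not Zeno's API. Deterministic and stateless:
    the same completion always yields the same score.
    """
    rewards = []
    for text in completions:
        try:
            tree = ast.parse(text)
        except SyntaxError:
            rewards.append(0.0)  # code that does not parse earns nothing
            continue
        funcs = [
            node
            for node in ast.walk(tree)
            if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
        ]
        if not funcs:
            rewards.append(0.0)  # nothing to document, so nothing to reward
            continue
        documented = sum(1 for f in funcs if ast.get_docstring(f) is not None)
        rewards.append(documented / len(funcs))
    return rewards
```

Because the function only parses the text and never executes it, every score can be checked by reading the completion, which is the property the "verifiable" framing relies on.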

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning, and agentic behaviors are on the to-do list.

Repo: https://github.com/think-a-tron/zeno

Docs and more details are in the README.

Comments, critiques, and real-world use cases are encouraged, especially if you want to push beyond code.
