Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•8mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extensible across domains (math, retrieval, reasoning, tool use, etc.).

What's in Zeno for now?

- Auditable, stateless reward functions for Python code: docstrings, ruff linting, type hints, recursion, and more.
- Works directly with Hugging Face's TRL or any RL loop; plug reward functions in as needed.
- MIT licensed and minimal.
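To make "stateless, deterministic reward function" concrete, here is a minimal sketch of what such a check could look like. This is not Zeno's actual API: the names `docstring_reward` and `type_hint_reward` are hypothetical, and the sketch assumes completions arrive as plain Python source strings and that the RL loop expects one float per completion.

```python
import ast


def _functions(code):
    """Parse source and return its function nodes, or None if it does not parse."""
    try:
        tree = ast.parse(code)
    except (SyntaxError, ValueError):
        return None
    return [
        node for node in ast.walk(tree)
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
    ]


def docstring_reward(completions, **kwargs):
    """Hypothetical reward: fraction of functions that carry a docstring."""
    rewards = []
    for code in completions:
        funcs = _functions(code)
        if not funcs:  # unparsable code or no functions at all
            rewards.append(0.0)
            continue
        documented = sum(1 for f in funcs if ast.get_docstring(f))
        rewards.append(documented / len(funcs))
    return rewards


def type_hint_reward(completions, **kwargs):
    """Hypothetical reward: fraction of fully annotated functions."""
    rewards = []
    for code in completions:
        funcs = _functions(code)
        if not funcs:
            rewards.append(0.0)
            continue
        hinted = 0
        for f in funcs:
            params = f.args.posonlyargs + f.args.args + f.args.kwonlyargs
            # Fully annotated: return annotation plus every named parameter.
            if f.returns is not None and all(p.annotation for p in params):
                hinted += 1
        rewards.append(hinted / len(funcs))
    return rewards
```

Both functions depend only on the completion text, so the same input always yields the same score and each score can be audited by reading the code. TRL's `GRPOTrainer` accepts custom callables through its `reward_funcs` argument and calls them with `completions` plus extra keyword arguments, so functions of this shape should plug in there, assuming a non-conversational dataset where completions are strings.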

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning, and agentic behaviors are on the to-do list.

Repo: https://github.com/think-a-tron/zeno

Docs and more details are in the README.

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.
