
Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•6mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extensible across domains (math, retrieval, reasoning, tool use, etc.).

What's in Zeno for now?
- Auditable, stateless reward functions for Python code: docstrings, ruff linting, type hints, recursion, and more.
- Works directly with Hugging Face's TRL or any RL loop: plug reward functions in as needed.
- MIT licensed and minimal.
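To make "stateless, deterministic reward function" concrete, here is a minimal sketch of what one might look like. It is not Zeno's actual API: the function name and the 50/50 weighting are hypothetical, and it assumes each completion is a plain string of Python source. The signature (a list of completions plus keyword arguments in, a list of floats out) follows the convention TRL's GRPOTrainer uses for custom reward functions.

```python
import ast


def docstring_and_hints_reward(completions, **kwargs):
    """Score each completion in [0, 1] by docstring and return-hint coverage.

    Hypothetical example for illustration; not taken from Zeno.
    """
    rewards = []
    for code in completions:
        try:
            tree = ast.parse(code)
        except SyntaxError:
            rewards.append(0.0)  # code that does not parse earns nothing
            continue

        # Collect every function definition, including nested and async ones.
        funcs = [
            node
            for node in ast.walk(tree)
            if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
        ]
        if not funcs:
            rewards.append(0.0)  # nothing to score
            continue

        documented = sum(ast.get_docstring(f) is not None for f in funcs)
        annotated = sum(f.returns is not None for f in funcs)

        # Assumed weighting: half for docstrings, half for return annotations.
        rewards.append(0.5 * documented / len(funcs) + 0.5 * annotated / len(funcs))
    return rewards
```

Because the function only parses the text and never executes it, the same completion always gets the same score, which is what makes the reward auditable.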

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning, and agentic behaviors are on the to-do list.

Repo: https://github.com/think-a-tron/zeno

Docs and more details are in the README.

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.
