Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•8mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now?
- Auditable, stateless reward functions for Python code: docstrings, ruff linting, type hints, recursion, and more.
- Works directly with Hugging Face's TRL or any RL loop; plug reward functions in as needed.
- MIT licensed and minimal.
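
To make "stateless, deterministic reward function" concrete, here is a minimal sketch of two such checks (docstrings and type hints) using only the standard library. This is not Zeno's actual API: the names `docstring_reward`, `type_hint_reward`, and `_functions` are hypothetical, and the `(completions, **kwargs) -> list[float]` shape is an assumption modeled on the convention TRL describes for custom reward functions. It also assumes each completion is a plain string of Python source.

```python
import ast


def _functions(source: str) -> list:
    """Return every function definition in `source`, or [] if it does not parse."""
    try:
        tree = ast.parse(source)
    except SyntaxError:
        return []
    return [
        node
        for node in ast.walk(tree)
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
    ]


def docstring_reward(completions: list, **kwargs) -> list:
    """Hypothetical reward: fraction of functions that carry a docstring."""
    rewards = []
    for code in completions:
        funcs = _functions(code)
        if not funcs:
            # Unparseable code or no functions at all earns nothing.
            rewards.append(0.0)
            continue
        documented = sum(1 for f in funcs if ast.get_docstring(f))
        rewards.append(documented / len(funcs))
    return rewards


def type_hint_reward(completions: list, **kwargs) -> list:
    """Hypothetical reward: fraction of functions with fully annotated signatures."""
    rewards = []
    for code in completions:
        funcs = _functions(code)
        if not funcs:
            rewards.append(0.0)
            continue
        typed = 0
        for f in funcs:
            # Simplification: *args/**kwargs are ignored, and `self` counts
            # as a parameter that needs an annotation.
            params = f.args.posonlyargs + f.args.args + f.args.kwonlyargs
            if f.returns is not None and all(p.annotation is not None for p in params):
                typed += 1
        rewards.append(typed / len(funcs))
    return rewards
```

Both functions depend only on their input text, so the same completion always gets the same score and each score can be audited by reading the code. In a TRL setup, functions of this shape are typically handed to the trainer as a list of reward functions; check the TRL documentation for the exact argument name and signature in your version.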

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning, and agentic behaviors are on the to-do list.

Repo: https://github.com/think-a-tron/zeno

Docs and more details are in the README.

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Meta-study showing that the returns to education are not so high

https://onlinelibrary.wiley.com/doi/10.1111/kykl.70041
1•aswegs8•28s ago•0 comments

How safe are kids using social media? We did the groundwork

https://www.malwarebytes.com/blog/family-and-parenting/2026/02/how-safe-are-kids-using-social-med...
1•bhartzer•1m ago•0 comments

The Housing Crisis as a Land Crisis

https://progressandpoverty.substack.com/p/the-housing-crisis-as-a-land-crisis
1•infinite8s•1m ago•0 comments

MCP Server Implementation

https://docsalot.dev/blog/mcp-servers-what-they-are-and-why-your-docs-need-one
1•fazkan•2m ago•0 comments

VillageSQL

https://villagesql.com/
1•jonbaer•3m ago•0 comments

The cost of AI coding agents isn't from AI at all

https://www.coderabbit.ai/ja/blog/the-hidden-cost-of-ai-coding-agents-isnt-from-ai-at-all
1•TheAnkurTyagi•3m ago•0 comments

Flop Or Top – OSS model to predict movie IMDB ratings

https://floportop.fit/
1•Igor_Wiwi•4m ago•1 comments

President Trump Delivers "Most-Favored-Nation" Pricing for Prescription Drugs

https://trumprx.gov/
1•runamuck•4m ago•0 comments

The Stanford Emerging Technology Review 2026 [pdf]

https://setr.stanford.edu/sites/default/files/2026-02/SETR2026_web-260207.pdf
2•Nydhal•5m ago•0 comments

How2Everything: Mining the web to evaluate and improve LLMs on real procedures

https://allenai.org/blog/how2everything
1•maxloh•5m ago•1 comments

When Trump Officials' Claims About Shootings Unravel in Court

https://www.nytimes.com/2026/02/10/us/politics/homeland-security-shootings.html
3•duxup•8m ago•1 comments

Meeting Was Avoidable

https://qz.com/workplace-meetings-email-writing
1•smooke•9m ago•0 comments

Vim Review (VR)

https://github.com/bobrenjc93/vr
2•bobrenjc93•9m ago•0 comments

Container Timing: measuring web components performance

https://blogs.igalia.com/dape/2026/02/10/container-timing-measuring-web-components-performance/
1•josephscott•9m ago•0 comments

Runtime validation is still fucked in AI coding agents

1•sebringj•9m ago•1 comments

What AGI can do for your average enterprise?

https://metacriticcapital.substack.com/p/what-agi-can-do-for-your-average
1•MP_1729•11m ago•0 comments

Show HN: RoadmapAI – Turn Discord chatter into a product roadmap with AI

https://theroadmapai.com
1•karakhanyans•12m ago•2 comments

Emotive – An emoji picker for Telescope in Neovim

https://github.com/techne98/emotive.nvim
1•fixedprog•13m ago•0 comments

Show HN: CSL-Core – Formally Verified Neuro-Symbolic Safety Engine for AI

https://github.com/Chimera-Protocol/csl-core
1•aytuakarlar•14m ago•3 comments

Public Domain Image Archive – Infinite View

https://pdimagearchive.org/infinite-view/
2•helloplanets•15m ago•0 comments

We recreated the Anthropic C compiler agent

https://vizops.ai/blog/agent-scaling-laws/
1•se4u•17m ago•1 comments

Show HN: Stripe-no-webhooks – Sync your Stripe data to your Postgres DB

https://github.com/pretzelai/stripe-no-webhooks
1•prasoonds•17m ago•0 comments

AI Agents Are Running Naked

https://expanso.io/blog/2026-002-your-ai-agents-are-running-naked/
1•TheIronYuppie•20m ago•0 comments

Power-options = GUI and TLP and auto-cpufreq and cpupower

https://github.com/TheAlexDev23/power-options
1•segfault0x23•21m ago•0 comments

Show HN: GitEcho – set-and-forget Git mirroring on every push

https://github.com/prashantsengar/GitEcho
1•prashantsengar•22m ago•0 comments

Mindfulness enables more effective endoscopies in awake patients

https://medicalxpress.com/news/2026-02-mindfulness-enables-effective-endoscopies-patients.html
3•bikenaga•22m ago•1 comments

The mathematics of compression in database systems

https://www.bitsxpages.com/p/the-mathematics-of-compression-in
4•agavra•25m ago•0 comments

The Datacenter as a Computer (2013)

https://research.google/pubs/the-datacenter-as-a-computer-an-introduction-to-the-design-of-wareho...
2•tosh•25m ago•0 comments

Show HN: Open-Source SDK for AI Knowledge Work

https://github.com/ClioAI/kw-sdk
1•ankit219•26m ago•1 comments

Study: LLMs found to echo false claims in medical notes and social media

https://www.mountsinai.org/about/newsroom/2026/can-medical-ai-lie-large-study-maps-how-llms-handl...
1•giuliomagnifico•27m ago•0 comments