frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•9mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

The A.I. Disruption Is Here, and It's Not Terrible

https://www.nytimes.com/2026/02/18/opinion/ai-software.html
1•bandwitch•1m ago•0 comments

OxiDB embeddable(iOS, macOS, Linux, Win) document database written in Rust

https://github.com/parisxmas/OxiDB
1•mrtksn•1m ago•0 comments

SunnyFlight – Find cheap weekend flights to sunny destinations

https://sunnyflight.com/
1•coderai•4m ago•1 comments

Show HN: ReARM – Release-Level Supply Chain Evidence Platform

https://rearmhq.com/
1•taleodor•6m ago•0 comments

The new Gemini-based Google Translate can be hacked with simple words

https://the-decoder.com/the-new-gemini-based-google-translate-can-be-hacked-with-simple-words/
2•amai•8m ago•1 comments

Against Taste

https://twitter.com/WillManidis/status/2023866928608002183
1•mellosouls•8m ago•0 comments

Parts of Antarctica May Have Crossed a Tipping Point

https://umbrellatoday.app/blog/202602-antarctica-tipping-point
2•s-xyz•8m ago•1 comments

Lakital vs. Uxntal

https://wastingmoves.com/lakital_vs_uxntal.html
1•tosh•11m ago•0 comments

One Page of Async Rust

https://dotat.at/@/2026-02-16-async.html
1•ingve•12m ago•0 comments

Upright: An Open Source Synthetic Monitoring System

https://dev.37signals.com/introducing-upright/
1•tzury•13m ago•0 comments

Billionaires Gone Wild

https://paulkrugman.substack.com/p/billionaires-gone-wild
4•rbanffy•13m ago•0 comments

First genome sequence of Psychrobacter SC65A.3 preserved in 5K-year-old cave ice

https://www.frontiersin.org/journals/microbiology/articles/10.3389/fmicb.2025.1713017/full
1•thunderbong•14m ago•0 comments

GLM-5: From Vibe Coding to Agentic Engineering

https://huggingface.co/papers/2602.15763
3•rvz•16m ago•0 comments

Semantic closure: why compilers know when they are right and LLMs do not

https://sderosiaux.substack.com/p/semantic-closure-why-compilers-know
4•chtefi•19m ago•0 comments

Carelessness versus Craftsmanship in Cryptography

https://blog.trailofbits.com/2026/02/18/carelessness-versus-craftsmanship-in-cryptography/
2•ingve•20m ago•0 comments

The Future of Context Engineering

https://telemetryagent.dev/blog/future-of-context-engineering
3•martvdjagt•21m ago•0 comments

Apollo 1

https://en.wikipedia.org/wiki/Apollo_1
2•simonebrunozzi•21m ago•0 comments

ICE tripled its reliance on Microsoft in last six months, leaked files reveal

https://www.972mag.com/ice-microsoft-azure-leaked-files/
3•Qem•22m ago•0 comments

Copy-left open-source license for AI code use

3•program_whiz•28m ago•0 comments

Are We Becoming Architects or Butlers to LLMs?

http://muratbuffalo.blogspot.com/2026/02/butlers-or-architects.html
4•cstever•28m ago•0 comments

Show HN: Resonant – Local-only speech-to-text for macOS (no cloud)

https://www.onresonant.com/
2•sourcetms•30m ago•0 comments

Unicode Homoglyph Path Injection in Chromium Native Messaging

https://treechain.ai/white-papers/unicode-homoglyph-path-injection-in-chromium-native-messaging/
1•treechain•31m ago•0 comments

BrowserPod: Universal in-browser sandbox powered by WASM (starting with Node.js)

https://labs.leaningtech.com/blog/browserpod-10
3•apignotti•32m ago•1 comments

Effects of a Modular Sleep System on Sleep Quality and Physiological Stability

https://www.mdpi.com/2076-3417/16/3/1194
1•PaulHoule•33m ago•0 comments

Show HN: SharpSkill – I Gamified a LeetCode-like tool to crush Tech Interviews

https://sharpskill.fr/en
1•CocoZozo•34m ago•0 comments

Tell HN: We analyzed our dev time.80% is still infrastructure'setup',notfeatures

3•thesssaism•34m ago•0 comments

Show HN: Rebrain.gg – Doom learn, don't doom scroll

4•FailMore•36m ago•0 comments

We're Measuring DataCenter Sustainability Wrong, Metrics Are 30% of Emissions

https://spectrum.ieee.org/data-center-sustainability-metrics
2•oldnetguy•37m ago•0 comments

Tell HN: Technical debt isn't messy code, it's architectural compound interest

2•thesssaism•37m ago•0 comments

Microsoft says bug causes Copilot to summarize confidential emails

https://www.bleepingcomputer.com/news/microsoft/microsoft-says-bug-causes-copilot-to-summarize-co...
3•tablets•38m ago•1 comments