frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•1y ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

The Cost of Killing 'Silly Science'

https://www.profgmedia.com/p/the-cost-of-killing-silly-science
1•phtrivier•32s ago•0 comments

Wall Street's undignified SpaceX mania

https://economist.com/finance-and-economics/2026/06/09/wall-streets-undignified-spacex-mania
1•andsoitis•1m ago•0 comments

UniFi Physical Security Expansion: vape detection

https://blog.ui.com/article/unifi-protect-campus-security-ready
1•janandonly•2m ago•0 comments

Still Out of Control

https://kevinkelly.substack.com/p/still-out-of-control
1•cainxinth•3m ago•0 comments

A Human in Control

https://daniel.haxx.se/blog/2026/06/10/a-human-in-control/
1•jicea•5m ago•0 comments

E-car drivers frustrated: VW interface for third-party providers gone

https://www.heise.de/en/news/E-car-drivers-frustrated-VW-interface-for-third-party-providers-gone...
1•doener•7m ago•0 comments

Show HN: Skillzmouse: Distributed skills and scripts for agentic coding

https://bitbucket.org/workingsoftware/skillzmouse/src/main/README.md
1•dools•10m ago•0 comments

Encrypted chats expose Kosovar organised crime network

https://www.europol.europa.eu/media-press/newsroom/news/encrypted-chats-expose-kosovar-organised-...
1•jruohonen•11m ago•0 comments

C++26: Cleaning up string literals

https://www.sandordargo.com/blog/2026/06/10/cpp26-string-literals-cleaned-up
1•ibobev•12m ago•0 comments

Sovereign

https://lemire.me/blog/2026/06/09/22693/
1•ibobev•13m ago•0 comments

Show HN: AgentHUD – Live TUI and daily digest for parallel Claude Code sessions

https://github.com/neochoon/agenthud
1•neochoon•13m ago•0 comments

Why your asthma inhaler is so expensive (in the US)

https://educatedguesswork.org/posts/asthma-inhaler-pricing/
1•ibobev•13m ago•0 comments

Adet tells you how people do things here

https://merkoba.com/adet.html
1•madprops•14m ago•0 comments

Einstein Was Wrong? Why Dark Energy Just Started Pulling Space-Time Backward

https://www.youtube.com/watch?v=LPEcLdc_ygA
1•Asheed•15m ago•0 comments

The Test Suite Was the Incident

https://christophermeiklejohn.com/ai/zabriskie/agents/reliability/testing/2026/06/10/the-test-sui...
1•jruohonen•15m ago•0 comments

Show HN: Apodex-1.0-H – Beats Claude-Opus-4.7 on deep research (90.3 BrowseComp)

https://www.apodex.ai/
1•wuqiaocauc•15m ago•1 comments

It's not enough to have better ideals

https://werd.io/its-not-enough-to-have-better-ideals/
2•benwerd•15m ago•0 comments

It's (Still) All About Boundaries

https://bounded.dev/blog/its-still-all-about-boundaries/
1•RyeCombinator•16m ago•0 comments

Auto complete tickets using Claude Code loop on telegram with linear MCP

https://niptao.com/blog/an-engineer-you-manage-from-a-group-chat/
1•singlas•17m ago•1 comments

Apple's Reframe: A Moment That Never Was

https://tty.mansuri.me/posts/~apple-reframe-a-moment-that-never-was/
1•princetman•18m ago•0 comments

I built a 40-year backtester to test if leveraged ETF and gold beats VOO

https://wealthquestlab.com
1•alexyurepercept•20m ago•1 comments

We went multi-region then undid it

https://useautumn.com/blog/how-we-built-a-multi-region-architecture-and-why-we-went-back
1•ayushrodrigues•21m ago•0 comments

Show HN: Publora – One API/MCP for AI agents to post across 10 social networks

https://publora.com
1•sbulaev•24m ago•0 comments

Cursor users must consent to data collection in order to use Fable 5

https://cursor.com/docs/models/claude-fable-5
2•__natty__•27m ago•1 comments

DMR effect on drag reduction of a streamlined body

https://www.cambridge.org/core/journals/journal-of-fluid-mechanics/article/dmr-effect-on-drag-red...
1•deadbishop•29m ago•0 comments

Show HN: CLI for self-hosted Invoice Ninja

https://github.com/DrDBanner/inmanage
2•ycomrsys•30m ago•1 comments

They're Made Out of Compute

https://magzimof.com/made-out-of-compute/
2•shaimagz•33m ago•0 comments

High-severity vulnerability in Linux caused by a single faulty character

https://arstechnica.com/security/2026/06/a-single-errant-character-in-the-linux-kernel-allows-att...
3•joozio•34m ago•0 comments

Is symbolic AI more relevant than ever?

https://www.heise.de/en/blog/Is-symbolic-AI-more-relevant-than-ever-11323023.html
2•goloroden•40m ago•0 comments

Can tech companies learn to love cheaper AI models?

https://techcrunch.com/2026/06/09/can-tech-companies-learn-to-love-cheaper-models/
3•parveshblogger•42m ago•0 comments