Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•1y ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extensible across domains (math, retrieval, reasoning, tool use, etc.).

What's in Zeno for now?
- Auditable, stateless reward functions for Python code: docstrings, ruff linting, type hints, recursion, and more
- Works directly with Hugging Face's TRL or any RL loop: plug reward functions in as needed
- MIT licensed and minimal
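To make "stateless, deterministic reward function" concrete, here is a minimal sketch of a docstring reward. It is not Zeno's actual API: the name `docstring_reward` and the scoring rule (fraction of functions that have a docstring) are assumptions for illustration only. The `(completions, **kwargs) -> list[float]` shape follows the convention TRL's GRPOTrainer uses for custom reward functions when completions are plain strings.

```python
import ast


def docstring_reward(completions, **kwargs):
    """Score each completion by the fraction of its functions that have a docstring.

    Hypothetical example, not taken from Zeno. Unparseable code and code
    without any function definitions both score 0.0.
    """
    rewards = []
    for text in completions:
        try:
            tree = ast.parse(text)
        except (SyntaxError, ValueError):
            # Code that does not parse earns no reward.
            rewards.append(0.0)
            continue

        funcs = [
            node
            for node in ast.walk(tree)
            if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
        ]
        if not funcs:
            rewards.append(0.0)
            continue

        documented = sum(1 for f in funcs if ast.get_docstring(f) is not None)
        rewards.append(documented / len(funcs))
    return rewards
```

The function keeps no state and depends only on its input, so the same completion always gets the same score and a reward can be audited by re-running it on the logged text. In a TRL setup, a callable like this would be passed in the `reward_funcs` list alongside other checks.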

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning, and agentic behaviors are on the to-do list.

Repo: https://github.com/think-a-tron/zeno

Docs and more details are in the README.

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.
