frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•1y ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Wario64 given DCMA takedown for posting public link to PSN release page

https://bsky.app/profile/d8i.com/post/3mneg2q3zwc26
1•indrora•46s ago•0 comments

Show HN: Keen Code – a minimal Go coding agent built by AI agents

https://github.com/mochow13/keen-code
1•mochow13•49s ago•0 comments

Project Solara - composing a new platform for agent-first devices

https://commandline.microsoft.com/project-solara-build-2026/
1•ChrisArchitect•2m ago•0 comments

The Ladder Paradox

https://www.desmos.com/notebook/qq6cjzpykh/view
2•thunderbong•4m ago•0 comments

Show HN: Tired of duct-taping access control into agent prompts. Here's the fix

https://github.com/yaodub/cast
1•zwigglers•4m ago•1 comments

Any Updates on AlphaChip?

https://en.wikipedia.org/wiki/AlphaChip_(controversy)
1•Marius77•5m ago•1 comments

Evolutionary Psychology Explains Why Dating Apps Failed [video]

https://www.youtube.com/watch?v=Z_-tar5ZZM4
1•mgh2•5m ago•0 comments

Handling Graphs with SQL/PGQ in PostgreSQL

https://www.cybertec-postgresql.com/en/handling-graphs-with-sql-pgq-in-postgresql/
1•mrkaye97•5m ago•0 comments

Budget 2026–27: Free Access to Australian Standards Confirmed

https://www.design.org.au/dianews/budget-202627-free-access-to-australian-standards-confirmed
1•mkj•6m ago•0 comments

Show HN: Aura, an LLM coding harness that dogfooded itself

https://github.com/CarpseDeam/Aura-IDE
1•ConfusedData89•6m ago•0 comments

Take Action: LAPD Removed Crime Location Data. Here's Why It Matters

https://blog.spotcrime.com/2026/06/take-action-lapd-removed-crime-location.html
1•apwheele•6m ago•0 comments

Brazil Banned Addictive Design. The Crucial Regulatory Choices Are Still Ahead

https://www.techpolicy.press/brazil-banned-addictive-design-the-crucial-regulatory-choices-are-st...
1•rbanffy•6m ago•0 comments

Impulse Space raises $500M as orbital maneuvering race heats up

https://arstechnica.com/space/2026/06/impulse-space-raises-500-million-as-orbital-maneuvering-rac...
1•rbanffy•7m ago•0 comments

A Manifesto Against AI Slop

https://diegoux.com/blog/ai-slop-manifesto/
2•diegof79•8m ago•0 comments

What's inside the trending "skills" repos for Claude Code

https://aisignals.heyneo.com/
1•gauravvij137•8m ago•1 comments

Microsoft's Project Solara is an Android OS designed for agents instead of apps

https://arstechnica.com/gadgets/2026/06/microsofts-project-solara-is-an-android-os-designed-for-a...
1•Brajeshwar•8m ago•0 comments

SnapToCode – Screenshot any UI and get clean Tailwind code

https://chromewebstore.google.com/detail/snaptocode/jpchamlmjfoccmkdoiaibbpgkidapcnk
1•adithagrawaal•8m ago•1 comments

Dating Net Worth

https://datingnetworth.com/
1•surprisetalk•9m ago•0 comments

Show HN: Health.md - Apple Health → Markdown

https://github.com/CodyBontecou/health-md
1•codybontecou•9m ago•0 comments

Who Is the Villain in Mars Sample Return?

https://mceglowski.substack.com/p/who-is-the-villain-in-mars-sample
1•calcifer•9m ago•0 comments

Three Good Interactive Explainers

https://unsung.aresluna.org/three-good-interactive-explainers/
1•speckx•9m ago•0 comments

Ex-girlfriend of former Google CEO Eric Schmidt ordered to pay him $10M

https://www.latimes.com/business/story/2026-06-02/ex-girlfriend-of-former-google-ceo-ordered-to-p...
2•1vuio0pswjnm7•10m ago•0 comments

"Populous: The Beginning" frame breakdown

https://github.com/peterderivaz/populous/blob/main/README.md
1•peterderivaz•12m ago•1 comments

EU Parliament to Ditch Google for European Alternative Qwant

https://www.euractiv.com/news/european-parliament-to-ditch-google-for-european-alternative/
5•raffael_de•13m ago•0 comments

Show HN: Overflow – compact your context window before it overflows

https://playoverflow.com/
1•jfu213•13m ago•1 comments

Skeg: A vector database that gives the RAM back to your model

https://github.com/skegdb/skeg
1•lupodevelop•17m ago•0 comments

Alexandr Wang's bid to revive Meta's AI edge

https://www.ft.com/content/d26faf9d-6ef0-4480-ab80-bac6a36fe173
2•merksittich•21m ago•0 comments

Show HN: OpenSOP, We got tired of agents lying to us, so we built them a harness

https://opensop.ai/
3•carlosamg•22m ago•1 comments

Akaganite, a managed Rust toolchain for licensed console developers (Xbox, PS 5)

https://akaganite.com
2•pjmlp•23m ago•0 comments

Project Sunrise Nears Reality as Qantas' Airbus A350-1000ULR Makes Maiden Flight

https://simpleflying.com/project-sunrise-qantas-a350-1000ulr-maiden-flight/
3•rbanffy•24m ago•0 comments