frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•8mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Show HN: Figma MCP Without Tool Explosion – Let AI Execute JavaScript

https://github.com/youware-labs/figma-pilot
1•marv1nnnnn•34s ago•0 comments

Playing Board Games with Deep Convolutional Neural Network on 8bit Motorola 6809

https://ipsj.ixsq.nii.ac.jp/records/229345
1•mci•1m ago•0 comments

Reverse Engineering Reality

https://oth-book.lovable.app
1•con•3m ago•0 comments

We Studied 150 Developers Using AI (Here's What's Changed) [video]

https://www.youtube.com/watch?v=b9EbCb5A408
1•mpweiher•3m ago•0 comments

Show HN: Changeflow – Giving up on pixel diffs after 10 years of false positives

https://changeflow.com/
1•stevewillbe•4m ago•0 comments

Disenshittification Nation

https://pluralistic.net/2026/01/29/post-american-canada/
2•hn_acker•4m ago•0 comments

Claude Code Daily Benchmarks for Degradation Tracking

https://marginlab.ai/trackers/claude-code/
6•qwesr123•5m ago•1 comments

Google Disrupts Ipidea Proxy Network

https://www.securityweek.com/google-disrupts-ipidea-proxy-network/
1•alephnerd•5m ago•1 comments

Hunting AitM Phishing Infrastructure Using Certificate Transparency

https://j027.net/hunting-evilginx/
1•j027•6m ago•0 comments

Show HN: I built a pSEO game wiki with Astro and fixed Schema validation errors

https://gamestrategyhub.com/
1•causalzap•6m ago•0 comments

Frigate NVR Critical RCE Vulnerability Severity

1•shadybraden•7m ago•0 comments

Show HN: Planet Cert – Practice Tests for AWS, Cisco, and AI Certs

https://planetcert.com/
1•Alex_Weinberg•7m ago•0 comments

Terry Pratchett's novels may have pointed to his dementia 10y before diagnosis

https://theconversation.com/terry-pratchetts-novels-may-have-held-clues-to-his-dementia-a-decade-...
1•kareemm•9m ago•0 comments

Git protects you [audio]

https://www.buzzsprout.com/2469780/episodes/18555806-20-git-protects-you
1•jammcq•10m ago•0 comments

Text2Vid

https://text2vid.org
1•zhouhua•10m ago•0 comments

Goodhart's Law: When a Measure Becomes a Target, It Loses Its Value

https://read.perspectiveship.com/p/the-cobra-effect
1•birdculture•11m ago•0 comments

Microbubble-induced erosion releases micro- and nanoplastics into water

https://www.science.org/doi/10.1126/sciadv.aea4729
1•PaulHoule•12m ago•0 comments

What is a loot box and why is there one at The Pentagon?

https://taskandpurpose.com/news/pentagon-lucky-box/
2•cainxinth•12m ago•0 comments

Show HN: LLM-assisted research paper reproduction and understanding

https://zllmplayground.com/transend
1•bladecd•13m ago•0 comments

What does AI-assisted development look like in a big open-source project?

https://www.getunleash.io/blog/ai-assisted-development-open-source-project
4•alexcasalboni•13m ago•1 comments

Data Science Weekly – Issue 636

https://datascienceweekly.substack.com/p/data-science-weekly-issue-636
1•sebg•13m ago•0 comments

"Remove Before Flight" tags bought on eBay in 2010 were from Challenger

https://arstechnica.com/space/2026/01/attached-to-tragedy-tracing-challenger-remove-before-flight...
1•chha•14m ago•0 comments

Show HN: Cloudness – An open-source tool to deploy and run apps on Kubernetes

2•Karthik_N•15m ago•0 comments

Show HN: BarrierX – AI that finds which lost deals are worth re-engaging now

https://barrierx.ai/
1•IAMsterdam•16m ago•0 comments

Variable Fonts Workshop

https://variablefonts.gdwithgd.com/
2•noreplica•17m ago•0 comments

The Second Great Error Model Convergence

https://matklad.github.io/2025/12/29/second-error-model-convergence.html
1•surprisetalk•18m ago•0 comments

China Is Erasing Signs of Pessimism and Despair

https://www.nytimes.com/2025/10/08/world/asia/china-censorship-pessimism-despair.html
1•surprisetalk•18m ago•0 comments

Proving (literally) that ChatGPT isn't conscious

https://www.theintrinsicperspective.com/p/proving-literally-that-chatgpt-isnt
1•surprisetalk•19m ago•0 comments

Show HN: Candid – Your front-row seat to politics

https://www.candidmedia.ai/
1•ericatcandid•19m ago•0 comments

Photoshop is overkill for most workflows – so I built a browser-based editor

https://www.picify.co/editor
2•xohails•19m ago•0 comments