Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•7mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool use, etc.).

What's in Zeno for now?
- Auditable, stateless reward functions for Python code: docstrings, ruff linting, type hints, recursion, and more.
- Works directly with Hugging Face's TRL or any RL loop; plug reward functions in as needed.
- MIT licensed and minimal.
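To make "stateless, deterministic reward function" concrete, here is a minimal sketch of what one could look like. It is not taken from Zeno; the function names `docstring_and_hints_reward` and `_fully_annotated` are hypothetical, and the scoring rule (half credit for docstrings, half for type hints) is an assumption chosen for illustration. The signature follows the shape that TRL's `GRPOTrainer` accepts for custom reward functions (a list of completions in, a list of floats out), assuming completions arrive as plain strings.

```python
import ast


def _fully_annotated(func):
    """Return True if every named parameter and the return value are annotated.

    Hypothetical helper: *args/**kwargs are ignored, and an unannotated
    `self` counts as missing, which a real implementation may want to relax.
    """
    args = func.args.posonlyargs + func.args.args + func.args.kwonlyargs
    return func.returns is not None and all(a.annotation is not None for a in args)


def docstring_and_hints_reward(completions, **kwargs):
    """Score each completion in [0, 1] by docstring and type-hint coverage.

    The score depends only on the completion text, so repeated calls on the
    same input always return the same value.
    """
    rewards = []
    for text in completions:
        try:
            tree = ast.parse(text)
        except (SyntaxError, ValueError):
            rewards.append(0.0)  # code that does not parse earns nothing
            continue
        funcs = [
            node
            for node in ast.walk(tree)
            if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
        ]
        if not funcs:
            rewards.append(0.0)  # no functions defined, nothing to score
            continue
        documented = sum(ast.get_docstring(f) is not None for f in funcs)
        annotated = sum(_fully_annotated(f) for f in funcs)
        rewards.append((documented + annotated) / (2 * len(funcs)))
    return rewards
```

Because the function only parses the text and never executes it, the score is cheap to audit: anyone can rerun it on a logged completion and get the same number. In a TRL setup, a callable of this shape would typically be supplied through the trainer's `reward_funcs` argument.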

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning, and agentic behaviors are on the to-do list.

Repo: https://github.com/think-a-tron/zeno

Docs and more details are in the README.

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Steve Jobs introduced MacBook Pro 20 years ago at Macworld San Francisco 2006

https://www.youtube.com/watch?v=I6JWqllbhXE
1•7777777phil•4m ago•0 comments

Employees Are Using AI

https://guardianhealth.dev/blog/employees-already-using-ai/
1•rndkeithw•9m ago•0 comments

Show HN: I made 25 tech predictions and mass-published them

1•JoseOSAF•11m ago•0 comments

Show HN: Endless: Run Your Own Social Network

https://github.com/XS-Xspert-Software/Social-Media
1•thegoodduck•22m ago•0 comments

AI Won't Kill Open Source – It Will Amplify It

https://petabridge.com/blog/ai-wont-kill-open-source/
1•alexzeitler•24m ago•0 comments

Sakana AI Agent Wins AtCoder Heuristic Contest (First AI to Place First)

https://sakana.ai/ahc058/
1•davidst•29m ago•0 comments

Extreme Recall: Which Politicians Come to Mind?

https://www.tandfonline.com/doi/full/10.1080/17457289.2025.2474411
1•neehao•31m ago•0 comments

Carjackers swipe biometric Merc, plus owner's finger

https://www.theregister.com/2005/04/04/fingerprint_merc_chop/
3•mysterypie•34m ago•1 comments

The Three AI Bets

https://alearningaday.blog/2026/01/10/the-three-ai-bets/
1•herbertl•34m ago•0 comments

Generating real-time subtitles from YouTube audio (even when captions fail)

https://drive.google.com/drive/folders/1I_z6HjGCVUwgYs1UXlB7sg4nj8Rwfory?usp=sharing
1•techspecs•38m ago•1 comments

Lafferty and Chesterton: Seeing "Through Other Eyes" into "The Weirdest World"

https://bernardus66.substack.com/p/ra-lafferty-and-gk-chesterton-seeing
1•stacktrust•43m ago•0 comments

Starmer rallies international support to take on Musk

https://www.telegraph.co.uk/politics/2026/01/10/musk-accuses-labour-of-being-fascist/
9•beejiu•45m ago•1 comments

Money Is a Technology

https://blog.mempko.com/money-is-a-technology/
3•mempko•48m ago•0 comments

Global ripple effects of corporate tax reforms

https://www.nber.org/papers/w34627
2•hhs•53m ago•0 comments

What are you using today to monitor uptime for small or personal projects?

https://updown.fly.dev/
3•ejncman•1h ago•1 comments

Get Salty [video]

https://www.youtube.com/watch?v=NfxHiT-0inM
3•marysminefnuf•1h ago•1 comments

CES 2026: These 32 Tech Products Made Some of the Biggest Impressions

https://www.cnet.com/pictures/ces-2026-overall-products/
2•SilverElfin•1h ago•0 comments

Carina Hong of Axiom Math at the Neuron

https://www.youtube.com/watch?v=xldMXTPGMGI
1•rasengan0•1h ago•0 comments

Show HN: GlyphLang – An AI-first programming language

10•goose0004•1h ago•6 comments

Show HN: TheTabber – Create, repurpose, and post across 9+ platforms

https://thetabber.com/
1•dibasdauliya•1h ago•0 comments

Show HN: Librario, a book metadata API that aggregates G Books, ISBNDB, and more

19•jamesponddotco•1h ago•6 comments

Cocopilot: Self-Updating Repository

https://acbart.github.io/cocopilot/
2•acbart•1h ago•1 comments

Disaggregated machine learning via in-physics computing at radio frequency

https://www.science.org/doi/10.1126/sciadv.adz0817
2•gnabgib•1h ago•0 comments

Polymaths: An Argument for Analogies

https://nonzerosum.games/polymaths1.html
1•samixg•1h ago•1 comments

The "Good Will Hunting" Problem in Generative AI

https://medium.com/@chipmunkworks/ai-the-will-hunting-of-our-age-59952c1744f1
2•treelover•1h ago•3 comments

Show HN: Embex – 9K organic downloads in 2 weeks with zero marketing

https://www.bridgerust.dev/embex/introduction/
1•mimchak•1h ago•0 comments

MCP Joins the Linux Foundation

https://github.blog/open-source/maintainers/mcp-joins-the-linux-foundation-what-this-means-for-de...
1•raju•1h ago•1 comments

Private equity firms acquired more than 500 autism centers in past decade: study

https://www.brown.edu/news/2026-01-07/private-equity-autism-centers
80•hhs•1h ago•33 comments

Mapping and editing learned functional geometry inside a CNN (with controls)

https://github.com/boglim1984/functional-geometry-hebbian-manifold
1•boglim1984•1h ago•1 comments

CFT: "sqawk" 0.8.0 – optimized SQL Awk utility with Rust's sqlparser

https://github.com/jgarzik/sqawk/tree/v0.8.0
1•jgarzik•1h ago•1 comments