frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•7mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Show HN: A mobile-first React share sheet with native sharing

https://sharesheet.gwendall.com
2•ges•3m ago•0 comments

Ask HN: Tips for getting the ROM for an old speech synthesizer?

2•ctoth•3m ago•0 comments

Show HN: Karmic Tail Calculator – A Destiny Matrix Patterns

https://karmictail.net
1•lion__93332•4m ago•0 comments

Show HN: I forced Apple to admit a "Product Issue" using AI and CIA principles

https://medium.com/@ryu360i/when-authorization-breaches-availability-analyzing-the-27-2kb-icloud-...
1•ryuzaburo•4m ago•0 comments

PluriSnake gameplay [Sun Jan 11, 2025 puzzle] – Beta available [video]

https://www.youtube.com/watch?v=JAjd5HgbOhU
1•amichail•7m ago•1 comments

Ask HN: What was the best sci-fi book of 2025?

2•Erikun•8m ago•0 comments

I mapped out how debugging works during production incidents

https://nemorize.com/roadmaps/debugging-under-pressure
1•reverseblade2•8m ago•1 comments

Desperately Seeking Squircles (2018)

https://www.figma.com/blog/desperately-seeking-squircles/
2•kjeetgill•8m ago•0 comments

Show HN: AI Vibe Coding Hackathon

https://vibe.devpost.com
1•abdibrokhim•9m ago•0 comments

NCSA Mosaic 2.7, one of the first graphical web browsers

https://github.com/alandipert/ncsa-mosaic
1•stmw•12m ago•0 comments

guys why does armenian completely break Claude

https://twitter.com/dyushag/status/1993143599286886525
12•ag8•13m ago•3 comments

Systematically generating tests that would have caught Anthropic's top‑K bug

https://theorem.dev/blog/anthropic-bug-test/
2•jasongross•14m ago•0 comments

Sampling at negative temperature

https://cavendishlabs.org/blog/negative-temperature/
4•ag8•16m ago•0 comments

Show HN: Sunshine Optimist: Optimistic takes on daylight and sunset times

https://sunshineoptimist.com
1•willj•16m ago•0 comments

Worldview – persistent strategic context for Claude Code

https://www.extremeclarity.ai/worldview
1•faizanbhat•17m ago•1 comments

The Machinery of Terror

https://chrishedges.substack.com/p/the-machinery-of-terror
1•chmaynard•17m ago•0 comments

QR Spaces – One QR and custom domain to share all your links

3•iamgaazi•17m ago•2 comments

The Subtle Injury – Being pretty good

https://tevonsb.com/thoughts/subtle-injury/
2•tevon•18m ago•1 comments

From fragmented code to consistent output with AI rules

https://www.stromcapital.fi/blog/cursor-rules
1•ronistrom•19m ago•0 comments

Why (We Don't Need To?) Care About Debt-to-GDP?

https://papers.ssrn.com/sol3/papers.cfm?abstract_id=5271557
1•neehao•20m ago•0 comments

Show HN: A MCP for controlling terminal UI apps built with bubbletea and ratatui

https://github.com/michaellee8/mcp-tui-server
1•michaellee8•22m ago•0 comments

Green Waste: Inefficient Allocation of Green Subsidies

https://papers.ssrn.com/sol3/papers.cfm?abstract_id=6048714
1•neehao•22m ago•0 comments

Mississippi Transformed Its Schools from Worst to Best

https://www.nytimes.com/2026/01/11/us/mississippi-schools-transformation.html
1•ghaff•24m ago•1 comments

Exponential growth continued – cargo-semver-checks 2025 Year in Review

https://predr.ag/blog/cargo-semver-checks-2025-year-in-review/
1•todsacerdoti•25m ago•0 comments

LLMs – Part 2: Order Matters – Positional Encoding

https://vasupasupuleti.substack.com/p/llms-part-2-order-matters-positional
1•vpasupuleti10•31m ago•1 comments

LLMs – Part 1: Tokenization and Embeddings

https://vasupasupuleti.substack.com/p/llms-part-1-tokenization-and-embeddings
1•vpasupuleti10•31m ago•1 comments

AI's Bottleneck Isn't Models or Tools, It's Security

https://zkorman.com/posts/ai-bottleneck-is-security/12
1•chillax•33m ago•1 comments

Keeping 20,000 GPUs Healthy

https://modal.com/blog/gpu-health
1•susam•34m ago•0 comments

Canada's Scaling Problem Isn't Compute, It's Coastlines

https://zeitgeistml.substack.com/p/canadas-scaling-problem-isnt-compute
5•eh_tk•34m ago•1 comments

The Curious Case of Stack Pivot Detection

https://seclists.org/oss-sec/2026/q1/48
1•todsacerdoti•35m ago•0 comments