frontpage.
newsnewestaskshowjobs

Open Source @Github

fp.

Open in hackernews

Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•1y ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool-use, etc.)

What's in Zeno for now? - Auditable, stateless reward functions for Python code - docstrings, ruff linting, type hints, recursion, and more - Works directly with Huggingface's TRL or any RL loop - plug reward functions in as needed. - MIT licensed and minimal.

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning and agentic behaviors are in todo.

Repo: https://github.com/think-a-tron/zeno

Docs and more details in the README

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.

Show HN: The TypeScript Semantic Layer for ClickHouse

https://github.com/hypequery/hypequery
1•lureilly1•2m ago•0 comments

Bulk image and file compressor & converter.

https://squeezes.vercel.app
1•marpe•5m ago•2 comments

NodeQuest – a browser puzzle game that teaches n8n workflow automation

https://nodequest.io
1•pusucip•10m ago•0 comments

End-to-end model that listens, sees, thinks and responds on video in real time

https://twitter.com/minchoi/status/2070347790115565792
1•dawkins•11m ago•1 comments

A debugging story: Learning debugging principles from a production outage

https://www.iinuwa.xyz/blog/a-debugging-story/
1•dorianniemiec•12m ago•0 comments

Browser agent that reads a page in ~2k tokens, not ~180k

https://www.npmjs.com/package/pixelpi
2•josharsh•14m ago•0 comments

Badge of Infamy, by Lester del Rey (1957) [pdf]

https://ebook-mecca.com/online/Badge%20of%20Infamy%20-%20Lester%20Del%20Rey.pdf
1•joebig•17m ago•0 comments

DeepSeek open-sources inference optimizations with 60–85% faster generation [pdf]

https://github.com/deepseek-ai/DeepSpec/blob/main/DSpark_paper.pdf
5•aurenvale•20m ago•0 comments

The White House's post-quantum executive order is an important milestone

https://blog.cloudflare.com/post-quantum-eo-2026/
1•taubek•21m ago•0 comments

Beer CSS – Build material design in record time

https://www.beercss.com
2•Seb-C•33m ago•0 comments

Show HN: Fognitix – an autonomous desktop browser that drives itself

https://www.fognitix.com/
1•fognitix•34m ago•0 comments

Show HN: Waitlio – The waitlist software for your next launch

https://waitlio.com
1•543310•35m ago•0 comments

An extension of the traditional PDF standard

https://github.com/AlexandrosGounis/pdfx
1•SVI•35m ago•0 comments

JEP Draft: Deprecate the macOS/X64 Port for Removal

https://openjdk.org/jeps/8386091
1•Tomte•38m ago•0 comments

NYC's last horse track, Aqueduct, ending live races

https://apnews.com/article/nyc-horse-racing-track-closing-aqueduct-c52bac4978ba99d2c921408ffdf05f13
1•geox•48m ago•0 comments

What Everyone Gets Wrong About AI and Learning [video]

https://www.youtube.com/watch?v=0xS68sl2D70
1•thunderbong•51m ago•0 comments

How much does a token cost?

https://www.erikjs.com/experiences/llm-token-counter
1•csgod•54m ago•0 comments

A New Fossil Discovery Just Rewrote 150 Years of Evolutionary Theory

https://www.404media.co/a-new-fossil-discovery-just-rewrote-150-years-of-evolutionary-theory/
1•susiecambria•1h ago•0 comments

The "Digital Cash Envelope": Making Crypto Gifting as Easy as Cash

https://www.indiehackers.com/post/the-digital-cash-envelope-making-crypto-gifting-as-easy-as-cash...
1•mybucks_online•1h ago•0 comments

Eyes Do More Than See (1965) – Isaac Asimov

http://www-graphics.stanford.edu/~tolis/toli/other/eyes.html
1•kegenaar•1h ago•0 comments

Fine-tuned a model on Advaita Vedanta text

https://huggingface.co/aaravshirpurkar/turiya-model
2•aaravshirpurkar•1h ago•1 comments

Plane slams into Beijing's tallest building, sends debris raining down [video]

https://www.youtube.com/watch?v=YNJYR_HE8uo
3•Imustaskforhelp•1h ago•0 comments

Search SDK – Integrate web search into agents

https://search-sdk.dev/
1•haxzie•1h ago•0 comments

Sequence Modeling with CTC

https://distill.pub/2017/ctc/
1•diginova•1h ago•0 comments

WebSite Grader and Analytics for Small Businesses

https://stackra.app/
1•LBeck84•1h ago•0 comments

Show HN: PhoneCode: Local-First ADE Running Natively on Android

https://github.com/dttdrv/phonecode
1•dttdrv•1h ago•0 comments

A model-free runtime that holds photonic/quantum hardware steady under drift

https://compute.neophotonics.ca/
1•quantumbum•1h ago•0 comments

Higher rate limits on the Claude API

https://platform.claude.com/docs/en/api/rate-limits
1•thedebuglife•1h ago•0 comments

Show HN: Cyclearchive.com – search vintage cycling magazines

https://cyclearchive.com/search/
1•alastairr•1h ago•0 comments

How to evaluate multimodal VLMs for your video use case

https://labs.videodb.io/research/how-to-evaluate-multimodal-vlms-for-your-video-use-case
1•notTechy•1h ago•0 comments