Show HN: Zeno – A framework for verifiable RL rewards (code, math, and more)

https://github.com/Think-a-Tron/zeno
2•Sai_Praneeth•6mo ago
With TRL, it's now straightforward to RL-finetune LLMs, but picking good reward functions is still the weakest link.

Zeno is an open-source toolkit for verifiable, deterministic reward functions for RL on LLMs.

While the initial release focuses on Python code generation, the goal is broader: make RL reward design for LLMs transparent, modular, and extendable across domains (math, retrieval, reasoning, tool use, etc.).

What's in Zeno for now?
- Auditable, stateless reward functions for Python code: docstrings, ruff linting, type hints, recursion, and more.
- Works directly with Hugging Face's TRL or any RL loop: plug reward functions in as needed.
- MIT licensed and minimal.
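To make "stateless, deterministic reward function" concrete, here is a minimal sketch of what a docstring reward could look like. It is not taken from Zeno: the name `docstring_reward` and the scoring rule (fraction of functions that carry a docstring) are assumptions for illustration, and it assumes completions arrive as plain strings of Python source.

```python
import ast


def docstring_reward(completions, **kwargs):
    """Score each completion by the fraction of its functions that have a docstring.

    Hypothetical example, not Zeno's API. Stateless and deterministic:
    the same completion always gets the same score.
    """
    rewards = []
    for text in completions:
        try:
            tree = ast.parse(text)
        except (SyntaxError, ValueError):
            # Code that does not parse earns no reward.
            rewards.append(0.0)
            continue

        # Collect every function definition, including nested and async ones.
        funcs = [
            node
            for node in ast.walk(tree)
            if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
        ]
        if not funcs:
            # Nothing to document, so nothing to reward (an assumed design choice).
            rewards.append(0.0)
            continue

        documented = sum(1 for f in funcs if ast.get_docstring(f))
        rewards.append(documented / len(funcs))
    return rewards
```

A function of this shape (a list of completions in, a list of floats out, extra keyword arguments ignored) should be usable as a custom reward callable in TRL's `GRPOTrainer` via `reward_funcs`, or it can be called directly from a hand-written RL loop.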

Roadmap: Python code is just the starting point. Extensions for math problem solving, planning, and agentic behaviors are on the to-do list.

Repo: https://github.com/think-a-tron/zeno

Docs and more details are in the README.

Comments, critiques, and real-world use cases encouraged, especially if you want to push beyond code.
