Sutton and Barto book implementation

https://github.com/ivanbelenky/RL
80•ivanbelenky•9mo ago

Comments

sage76•9mo ago
Damn this is a lot of work. Bookmarked.
ivanbelenky•9mo ago
It has not been stress tested or optimized, so tread lightly, and thanks a lot for appreciating the work.
mark_l_watson•9mo ago
Very nice, thanks for doing this.

I have experimented a lot with the "official" Common Lisp and Python examples for the Sutton/Barto RL book, and I will enjoy your implementations also!

For reference, original examples in Lisp and Python: http://incompleteideas.net/book/code/code2nd.html

A bunch of implementations with all kinds of use cases (e.g., using OpenAI Gym, etc.):

Here are some resources with code examples and implementations related to the Sutton and Barto "Reinforcement Learning: An Introduction" book:

Code for Sutton & Barto Book: Reinforcement Learning: An Introduction: The official website for the book provides links to various software and re-implementations in different languages, including Python, Julia, and Lisp. This is a great starting point to find code directly associated with the book's examples and exercises.
Link: http://incompleteideas.net/book/code/code2nd.html

jovsa/rl-examples-sutton-and-barto-book on GitHub: This repository offers Python implementations of examples from the book, organized by chapter. It includes code for figures and examples from various chapters, covering topics like Gridworld, Blackjack, and the Mountain Car task.
Link: https://github.com/jovsa/rl-examples-sutton-and-barto-book

kamenbliznashki/sutton_barto on GitHub: This repository provides Python implementations of RL algorithms for the examples and figures in the Sutton and Barto book. It covers a wide range of topics from multi-armed bandits to policy gradient methods.
Link: https://github.com/kamenbliznashki/sutton_barto

boldyshev/sutton on GitHub: This repository contains Python implementations of example experiments (figures) and programming exercises from the second edition of the book. Chapters are added as the author studies the book, making it a potentially growing resource.
Link: https://github.com/boldyshev/sutton

AntonioSerrano/Implementation-of-RL-algorithms-from-Sutton-and-Barto-2018 on GitHub: This repository offers implementations in Python using OpenAI Gym and Tensorflow, covering exercises and solutions to complement the book and David Silver's RL course. It includes various algorithms like Dynamic Programming, Monte Carlo, Temporal Difference, and Policy Gradient methods.
Link: https://github.com/AntonioSerrano/Implementation-of-RL-algor...

ivanbelenky•9mo ago
My code is most probably not as good as anything above. I did this as exploration while studying. No linter, no type checker, grug engineer mentality. But thanks nevertheless for the comment :)
mark_l_watson•9mo ago
well, it looks good to me.
mark_l_watson•9mo ago
I want to add a second comment:

Professors White & White (a husband and wife team) have a very good set of courses on RL on Coursera:

https://www.coursera.org/specializations/reinforcement-learn...

ivanbelenky•9mo ago
Lovely!
AndrewKemendo•9mo ago
Let me know if anyone fills out the true online Sarsa section with a working example on a robot.
vlad•9mo ago
The authors were a professor and a grad student at UMass Amherst, and are the current winners of the Turing Award.

https://www.cics.umass.edu/

https://www.nsf.gov/news/ai-pioneers-andrew-barto-richard-su...

ultrasounder•9mo ago
Super helpful while I come up to speed with this field in general. I'm currently taking XCS234 (RL @ Stanford Online), and this book is referenced for everything.