frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Hamilton-Jacobi-Bellman Equation: Reinforcement Learning and Diffusion Models

https://dani2442.github.io/posts/continuous-rl/
24•sebzuddas•1h ago

Comments

measurablefunc•1h ago
It's not clear or obvious why continuous semantics should be applicable on a digital computer. This might seem like nitpicking but it's not, there is a fundamental issue that is always swept under the rug in these kinds of analysis which is about reconciling finitary arithmetic over bit strings & the analytical equations which only work w/ infinite precision over the real or complex numbers as they are usually defined (equivalence classes of cauchy sequences or dedekind cuts).

There are no dedekind cuts or cauchy sequences on digital computers so the fact that the analytical equations map to algorithms at all is very non-obvious.

jampekka•52m ago
Continuous formulations are used with digital computers all the time. Limited precision of floats sometimes causes numerical instability for some algorithms, but usually these are fixable with different (sometimes less efficient) implementations.

Discretizing e.g. time or space is perhaps a bigger issue, but the issues are usually well understood and mitigated by e.g. advanced numerical integration schemes, discrete-continuous formulations or just cranking up the discretization resolution.

Analytical tools for discrete formulations are usually a lot less developed and don't as easily admit closed-form solutions.

phreeza•48m ago
Doesn't continuous time basically mean "this is what we expect for sufficiently small time steps"? Very similar to how one would for example take the first order Taylor dynamics and use them for "sufficiently small" perturbations from equilibrium. Is there any other magic to continuous time systems that one would not expect to be solved by sufficiently small time steps?
measurablefunc•19m ago
You should look into condition numbers & how that applies to numerical stability of discretized optimization. If you take a continuous formulation & naively discretize you might get lucky & get a convergent & stable implementation but more often than not you will end up w/ subtle bugs & instabilities for ill-conditioned initial conditions.
phreeza•8m ago
I understand that much, but it seems like "your naive timestep may need to be smaller than you think or you need to do some extra work" rather than the more fundamental objection from OP?
measurablefunc•1m ago
The translation from continuous to discrete is not automatic. There is a missing verification in the linked analysis. The mapping must be verified for stability for the proper class of initial/boundary conditions. Increasing the resolution from 64 bit floata to 128 bit floata doesn't automatically give you a stable discretized optimizer from a continuous formulation.
nareyko•55m ago
One interesting connection is that many production AI systems don't explicitly implement RL frameworks, but still behave like RL systems.

You still have: state -> user context action -> model output reward -> engagement or success metric

Once that loop exists, optimization dynamics start to look very similar.

Cloudly•53m ago
Ever since the control bug bit me in my EE undergrad years I am happy to see how useful the knowledge remains. Of course the underlying math of optimization remains general but the direct applications of control theory made it much more appetizing for me to struggle through.

The curious case of retro demo scene graphics

https://www.datagubbe.se/aipixels/
149•zdw•3h ago•28 comments

I use excalidraw to manage my diagrams for my blog

https://blog.lysk.tech/excalidraw-frame-export/
40•mlysk•2h ago•18 comments

ChatGPT won't let you type until Cloudflare reads your React state

https://www.buchodi.com/chatgpt-wont-let-you-type-until-cloudflare-reads-your-react-state-i-decry...
625•alberto-m•13h ago•401 comments

Hamilton-Jacobi-Bellman Equation: Reinforcement Learning and Diffusion Models

https://dani2442.github.io/posts/continuous-rl/
24•sebzuddas•1h ago•8 comments

VHDL's Crown Jewel

https://www.sigasi.com/opinion/jan/vhdls-crown-jewel/
52•cokernel_hacker•4h ago•24 comments

Copilot edited an ad into my PR

https://notes.zachmanson.com/copilot-edited-an-ad-into-my-pr/
526•pavo-etc•5h ago•162 comments

Voyager 1 runs on 69 KB of memory and an 8-track tape recorder

https://techfixated.com/a-1977-time-capsule-voyager-1-runs-on-69-kb-of-memory-and-an-8-track-tape...
524•speckx•17h ago•201 comments

15 Years of Forking

https://www.waterfox.com/blog/15-years-of-forking/
164•MrAlex94•2d ago•23 comments

Coding Agents Could Make Free Software Matter Again

https://www.gjlondon.com/blog/ai-agents-could-make-free-software-matter-again/
174•rogueleaderr•11h ago•167 comments

C++26 is done: ISO C++ standards meeting Trip Report

https://herbsutter.com/2026/03/29/c26-is-done-trip-report-march-2026-iso-c-standards-meeting-lond...
228•pjmlp•15h ago•197 comments

Philly courts will ban all smart eyeglasses starting next week

https://www.inquirer.com/news/philadelphia/smart-glasses-ai-meta-courts-20260326.html
236•Philadelphia•7h ago•85 comments

Hardware Image Compression

https://www.ludicon.com/castano/blog/2026/03/hardware-image-compression/
22•luu•1d ago•5 comments

Douglas Lenat's Automated Mathematician Source Code

https://github.com/white-flame/am
16•hydrolox•4d ago•1 comments

Eclipse GlassFish: This Isn't Your Father's GlassFish

https://foojay.io/today/eclipse-glassfish-this-isnt-your-fathers-glassfish/
19•henk53•4d ago•7 comments

Stripe Is Down

https://dashboard.stripe.com/login
13•tompccs•33m ago•4 comments

Pretext: TypeScript library for multiline text measurement and layout

https://github.com/chenglou/pretext
303•emersonmacro•1d ago•54 comments

I am definitely missing the pre-AI writing era

https://www.lesswrong.com/posts/BJ4pnropWdnzzgeJc/i-am-definitely-missing-the-pre-ai-writing-era
41•joozio•2h ago•22 comments

Midnight train from GA: A view of America from the tracks as airports struggle

https://apnews.com/article/airports-shutdown-long-lines-train-travel-amtrak-e4d8ea591b3b036142c2b...
96•walterbell•13h ago•76 comments

ninja: a small build system with a focus on speed

https://github.com/ninja-build/ninja
13•tosh•2d ago•1 comments

15 years, one server, 8GB RAM and 500k users – how Webminal refuses to die

https://community.webminal.org/t/15-years-one-server-8gb-ram-and-500k-users-how-webminal-refuses-...
88•giis•3h ago•18 comments

My MacBook keyboard is broken and it's insanely expensive to fix

https://tobiasberg.net/posts/my-macbook-keyboard-is-broken-and-its-insanely-expensive-to-fix/
182•TobiasBerg•14h ago•224 comments

Apple nailed AI by doing fucking nothing lol

https://xcancel.com/cryptopunk7213/status/2038351931589193953?s=20
7•doener•32m ago•1 comments

The Cognitive Dark Forest

https://ryelang.org/blog/posts/cognitive-dark-forest/
430•kaycebasques•13h ago•192 comments

Gonon: Building a Clock with No Numerals

https://tonygaeta.com/perceptor/code/gonon
29•nullpath•3d ago•19 comments

"Roadrunner": a bipedal, wheeled robot for multi-modal locomotion [video]

https://www.youtube.com/watch?v=9kae-UAME1U
29•surprisetalk•4d ago•14 comments

Interview: Nobonoko, Master of the Minimal Sequencer

https://fi-le.net/nobo/
32•fi-le•2d ago•3 comments

When Coupled Volcanoes Talk, These Researchers Listen

https://www.quantamagazine.org/when-coupled-volcanoes-talk-these-researchers-listen-20260327/
4•ibobev•2d ago•0 comments

The road signs that teach travellers about France

https://www.bbc.com/travel/article/20260327-the-road-signs-that-teach-travellers-about-france
104•1659447091•12h ago•48 comments

How to Survive in the Tech industry in 2026

https://blog.phuaxueyong.com/post/2026-03-23-how-to-survive-tech-in-2026/
10•xueyongg•1h ago•3 comments

HD Audio Driver for Windows 98SE / Me

https://github.com/andrew-hoffman/wdmhda
42•userbinator•4h ago•6 comments