platers•1h ago
I'm struggling to understand where the gains are coming from. What is the intuition for why DiT training was so inefficient?
joshred•49m ago
Here's a high-level explanation of the simplest diffusion setup. The model trains by taking an image and iteratively adding noise to it until only noise remains. It then learns to reverse that sequence: starting from pure noise, it predicts the noise to remove at each step until it reaches the final step, which should reconstruct the original image (the training input).
That process means they may require a hundred or more training iterations on a single image. I haven't digested the paper, but it sounds like they are proposing something conceptually similar to skip layers (but significantly more involved).
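Concretely, a single training step in the classic DDPM formulation looks roughly like this sketch (PyTorch-flavored; `model`, its `(x_t, t)` signature, and the noise schedule `alphas_cumprod` are placeholders for illustration, not anything from the paper):

```python
import torch
import torch.nn.functional as F

def ddpm_training_step(model, x0, alphas_cumprod, optimizer):
    """One DDPM-style training step: noise a clean image batch at random
    timesteps and train the model to predict the added noise."""
    batch_size = x0.shape[0]
    num_steps = alphas_cumprod.shape[0]

    # Sample one random timestep per image in the batch.
    t = torch.randint(0, num_steps, (batch_size,), device=x0.device)

    # Forward process: blend each clean image with Gaussian noise
    # according to the cumulative noise schedule at its timestep.
    noise = torch.randn_like(x0)
    a_bar = alphas_cumprod[t].view(-1, 1, 1, 1)  # assumes NCHW images
    x_t = a_bar.sqrt() * x0 + (1 - a_bar).sqrt() * noise

    # The model is trained to predict the noise that was added.
    pred_noise = model(x_t, t)
    loss = F.mse_loss(pred_noise, noise)

    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```

In practice you don't replay the whole noising sequence for one image in a single step; you sample one random timestep per example, so over the course of training each image gets revisited at many different noise levels, which is part of why it's so expensive.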
earthnail•52m ago
Wow, Ommer’s students never fail to impress. 37x faster for a generic architecture, i.e. no domain-specific tricks. Insane.