We often explore ways to make deep learning models more efficient. One fundamental insight is that deep learning models are inherently sparse: many weights can be set to zero with little or no loss in accuracy. This idea, known as model pruning, goes back to Yann LeCun's pioneering late-1980s work Optimal Brain Damage. Since then, both software frameworks and hardware accelerators have evolved to exploit this sparsity, enabling faster inference and lower memory consumption.
Most pruning methods produce unstructured sparsity, where any individual weight can be zeroed out. While this maximizes flexibility, it poses challenges for hardware acceleration. A more hardware-friendly alternative is 2:4 semi-structured sparsity, where out of every four consecutive weights, exactly two are zero. This pattern strikes a balance between flexibility and computational efficiency, and it maps directly onto modern GPU hardware: NVIDIA's Sparse Tensor Cores, available since the Ampere architecture, can exploit 2:4 sparsity to speed up matrix multiplication.
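To make the 2:4 pattern concrete, here is a minimal PyTorch sketch of magnitude-based 2:4 pruning: in every group of four consecutive weights, it keeps the two largest-magnitude values and zeroes the rest. The helper name `prune_2_4` is ours for illustration; real deployments would rely on the framework's own sparsity tooling rather than a hand-rolled mask.

```python
import torch

def prune_2_4(weight: torch.Tensor) -> torch.Tensor:
    """Illustrative magnitude-based 2:4 pruning (not a library API):
    in every group of four consecutive weights along the last dimension,
    keep the two largest-magnitude values and zero out the other two."""
    rows, cols = weight.shape
    assert cols % 4 == 0, "the last dimension must be divisible by 4"
    groups = weight.reshape(rows, cols // 4, 4)
    # Indices of the two largest-magnitude entries in each group of four.
    keep = groups.abs().topk(k=2, dim=-1).indices
    mask = torch.zeros_like(groups, dtype=torch.bool)
    mask.scatter_(-1, keep, torch.ones_like(keep, dtype=torch.bool))
    return (groups * mask).reshape(rows, cols)

# Toy example: a 4x8 weight matrix pruned to exactly 50% sparsity.
w = torch.randn(4, 8)
w_pruned = prune_2_4(w)
print((w_pruned == 0).float().mean().item())  # 0.5
```

Note that the pruned weights above are still stored densely; to actually benefit from Sparse Tensor Cores, the tensor must additionally be converted to the hardware's compressed 2:4 representation using the sparsity support in frameworks such as PyTorch or TensorRT.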