frontpage.
newsnewestaskshowjobs

Made with ♥ by @iamnishanth

Open Source @Github

fp.

Open in hackernews

Backing Up Spotify

https://annas-archive.li/blog/backing-up-spotify.html
141•vitplister•2h ago

Comments

lelouch9099•1h ago
How legal is this with regards to copyright laws?
phainopepla2•1h ago
Not legal
basisword•44m ago
It's not. It's awful people justifying awful behaviour. And it's why we can't have nice things. There are always assholes ready to exploit others.
nemomarx•36m ago
There's some irony here considering Spotify used pirated mp3s at the start of their operations, I suppose.
poly2it•2m ago
Some people's urges to destroy all traces of human civilisation astonish me. What do you think Spotify is going to do with all its music when it ceases to exist in however many years? No, we must collectively feed Daniel Ek the Hungry.
Aurornis•44m ago
Not legal. This group does not concern themselves with copyright law.
toomuchtodo•43m ago
Adherence to the legal framework is a function of your risk appetite.
ronsor•5m ago
Very, if we delete copyright like we're supposed to.
artninja1988•1h ago
Wow. Anna is a godsend. Hopefully now we get some really good open source music models
zoklet-enjoyer•1h ago
Wow. Now I just need some hard drives and a way to download that without my ISP doing something about it. That's amazing.
basisword•52m ago
Am I understanding this wrong? Ripping the metadata I'm fine with. But it sounds like they've ripped every song from Spotify and they're going to release them?

Edit: It seems like they are. Stealing from tens of thousands of artists, big and small, and calling it "preservation" or "archiving" is scummy.

efilife•45m ago
Why is this stealing? You can already listen to everything that's on Spotify with a free account. You are free to also record the audio while it's playing. I suppose grabbing the actual file should't matter? Or is this about releasing? And robbing people of plays they would otherwise get through Spotify?
basisword•41m ago
If you listen to something on Spotify with a free account the artists still get paid. This isn't a case where you're ripping off so mega-corp. You're ripping off thousands of artists from major label ones to tiny indies. Take the metadata and build something cool. Stealing the files and releasing them is something else entirely.
prmoustache•23m ago
You can record what you play from Spotify and you are already free to play the record again and again and again without the artist being paid.

Most people do not because they find it less convenient than paying 20bucks a month or whatever is the current price in 2025 but that doesn't change the reality.

For most people the appeal of Spotify is not the music itself but the playlists that are shared thanks to its ubiquity. This is the reason other services struggle to make a dent even if they have better quality, UI and algos.

Spotify started by disrupting the market using pirated music by the way so you are pretty much endorsing and encouraging piracy when "paying" your favorite artists through Spotify.

WD-42•43m ago
Nobody is gonna download a 300TB torrent just to get the latest Taylor Swift album. There are much easier avenues than that.

What’s actually scummy is Spotify paying artists $1 per 1000 streams.

Buy CDs. Use Bandcamp.

basisword•40m ago
How about we let the individual artists decide?
Nextgrid•41m ago
Music piracy is already a thing, not to mention you don't even need to torrent nowadays when music is available for free on YouTube. Those who don't want to pay already don't pay so nothing changes there.

The value of Spotify is the convenience, and this collection does not change that in any way. Your argument would apply if someone were to make a Spotify clone with the same UX using this data.

Slow_Hand•36m ago
While I wouldn't call this scummy I do agree with your sentiment. It is technically stealing and those copyrights should be respected.

Full disclosure, I am a career musician AND have been known to pirate material. That said, I think this is a valuable archive to build. There are a lot of recordings that will not endure without some kind of archiving. So while it's not a perfect solution, I do think it has an important role to play in preservation for future generations.

Perhaps it's best to have a light barrier to entry. Something like "Yes, you can listen to these records, but it should be in the spirit of requesting the material for review, and not just as a no-pay alternative to listening on Spotify." Give it just enough friction where people would rather pay the $12/month to use a streaming service.

Also, it's not like streaming services are a lucrative source of income for most artists. I expect the small amount of revenue lost to listeners of Anna's Archive are just (fractions of) a penny in the bucket of any income that a serious artist would stand to make.

prmoustache•25m ago
Stealing is not the correct word.
nutjob2•12m ago
Don't worry, they let Spotify keep the original files.
klabb3•11m ago
The people I know who go through the trouble of pirating and downloading vast libraries of music are all musicians themselves, or at the very least total music nerds. They don’t want to lose access to their stuff, plus if they ever need to import audio into a DAW, DRM is a no-go. They are the same people who spend large amounts of money on vinyls, and support smaller independent artists through concerts, merch and (back in the day) CDs.

It used to be more mixed, but today, piracy is often the only option to ”own” any media at all.

WD-42•51m ago
Incredible.

> A while ago, we discovered a way to scrape Spotify at scale.

They wont and shouldn’t divulge the details, but I imagine that would be a fun read!

bmikaili•15m ago
they're probably just using something like https://github.com/nor-dee/spotizerr-spotify
frereubu•49m ago
Site is down for me. Archive link: https://archive.is/jf3HW
ipsum2•40m ago
Ironic. But its working for me.
mawax•37m ago
Probably not down, but blocked by your ISP. Try a VPN. Same thing happens here.
xnx•47m ago
Merry Christmas!
crazygringo•45m ago
This is insane.

I definitely was not aware Spotify DRM had been cracked to enable downloading at scale like this.

The thing is, this doesn't even seem particularly useful for average consumers/listeners, since Spotify itself is so convenient, and trying to locate individual tracks in massive torrent files of presumably 10,000's of tracks each sounds horrible.

But this does seem like it will be a godsend for researchers working on things like music classification and generation. The only thing is, you can't really publicly admit exactly what dataset you trained/tested on...?

Definitely wondering if this was in response to desire from AI researchers/companies who wanted this stuff. Or if the major record labels already license their entire catalogs for training purposes cheaply enough, so this really is just solely intended as a preservation effort?

Aurornis•40m ago
> The thing is, this doesn't even seem particularly useful for average consumers/listeners, since Spotify itself is so convenient, and trying to locate individual tracks in massive torrent files of presumably 10,000's of tracks each sounds horrible.

I wouldn’t be so sure. There are already tools to automatically locate and stream pirated TV and movie content automatic and on demand. They’re so common that I had non-technical family members bragging at Thanksgiving about how they bought at box at their local Best Buy that has an app which plays any movie or TV show they want on demand without paying anything. They didn’t understand what was happening, but they said it worked great.

> Definitely wondering if this was in response to desire from AI researchers/companies who wanted this stuff.

The Anna’s archive group is ideologically motivated. They’re definitely not doing this for AI companies.

crazygringo•28m ago
> The Anna’s archive group is ideologically motivated.

Very interesting, thank you. So using this for AI will just be a side effect.

And good point -- yup, can now definitely imagine apps building an interface to search and download. I guess I just wonder how seeding and bandwidth would work for the long tail of tracks rarely accessed, if people are only ever downloading tiny chunks.

5-•27m ago
> The Anna’s archive group is ideologically motivated. They’re definitely not doing this for AI companies.

https://en.wikipedia.org/wiki/Useful_idiot

basisword•39m ago
>> But this does seem like it will be a godsend for researchers working on things like music classification and generation. The only thing is, you can't really publicly admit exactly what dataset you trained/tested on...?

Didn't Meta already publicly admit they trained their current models on pirated content? They're too big to fail. I look forward to my music Slop.

throwaway613745•43m ago
I wonder how deep the hole they're gonna put whoever runs this site into is gonna be?
Etheryte•26m ago
To put this into perspective, What.CD [0] was widely considered to be the music library of Alexandria, unparalleled in both its high quality standard and it's depth. What had in the ballpark of a few million torrents when it got raided and shut down. Anna's rip of Spotify includes roughly 186 million unique records. Granted, the tail end is a mixed bag of bot music and whatnot, but the scale is staggering.

[0] https://en.wikipedia.org/wiki/What.CD

syntaxing•25m ago
Moral and legal discussion aside, this is technically very impressive. I also wouldn’t be surprised if this somehow kickstarts open source music generative AI from China.
ipsum2•24m ago
Can someone explain why C#/Db (major/minor) is the third most popular key? Very unexpected for me, since its relatively more difficult to play.
kzrdude•16m ago
Electronic dance music is the biggest genre in the data. So then easy to play shouldn't matter. It's still an interesting question. I think playing Db is pretty nice on the piano even if it's not the easiest.
klysm•15m ago
Difficult to play in what instrument?
Fizzadar•22m ago
I have Spotify premium but the constant shuffle of content availability has meant I’ve stared routinely archiving my liked songs to avoid any rug pull. Zspotify and co still work a charm.
yegle•18m ago
Not that we should, but it's technically feasible to have a music streaming server with the torrent as the backend, and selectively download the part of the torrent in respond to on-demand streaming request from the client.
nutjob2•8m ago
I wonder how definitive their collection is and how much ripping Google Music/YouTube would improve on this.

A distributed ripping project to do that would be a fine thing.

yellow_lead•8m ago
Is the music torrent not up yet? Only see the metadata one here: https://annas-archive.li/torrents/spotify
artninja1988•7m ago
Yeah, in the article they write:

The data will be released in different stages on our Torrents page:

[X] Metadata (Dec 2025)

[ ] Music files (releasing in order of popularity)

[ ] Additional file metadata (torrent paths and checksums)

[ ] Album art

[ ] .zstdpatch files (to reconstruct original files before we added embedded metadata)

yellow_lead•6m ago
Oh I see, thanks! I missed that

Indie Game Awards retracts awards from game for using Gen AI during development

https://bsky.app/profile/indiegameawards.gg/post/3magufzccy22i
1•ArlenBales•25s ago•0 comments

Freedesktop's website is blocked to "protect against AI companies"

https://www.freedesktop.org/wiki/
1•alutnopk•1m ago•1 comments

State of CSS 2025: Modern Features Every Business Needs

https://aexaware.com/blog/state-of-css-2025-modern-features-every-business-needs/
1•jainilprajapati•3m ago•0 comments

'Pretty birds and silly moos': the women behind the Sex Discrimination Act

https://www.theguardian.com/world/2025/dec/18/pretty-birds-and-silly-moos-the-women-behind-the-se...
1•binning•4m ago•0 comments

The Female Artist Who Showed How Plants and Insects Relate

https://worldsensorium.com/the-female-artist-who-showed-how-plants-and-insects-relate/
1•dnetesn•5m ago•0 comments

Shackled, alone and scared: the grim reality for women who give birth in prison

https://www.theguardian.com/global-development/2025/dec/18/shackled-alone-pregnant-women-forced-b...
1•binning•5m ago•0 comments

When We Were Lunch

https://nautil.us/when-we-were-lunch-1255357/
1•dnetesn•6m ago•0 comments

The Big Ideas in Relay

https://www.youtube.com/watch?v=yInL1aPZjvI
1•todsacerdoti•7m ago•0 comments

Menstrual dignity in the face of period poverty

https://writesobereditsober.substack.com/p/girls-just-wanna-have-periods-and
1•binning•8m ago•0 comments

Show HN: Tab Master Chrome extension for managing tabs with auto-suspend

https://chromewebstore.google.com/detail/tab-master-save-tabs-auto/cffmohngbglhnnneppndhcifppjpmpae
1•aabdoahmed•9m ago•1 comments

Show HN: Сulsans – Thread-safe async-aware queue for Python

https://github.com/x42005e1f/culsans
1•x42005e1f•9m ago•0 comments

Show HN: Memora – MCP persistent memory server knowledge graph vis

https://github.com/agentic-mcp-tools/memora
1•spokv•10m ago•0 comments

Accurate Matrix Animation

https://rezmason.github.io/matrix/?version=trinity
1•memalign•11m ago•1 comments

Signaling in the Age of AI: Evidence from Cover Letters

https://arxiv.org/abs/2509.25054
1•tacon•12m ago•0 comments

Call for Applications: Early Career Rescue Fellowship

https://uni-freiburg.de/frias/call-for-applications-early-career-rescue-fellowship/
1•OhMeadhbh•12m ago•1 comments

What Happens When You Keep Cutting Paper Forever? [video]

https://www.youtube.com/watch?v=ux-AWup9aVI
1•akshatjiwan•14m ago•0 comments

Ask HN: Could we detect brain-eye overload

1•RCLS•17m ago•0 comments

Tinker: General Availability and Vision Input

https://thinkingmachines.ai/blog/tinker-general-availability/
1•pmaddams•18m ago•0 comments

WebPipes– A web-based Virtual Pipe Organ with realistic console UI

https://www.openpipes.org/webpipes/
1•temporal_123•19m ago•1 comments

Secureblue: A security-focused desktop and server Linux operating system

https://secureblue.dev/
1•pmaddams•19m ago•0 comments

Sequoia's Shaun Maguire accused an innocent Palestinian of being Brown shooter

https://www.fastcompany.com/91463942/sequoia-shaun-maguire-brown-university-shooter-palestine
7•newspaper1•20m ago•0 comments

Code Orange: Fail Small – our resilience plan following recent incidents

https://blog.cloudflare.com/fail-small-resilience-plan/
2•eenchev•21m ago•0 comments

String Theory in 2037, Brian Green and Edward Witten

https://www.youtube.com/watch?v=sAbP0magTVY
2•tzury•22m ago•0 comments

SomaFM Holiday Playlists

https://somafm.com/
1•ompogUe•23m ago•1 comments

A Year of Western European Train Delays Visualized

https://chuuchuu.com/2025wrapped
2•marton_s•23m ago•0 comments

The Meek Did Inherit the Earth, at Least Among Ants

https://www.nytimes.com/2025/12/19/science/ants-exoskeletons-weak.html
3•marojejian•28m ago•1 comments

The Art of Pivoting – Techniques for Intelligence Analysts

https://github.com/adulau/the-art-of-pivoting
2•adulau•29m ago•0 comments

Two Canadians lose $2.3M to crypto scams

https://www.ctvnews.ca/canada/article/i-was-heartbroken-two-canadians-lose-23-million-to-crypto-s...
1•mattm•31m ago•0 comments

The Race to Build the World's Best Friend

https://www.nytimes.com/2025/12/20/opinion/ai-chat-gpt-models-personality.html
1•mitchbob•31m ago•1 comments

Google Photos, except it's Epstein's camera roll

https://www.jmail.world/photos
3•pouwerkerk•32m ago•0 comments