From CameronMacLeod (2022) - and a much more complete analysis (587 points, 2023, 155 comments) https://news.ycombinator.com/item?id=38531428
Or Slate (2009) (50 points, 16 comments) https://news.ycombinator.com/item?id=893353
Thanks for the other links, the question in this title is one I've day-dreamily thought about on occasion, but never dug into. Will have a read of all three.
- OG Shazam paper https://www.ee.columbia.edu/~dpwe/papers/Wang03-shazam.pdf (he has a talk on YouTube btw, look it up if you really care)
- https://news.ycombinator.com/item?id=18069968 Shazam employee blog post
- https://news.ycombinator.com/item?id=38538996 Shazam cofounder-endorsed explainer
- Go algo repro https://news.ycombinator.com/item?id=41127726
as with all ML things... the code is a much smaller % of the value than the data...
Audible Magic claims to be able to recognize multiple performances of the same songs, and even parodies.[1] Using, of course, "AI technology" and much more compute.
[1] https://www.audiblemagic.com/2024/02/07/identifying-cover-so...