You may be interested in watermarks that are invisible to the human eye but are visible to spectral analysis. You can even encode data within this invisible watermark, and learn a modification that is resistant to various image manipulations and even resistant to lossy compressions, in research that goes back to 2001 and has seen interesting improvements with advances in DNNs, Transformers and Diffusion.
Another that works for audio content is: https://github.com/swesterfeld/audiowmark
Incidentally, the latter also supports applying the watermark payload to HLS segments.
With a traditional "large-file" video format, this is non-trivial, as you need to reencode (or at least remux) the entire video for each user. This is not only computationally intensive, it also prevents you from hosting your videos on a CDN, increasing bandwidth costs.
With a segmented format like HLS, you can just embed the watermark in a few key segments, leaving the rest of the video intact. As long as the user can't distinguish between watermarked and non-watermarked segments, you get most of the benefit at a much lower cost. Crucially, you can keep serving the non-watermarked segments off a CDN.
This makes me wonder whether there are "segment-native" watermarking algorithms. The idea would be to pre-encode each segment multiple times, with each file containing a different watermark. A random variant of each segment would be chosen for each download, with the IDs of the chosen variants saved in a database. THe security of such an algorithm would rely on the combinatorial explosion resulting from the number of allowed variant combinations. For example, with just 3 variants per segment, 15 segments would give you 59 million distinct watermarks.
laserspeed•7h ago