I got sick of doing this over and over, so I wrote a wrapper around RocksDB that deduplicates values after a Put() operation. Currently only exact deduplication is performed, but I want to extend it in a number of ways, including semantic (fuzzy) deduplication for things like images and text.
Any feedback on the project would be appreciated: