I evaluated the Copyleaks copyright/plagiarism detection platform’s ability to detect phonetically and semantically equivalent text that differs orthographically from the original source using tools I made and link to in the paper.
Copyleaks’ own marketing materials cite accuracy rates above 99 % on “paraphrased and disguised” text. In contrast, our trials yielded detection rates as low as 0 %, with multiple transformed works passing undetected despite maintaining near-verbatim semantic and phonetic equivalence to the originals.
ycombiredd•2h ago
Here's a Medium version if you don't like clicking PDFs.
ycombiredd•2h ago
Copyleaks’ own marketing materials cite accuracy rates above 99 % on “paraphrased and disguised” text. In contrast, our trials yielded detection rates as low as 0 %, with multiple transformed works passing undetected despite maintaining near-verbatim semantic and phonetic equivalence to the originals.
ycombiredd•2h ago
https://medium.com/@scott.vr/part-of-your-world-e6a0d78d46b9