SQUR found more flags than human pentesters on the XBEN CTF benchmark suite. We reached 91 out of 104 flags (87.5%), exceeding the best reported human score.
CTF is not the same as hacking or pentesting, but the results indicate a strong exploit capability nevertheless.
If we would assume malicious users achieve comparable results, we should all dress warm for the upcoming cybersecurity challenges.
adamlundqvist•1h ago
CTF is not the same as hacking or pentesting, but the results indicate a strong exploit capability nevertheless.
If we would assume malicious users achieve comparable results, we should all dress warm for the upcoming cybersecurity challenges.