It has long been common for scrapers to adopt the header patterns of search engine crawlers to hide in logs and bypass simple filters. The logical next step is for smaller AI players to present themselves as the largest players in the space.
Some search engines provide a list of their scraper IP ranges specifically so you can verify if scraper activity is really them or an imitator.
EDIT: Thanks to the comment below for looking this up and confirming this IP matches OpenAI’s range.
https://openai.com/searchbot.json
I don't know if imitating a major crawler is actually worth it, WAFs can easily tell you're faking it via IP/DNS lookups so surely you'll just end up on CloudFlare and co's naughty lists.
(the site may occasionally fail to load)
drwhyandhow•1h ago