> If it's requesting something which is allowed in robots.txt, it's a legitimate request.
An abusive scraper is pushing over your boxes. It is intentionally circumventing rate limits and, more generally, defeating any accurate attribution of the traffic source. In this example you have deemed that behavior abusive and want to put a stop to it.
Any given request looks pretty much normal. The vast majority are coming from residential IPs (in this example your site serves mostly residential customers to begin with).
So what if 0.001% of requests hit a disallowed resource and you ban those IPs? You've cut roughly 0.001% of the traffic you're currently experiencing. That does nothing to solve the actual problem: the excessive traffic that disrespects rate limits and gums up your service for other well-behaved users.
Why would it be only 0.001% of requests? You can fill your actual pages with links to pages disallowed in robots.txt, hidden from a human user but visible to a bot scraping the markup. Adversarial bots ignoring robots.txt will follow those links everywhere. It could just as easily be 50% of requests, and every time it happens the scraper burns that IP address.
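A minimal sketch of that honeypot approach, assuming a Flask app with an in-memory ban set (the route names, the hidden-link markup, and the ban store are all illustrative, not anyone's production setup; real bans would live at the edge in something like iptables or fail2ban):

```python
# Hypothetical honeypot sketch: serve a robots.txt that disallows /trap,
# hide a /trap link in every page, and ban any IP that requests /trap
# anyway. Only a bot ignoring robots.txt ever ends up there.
from flask import Flask, abort, request

app = Flask(__name__)
banned_ips = set()  # illustrative only; use a real ban store in practice

@app.before_request
def reject_banned():
    # Drop all further requests from IPs that have tripped the trap.
    if request.remote_addr in banned_ips:
        abort(403)

@app.route("/robots.txt")
def robots():
    # Well-behaved crawlers read this and never touch /trap.
    return ("User-agent: *\nDisallow: /trap\n", 200,
            {"Content-Type": "text/plain"})

@app.route("/")
def index():
    # The trap link is invisible to humans but present in the markup,
    # so a scraper following every href will eventually request it.
    return (
        "<html><body><h1>Welcome</h1>"
        '<a href="/trap" style="display:none" aria-hidden="true">x</a>'
        "</body></html>"
    )

@app.route("/trap")
def trap():
    # Reaching this handler means robots.txt was ignored: ban the IP.
    banned_ips.add(request.remote_addr)
    abort(403)
```

The more trap links you scatter through real pages, the larger the fraction of an adversarial crawl that lands on them, and each hit costs the scraper one of its residential IPs.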