Forum Moderators: goodroi

Message Too Old, No Replies

boitho.com bot violating robots.txt

Specifically requested only forbidden files

         

jazzguy

8:08 pm on May 5, 2005 (gmt 0)

10+ Year Member



"boitho.com-dc/0.75 ( http*//www.boitho.com/dcbot.html )" came from 129.241.104.168. It specifically targetted disallowed files from robots.txt, ignoring all other pages.

The info page says it's a distributed crawler, so just like my policy for the cronic robots.txt violater Grub, I banned the user agent and the entire IP block associated with the offending IP.

bcolflesh

8:30 pm on Jun 14, 2005 (gmt 0)

WebmasterWorld Senior Member 10+ Year Member



Already covered multiple times in this thread.

I keep searching the thread but I cannot find the contents of the robots.txt file you claim to have posted - what does "troll" mean in this context? If you have "troll" as a robots.txt directive, this may be the problem - please check one of the validators I mentioned earlier.

This 111 message thread spans 12 pages: 111