@anonymous-piwik-user opened this Issue on January 31st 2013

The following strings are bots/spiders that are being registered in the Not-Bots section when using log import. Using Piwik 1.10.1

Ezooms/1.0; ezooms.bot@gmail.com

@anonymous-piwik-user commented on January 31st 2013

Added a few more.

news bot /2.1

@mattab commented on February 7th 2013 Owner

Surprising, because 'spider' is already in the array of user agent to classify as Bots..

@anonymous-piwik-user commented on February 10th 2013

Here is a list of strings as seen in the log files. I had to remove the 'http:' part of the url's in order to paste this due to some kind of anti-spam setting that was rejecting the links.

Piwik: Baiduspider/2.0
Log: "Mozilla/5.0 (compatible; Baiduspider/2.0; +//www.baidu.com/search/spider.html)"

Piwik: Baiduspider-image
Log: "//image.baidu.com/i?ct=503316480&z=0&tn=baiduimagedetail" "Baiduspider-image+(+//www.baidu.com/search/spider.htm)"

Piwik: Ezooms/1.0; ezooms.bot@
Log: "Mozilla/5.0 (compatible; Ezooms/1.0; ezooms.bot@gmail.com)"

Piwik: Sosospider/2.0;
Log: "Mozilla/5.0(compatible; Sosospider/2.0; +//help.soso.com/webspider.htm)"

Piwik: JikeSpider
Log: "Mozilla/5.0 (compatible; JikeSpider; +//shoulu.jike.com/spider.html)"

Piwik: news bot /2.1
Log: "Mozilla/5.0 (compatible; news bot /2.1)"

Piwik: Blekkobot
Log: "Mozilla/5.0 (compatible; Blekkobot; ScoutJet; +//blekko.com/about/blekkobot)"

Piwik: ScoutJet
Log: "Mozilla/5.0 (compatible; Blekkobot; ScoutJet; +//blekko.com/about/blekkobot)"

The 'Blekkobot' and "ScoutJet' bot appear to be the same in the logs, but are detected separately in Piwik's log import.

@anonymous-piwik-user commented on February 10th 2013

Concerning the 'spider' keyword. I upgraded the Piwik system the customers see to the 1.10.1. I was not sure if the log analytic copies that exist on the web servers to do the import were updated. I have updated those today to be sure, and will report back after our next import.

Thank you

@mattab commented on April 5th 2013 Owner

Havent heard feedback so I assume it works fine

This Issue was closed on April 5th 2013
Powered by GitHub Issue Mirror