New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Google Publisher Plugin bot crawler isn't excluded from visits #9567
Comments
@sgiehl can you have a look at this one? |
@TheCodePianist Do you have access to your access logs? Would you mind having a look there, if all those requests are coming with the same useragent? |
Piwik recognizes all visits to be from Mountain View, CA. Browser is allways Chrome, device varies between Mac and Android. The IP address is different for each visit, but DNS lookup always follows this pattern (where x represents the IP): crawl-xxx-xxx-xxx-xxx.googlebot.com Hope this helps, if not let me know which information you need! :) |
We can only exclude those visits using the IP or the useragent. As the first may vary it would be better to use the useragent. The useragent isn't displayed within Piwik. You can only get that information from your webservers access logs. Are you able to get those? |
Thanks, @TheCodePianist, Do you have access to the access logs? |
Sure, but I am not quite sure I got what you need. I searched the access log for the IP-addresses and picked three examples for you to choose from (the IPs at the beginning of the entry match the ones Piwik shows as the visitor IP):
|
I've created matomo-org/device-detector#5415 which will fix this issue. |
@TheCodePianist are you using Wordpress and the Google Adsense plugin? Or the Google Publisher Toolbar in Chrome? |
Will be fixed with the next version of piwik/device-detector |
Thank you for the fast response and fix! I am using the publisher-toolbar, but the visits tracked by Piwik are at times where my PC was turned off... |
For the first time after using Piwik for several projects, I am seeing heavy googlebot-activity for one of my pages. This project is the only one using AdSense.
However: Since I couldn't find a good way to exclude this "user" from my stats, I consider this as a kind of bug. Maybe you can add this to the bot-list for one of the next releases.
All page-links are called with the following URL parameter:
http://.../...&google_publisher_plugin_page_details=1
The text was updated successfully, but these errors were encountered: