Log Analytics: Support Godaddy style log files #4526

Closed
anonymous-matomo-user opened this issue Jan 14, 2014 · 6 comments
Labels
duplicate For issues that already existed in our issue tracker and were reported previously. Enhancement For new feature suggestions that enhance Matomo's capabilities or add a new report, new API etc.
Comments

@anonymous-matomo-user

Good afternoon everyone,

I had difficulty importing log files from GoDaddy into Piwik (URLs below have been replaced for privacy) using:

piwik/htdocs/misc/log-analytics/import_logs.py --idsite=1 --url=piwikurl --enable-http-errors --enable-http-redirects --enable-static -d /home/bitnami/logfile.log

If the GET field looks like "GET www.site.org/index.htm" the import fails and produces 'Page URL not defined.'
If it looks like "GET /index.htm" or "GET http://www.site.org/index.htm" the import is successful.

I believe the problem occurs in the log import tool at piwik/htdocs/misc/log-analytics/import_logs.py and not on the Piwik PHP side.

Printing the JSON that the import tool sends to the Piwik server for each hit shows that successful imports have 'http://' in the URL provided to Piwik.

Tested with a Vagrant/Puppet install provided at http://piwik.org/blog/2012/08/get-started-with-piwik-development-with-puppet-and-vagrant/ (v 2.0.3)
and with the Piwik install provided by Bitnami (v 2.0.2)

Both produced working Piwik installations. I configured the sites in settings to accept site.org and www.site.org.

Workaround:
If the host is specified in the log file, either add 'http://' in front of it or remove the host, for example:

sed -i 's/GET site.org/GET /g' logfile.log
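The same workaround can be sketched as a pre-processing step in Python, for cases where several hostnames appear in one log file. This is only a sketch and not code from import_logs.py: the helper names `normalize_target` and `normalize_line` are hypothetical, and it assumes the request is the first quoted field on each line, as in the Apache common/combined formats.

```python
import re

# Hypothetical helpers for pre-processing a log file before import;
# they are not part of import_logs.py.

# Matches the start of the quoted request field: "METHOD target
_REQUEST_RE = re.compile(r'"(?P<method>[A-Z]+) (?P<target>\S+)')

# Matches a leading URL scheme such as http:// or https://
_SCHEME_RE = re.compile(r'^[A-Za-z][A-Za-z0-9+.-]*://')


def normalize_target(target, scheme="http"):
    """Prefix a scheme when the request target starts with a bare host."""
    if target == "*" or target.startswith("/") or _SCHEME_RE.match(target):
        # Already a path, an absolute URL, or the OPTIONS wildcard.
        return target
    return "%s://%s" % (scheme, target)


def normalize_line(line):
    """Rewrite the request target in the first quoted request on a log line."""
    def fix(match):
        return '"%s %s' % (match.group("method"),
                           normalize_target(match.group("target")))
    return _REQUEST_RE.sub(fix, line, count=1)


if __name__ == "__main__":
    import sys
    # Usage: python normalize_log.py < logfile.log > fixed.log
    for raw in sys.stdin:
        sys.stdout.write(normalize_line(raw))
```

Unlike the sed command, this keeps the hostname in the URL, so hits for site.org and www.site.org stay distinguishable after the import.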

@anonymous-matomo-user
Author

Attachment: apache logfile
test.log

@anonymous-matomo-user
Author

Attachment: json submitted to piwik
json.log

@mattab
Member

mattab commented Jan 21, 2014

Thanks for the report. I've never seen logs in this format before. Which software / server generates access logs such as "GET www.site.org/index.htm"?

I'm asking to find out whether it would impact a lot of users or just a few. Cheers

@mattab
Member

mattab commented Feb 1, 2014

Consolidating milestones FTW

@mattab
Member

mattab commented Feb 4, 2014

@user10001001 do you mind replying to my question? Is it GoDaddy that generates these server log files?

@mattab
Member

mattab commented Mar 12, 2015

Issue was moved to the new repository for Piwik Log Analytics: https://github.com/piwik/piwik-log-analytics/issues

refs #7163

@mattab mattab closed this as completed Mar 12, 2015
@mattab mattab added the duplicate For issues that already existed in our issue tracker and were reported previously. label Mar 12, 2015