@mikkeschiren opened this Issue on March 10th 2019

I have not looked deeply into this, but is it possible to make Transitions use archived data instead of the logs? If it is not easily done, what would need to change to make this possible?

@mikkeschiren commented on March 10th 2019

Use case: We have instances with a lot of logs, and we need to clean up the logs very often. We get all the other reports from archived data, but Transitions queries the log tables directly. After we clean up old logs, we no longer get any data for Transitions.

@tsteur commented on March 10th 2019 Member

It's not really planned currently, but it may be good to do at some point, especially now that the report is more exposed and could cause a lot more performance issues. It's known to be potentially slow with lots of logs.

@mikkeschiren commented on March 11th 2019

OK, I will try to look into this in the near future.

@mfb commented on September 11th 2019

We'd love to be able to generate Transitions reports from archived records. Our privacy policy requires that we archive/aggregate logs on a weekly basis, but we want to be able to analyze transitions from the past month or so.

@mattab commented on October 9th 2019 Member

We've discussed it internally and we are a bit worried about archiving the Transitions data, because it represents a lot of data to aggregate plus a lot of slow-running SQL queries to get this data daily. For each Page URL and Page title we'd need to store the previous 10-30 pages/events/referrers and the next 10-30. So that's a lot of string data (URLs) to store for each URL/page title on the site.
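As a rough illustration of the storage concern above, here is a back-of-the-envelope sketch. It is not Matomo code: the function names, the page count, and the average label size are all hypothetical, and it only multiplies out the "10-30 previous and next entries per Page URL and Page title" figure from this comment.

```python
def estimate_transition_rows(num_pages, neighbours_per_side,
                             dimensions=2, directions=2):
    """Rough upper bound on label rows to store per archive period.

    dimensions=2 -> Page URL and Page title (from the comment above)
    directions=2 -> previous and next entries (from the comment above)
    num_pages and neighbours_per_side are hypothetical inputs.
    """
    return num_pages * dimensions * directions * neighbours_per_side


def estimate_bytes(rows, avg_label_bytes):
    """Approximate string payload, assuming a hypothetical average label size."""
    return rows * avg_label_bytes


# Hypothetical site with 10,000 distinct pages and ~100-byte labels.
low_rows = estimate_transition_rows(10_000, 10)
high_rows = estimate_transition_rows(10_000, 30)
print(low_rows, high_rows)
print(estimate_bytes(low_rows, 100), estimate_bytes(high_rows, 100))
```

These figures would apply per archived period (for example per day), so the total grows with both the number of pages and the number of periods kept.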

Instead we have another idea: maybe it would be possible for you to keep old RAW logs for a longer time, but make sure that the RAW logs you keep in Matomo are fully anonymised.

Maybe we could build an easier feature for "Full anonymisation" of the data. It would fully remove any potential personal data but leave the actual pageview transitions data in place, so we could still process and report on Transitions using RAW data.

This was discussed in #12737 Enable "Super Privacy" mode to not track any personal data, aka "I do not want to be bothered with GDPR"

It was also partly covered in other issues:

  • Remove/anonymize personal information from URLs automatically #13516
  • Offer possibility to only record continent, country and region location #12735