Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to make hierarchical tables scale? (flatten, search, sub tables) #6763

Closed
mattab opened this issue Dec 1, 2014 · 2 comments
Closed

How to make hierarchical tables scale? (flatten, search, sub tables) #6763

mattab opened this issue Dec 1, 2014 · 2 comments
Assignees
Labels
c: Performance For when we could improve the performance / speed of Matomo. Major Indicates the severity or impact or benefit of an issue is much higher than normal but not critical. Task Indicates an issue is neither a feature nor a bug and it's purely a "technical" change.
Milestone

Comments

@mattab
Copy link
Member

mattab commented Dec 1, 2014

The goal of this issue is to investigate possible solutions to a set of performance and scalability problems that have affected Piwik since the beginning.

Use cases covered:

The following features don't work well when a datatable has thousands of sub-tables, as they will try to load in memory all sub-tables at once (can be many thousands tables used to build one hierarchical table report):

Possible solutions or ideas

  • Store blobs as XML instead of gzip serialized (helps Search)
  • Store blobs as JSON instead of gzip (can we search through JSON cells as with modern Postgresql?)
  • Store all sub-tables in the same piwik_archive_blob_ row instead of storing each sub-table in a new piwik_archive_blob_ row.
  • Experiment with different ways of storing millions of sub-tables (eg. store them in 100 chunks, or store them in equal chunks of ~50k rows)
  • Pre-process the Flattened reports (help Flatten)?

see general performance issue #6759, date range issues #4768, flatten performance issues #5098

@mattab mattab added Major Indicates the severity or impact or benefit of an issue is much higher than normal but not critical. Task Indicates an issue is neither a feature nor a bug and it's purely a "technical" change. c: Performance For when we could improve the performance / speed of Matomo. RFC Indicates the issue is a request for comments where the author is looking for feedback. labels Dec 1, 2014
@mattab mattab added this to the Mid term milestone Dec 1, 2014
@mattab mattab removed the RFC Indicates the issue is a request for comments where the author is looking for feedback. label Dec 18, 2014
@mattab mattab modified the milestones: Short term, Mid term Feb 7, 2015
@mattab mattab modified the milestones: Piwik 2.12.0, Short term Feb 19, 2015
@mattab
Copy link
Member Author

mattab commented Mar 9, 2015

Some serious progress has been made in pull requests: #7387 #7336 #7375

@mattab mattab changed the title How to make hierarchical tables scale? (flatten, search, date range aggregation, sub tables) How to make hierarchical tables scale? (flatten, search, sub tables) Mar 13, 2015
@mattab
Copy link
Member Author

mattab commented Mar 13, 2015

Done, flatten should now be fast for all reports! 👍

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
c: Performance For when we could improve the performance / speed of Matomo. Major Indicates the severity or impact or benefit of an issue is much higher than normal but not critical. Task Indicates an issue is neither a feature nor a bug and it's purely a "technical" change.
Projects
None yet
Development

No branches or pull requests

2 participants