Garbled search keywords from GBK-encoded Chinese search engines #12732
Comments
Hi, can you further describe what issue you are referring to and how to reproduce it?
Chinese search engines such as the following use GBK encoding rather than UTF-8. Matomo does not recognize this and decodes everything as UTF-8, so the search terms come out garbled.
@qq383762126 The charset for
I just checked my Matomo instance, and around 5% of my records from Sogou are also unreadable, for example,
It sounds like those are browsers that are already sending invalid UTF-8 to Matomo, so there is little that can be fixed here. And as long as Matomo now gets valid UTF-8 data, with #9785 it should be possible to store any Unicode character.
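To illustrate the kind of fallback being asked about, here is a minimal Python sketch. It is not Matomo's code (Matomo is written in PHP), and the function name `decode_keyword` is hypothetical. It assumes the keyword arrives percent-encoded in a referrer URL, tries strict UTF-8 first, and falls back to GBK only when the bytes are not valid UTF-8. This is a heuristic: some GBK byte sequences happen to be valid UTF-8 too, so it cannot be fully reliable.

```python
from urllib.parse import unquote_to_bytes


def decode_keyword(raw_query_value: str) -> str:
    """Decode a percent-encoded search keyword (hypothetical helper).

    Tries strict UTF-8 first; if the bytes are not valid UTF-8,
    assumes GBK, which some Chinese search engines use.
    """
    # In query strings, '+' stands for a space.
    raw = unquote_to_bytes(raw_query_value.replace("+", " "))
    try:
        return raw.decode("utf-8")
    except UnicodeDecodeError:
        # Assumption: non-UTF-8 input is GBK. Undecodable bytes
        # are replaced with U+FFFD rather than raising.
        return raw.decode("gbk", errors="replace")


if __name__ == "__main__":
    print(decode_keyword("%D6%D0%CE%C4"))        # GBK-encoded keyword
    print(decode_keyword("%E4%B8%AD%E6%96%87"))  # UTF-8-encoded keyword
```

Trying UTF-8 first keeps keywords that are already correctly encoded untouched; only input that fails strict UTF-8 validation takes the GBK path.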
Can we solve the problem of garbled GBK-encoded search keywords from Chinese search engines?