SEO Ranking plugin not working - problem with ping? - Fix the random test failures #17919

tsteur · 2021-08-25T01:38:31Z

Seeing this in the tests

and an error in the UI

Seems this regressed.

sgiehl · 2021-08-25T08:52:58Z

That actually seems to fail randomly

Findus23 · 2021-08-25T09:07:22Z

This is more or less expected as it is crawling websites that randomly block or rate limit the requests.

tsteur · 2021-08-25T20:46:56Z

@Findus23 so maybe it's now also blocked on Travis because we run tests more often? In that case we may need to write the tests slightly differently, to ideally detect the rate limit and ignore it (skip the test or so) but not fail when the data or URL changes (not fail for any other error).

To fix it on cloud, maybe we would need to request the data using JavaScript to go directly to the search engines? But I assume it might then be a bit harder to test that things work.

Findus23 · 2021-08-26T08:40:26Z

But then again I thought we changed the tests to test against a local copy of the HTTP response, but I might be wrong here

sgiehl · 2021-08-26T08:50:43Z

I guess there are still some tests that try to fetch the data from bing, alexa and google to ensure the response doesn't change

tsteur · 2021-08-26T20:52:16Z

Created an internal issue for cloud to fix the rate limiting there, where we might simply try to get the data using JS directly from the search engine or so.

Meanwhile, as part of this issue it would be great to look into making the test stable. For example, we could force a run into the rate limit and check whether the response code or response message includes rate limit information; then we can catch this error specifically and ignore it in tests.
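
A minimal sketch of that idea, under stated assumptions: `looksRateLimited()` is a hypothetical helper (not an existing Matomo function), and treating HTTP 429 or a "rate limit" / "too many requests" phrase in the body as the signal is a guess, since the thread does not show what the search engines actually return when they throttle.

```php
<?php
// Hypothetical helper, not part of Matomo: decides whether a failed fetch
// looks like rate limiting rather than a genuine change in the response.
function looksRateLimited(int $statusCode, string $body): bool
{
    // 429 is the standard "Too Many Requests" status code.
    if ($statusCode === 429) {
        return true;
    }

    // Assumption: a throttled response mentions rate limiting in its body.
    return preg_match('/rate limit|too many requests/i', $body) === 1;
}
```

In a PHPUnit test, a `true` result could lead to `$this->markTestSkipped(...)`, while every other unexpected response would still fail the test.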

geekdenz · 2021-09-14T04:39:42Z

"That actually seems to fail randomly"

You are right @sgiehl ! That was the clue.

See

matomo/plugins/SEO/tests/Integration/SEOTest.php

Line 42 in 41b8240

    
           $_SERVER['HTTP_USER_AGENT'] = $user_agents[mt_rand(0, count($user_agents) - 1)];

I debugged it and it seems it randomly fails on the User Agent that is passed.

So, I wrote this script:
https://gist.github.com/geekdenz/2496eaaf7c437ba49bc389e75a10b880

Ran it and found that it outputs an empty search result on offset $i % 3 === 2:

So, it seems to be this UA that it does not like:

matomo/plugins/SEO/tests/Integration/SEOTest.php

Line 39 in 41b8240

    
           'Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/59.0.3071.115 Safari/537.36',

I'll change it to a different one, as Bing seems not to work with that one. Also, I think it should maybe not be completely random and should instead be different on every subsequent call. But I realise that might be hard to do. Could it be $build_no % count($user_agents)?
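
A rough sketch of that deterministic pick, assuming the build number is available as an integer. `pickUserAgent()` is a hypothetical helper name, and where the build number comes from (for example a CI environment variable) is left open.

```php
<?php
// Hypothetical helper: picks a user agent deterministically from the build
// number, so each build uses exactly one, reproducible entry of the list.
function pickUserAgent(array $userAgents, int $buildNo): string
{
    $userAgents = array_values($userAgents); // normalise to 0-based keys
    $count = count($userAgents);
    if ($count === 0) {
        throw new InvalidArgumentException('At least one user agent is required.');
    }

    // The double modulo keeps the index non-negative even for negative input.
    $index = (($buildNo % $count) + $count) % $count;

    return $userAgents[$index];
}
```

With this, a failing build can be re-run against the same user agent, which the `mt_rand()` call does not allow.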

fixes #17919

geekdenz · 2021-09-14T07:30:45Z

FYI I was able to reproduce this in Safari changing the UA string to Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/59.0.3071.115 Safari/537.36

fixes #17919

justinvelluppillai · 2021-09-14T07:39:36Z

@geekdenz nice find. Don't know if mt_rand is great in a test either, wonder if it could do something along the lines of either using each of the user agents one after the other or what @tsteur suggested and keep trying a different user agent if one fails (depending on whether there's a reason to test different user agents or not).

geekdenz · 2021-09-14T21:32:40Z

I think using the same user agent every time would suffice actually.

geekdenz · 2021-09-14T23:26:42Z

@sgiehl @tsteur

Why are we using an array of user agents here? I feel like there would have been an original reason/issue with just using one. Or was it just added to avoid a problem that was merely anticipated?

sgiehl · 2021-09-15T09:45:31Z

@geekdenz The code that queries the metrics uses $_SERVER['HTTP_USER_AGENT'], i.e. the user agent of the current user. I wonder if it might make sense to check whether the results are empty and retry the query with another user agent, so the results are fetched correctly even if the user agent is not supported 🤔
Using random user agents in tests sounds valid. Otherwise we might not have discovered this. But it would be good to add a comment somewhere in the code noting that Bing might not return results for some user agents.
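
The retry idea could look roughly like the sketch below. `fetchWithFallback()` and the `$fetch` callable are hypothetical stand-ins for whatever performs the real request; nothing here is taken from the actual SEO plugin code.

```php
<?php
// Hypothetical sketch: try each user agent in order and return the first
// non-empty result. $fetch has the shape function (string $userAgent): ?string.
function fetchWithFallback(callable $fetch, array $userAgents): ?string
{
    foreach ($userAgents as $userAgent) {
        $result = $fetch($userAgent);
        if ($result !== null && $result !== '') {
            return $result; // this user agent produced a usable response
        }
    }

    // Every user agent produced an empty result.
    return null;
}
```

The current user's agent would go first in the list, followed by one or more user agents known to work as a backup.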


justinvelluppillai · 2021-09-15T22:24:15Z

@sgiehl @geekdenz I think the test should either test a few different UAs or just one, rather than a random one. If we test one, we could use that same one with confidence as the backup user agent to retry with on empty results. Maybe worth creating a separate issue for that. I've merged this one for now to prevent the failing test in the meantime.

tsteur · 2021-09-17T03:50:53Z

@geekdenz this still seems to be failing randomly see https://app.travis-ci.com/github/matomo-org/matomo/jobs/537951969#L1130-L1130

or am I looking wrong? The test ran after the other PR was merged AFAIK

geekdenz · 2021-09-17T04:29:54Z

Yeah. Thanks for pointing it out. I saw this earlier as well but thought it might be an artifact of a merge not having happened somewhere. I think maybe what I found only looked like this in Safari. I'll have another look in Chrome. And then I might reinstate the array and make it retry with another UA if the test fails on the first, etc., and only have it throw a failure if all of them fail.

justinvelluppillai · 2021-10-05T00:09:36Z

I think it's possible for this to fail if the actual search results on Bing contain the word "results". It's a rare case; it may happen occasionally when microsoft.com publishes new company "results" for investors, or at random. Perhaps a solution would be to make the regex that searches for the number of search results more restrictive. The change from @geekdenz at least prevents the test failing 1 in 3 times and may also be sufficient to close this.
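
As an illustration only: the sketch below assumes the count appears in the text as a number directly followed by the word "results", which is a guess because the thread does not show Bing's real markup or the regex currently in use. `extractResultCount()` is a hypothetical name.

```php
<?php
// Hypothetical sketch of a stricter pattern: the word "results" only counts
// when a number (optionally with thousands separators) directly precedes it.
function extractResultCount(string $text): ?int
{
    $pattern = '/(?<![\w.,])(\d{1,3}(?:[.,]\d{3})*|\d+)\s+results\b/i';
    if (preg_match($pattern, $text, $matches) !== 1) {
        return null; // no "<number> results" phrase found
    }

    // Strip thousands separators before converting to an integer.
    return (int) str_replace([',', '.'], '', $matches[1]);
}
```

A search hit whose text merely mentions "results" without a leading number would then no longer be mistaken for the result count.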

peterhashair · 2021-10-11T02:00:37Z

I had a discussion with @justinvelluppillai about this one. We suggest printing the IP when the tests fail on Travis; if the same IP keeps failing, it is probably because that IP is already blocked. Let me know if there is a security risk in printing the Travis IP.

tsteur added the Bug and Regression labels Aug 25, 2021

tsteur added this to the 4.5.0 milestone Aug 25, 2021

geekdenz self-assigned this Sep 14, 2021

geekdenz pushed a commit that referenced this issue Sep 14, 2021

change bogus user agent that fails on bing.com

d6617ae

fixes #17919

geekdenz mentioned this issue Sep 14, 2021

change bogus user agent that fails on bing.com #17993

Merged

11 tasks

geekdenz pushed a commit that referenced this issue Sep 14, 2021

change bogus user agent that fails on bing.com

de96268

fixes #17919

justinvelluppillai closed this as completed in #17993 Sep 15, 2021

justinvelluppillai pushed a commit that referenced this issue Sep 15, 2021

change bogus user agent that fails on bing.com (#17993)

eecc8b9

* change bogus user agent that fails on bing.com fixes #17919 * only use one user agent for seo test

tsteur reopened this Sep 17, 2021

justinvelluppillai unassigned geekdenz Sep 27, 2021

justinvelluppillai modified the milestones: 4.5.0, 4.6.0 Oct 7, 2021

tsteur changed the title ~~SEO Ranking plugin not working - problem with ping?~~ SEO Ranking plugin not working - problem with ping? - Fix the random test failures Oct 7, 2021

peterhashair self-assigned this Oct 7, 2021

peterhashair mentioned this issue Oct 7, 2021

SEO ranking Plugin tests update #18109

Merged

11 tasks

tsteur closed this as completed in #18109 Oct 8, 2021

tsteur reopened this Oct 8, 2021

peterhashair mentioned this issue Oct 11, 2021

SEOTest failed print REMOTE_ADDR #18125

Merged

11 tasks

peterhashair closed this as completed in #18125 Oct 11, 2021

peterhashair mentioned this issue Oct 12, 2021

update SEO Test #18142

Merged

11 tasks

justinvelluppillai added the not-in-changelog label Nov 29, 2021
