@orlitzky opened this Issue on July 28th 2020

We hit this SQL error last night:

Error query: SQLSTATE[22001]: String data, right truncated: 1406 Data too long for column 'user_id' at row 1...
Parameters: array (
ERROR BulkTracking[2020-07-28 06:59:44 UTC] [6291e] 0 => '�?���',
ERROR BulkTracking[2020-07-28 06:59:44 UTC] [6291e] 1 => 'S�(<�',
ERROR BulkTracking[2020-07-28 06:59:44 UTC] [6291e] 2 => '�k�w',
ERROR BulkTracking[2020-07-28 06:59:44 UTC] [6291e] 3 => 237,
ERROR BulkTracking[2020-07-28 06:59:44 UTC] [6291e] 4 => '%252525252525252525252525252525252525252525252525252525252525252525252525252525252528...%252525252525252525252525252525252525252525252525252525252525252525252525252525252529a.langton<a class='mention' href='https://github.com/Sus'>@Sus</a>.ta.i.n.j.ex.k<a class='mention' href='https://github.com/fen'>@fen</a>.Gku.an.gx.r.ku.ai8.xn%252525252525252525252525252525252525252525252525252525252525252525252525252525252520.xn%252525252525252525252525252525252525252525252525252525252525252525252525252525252520.u.k<a class='mention' href='https://github.com/Meli'>@Meli</a>.S.a.Ri.c.h4223<a class='mention' href='https://github.com/e'>@e</a>.xultan.tacoustic.sfat.lettuceerz<a class='mention' href='https://github.com/fault'>@fault</a>.ybeamdulltnderwearertwe.s.e<a class='mention' href='https://github.com/p'>@p</a>.laus.i.bleljh<a class='mention' href='https://github.com/r'>@r</a>.eces.si.v.e.x.g.z<a class='mention' href='https://github.com/leanna'>@leanna</a>.langton<a class='mention' href='https://github.com/WWW'>@WWW</a>.EMEKAOLISA<a class='mention' href='https://github.com/www'>@www</a>.karunakumari46<a class='mention' href='https://github.com/sh'>@sh</a>.jdus.h.a.i.j.5.8.7.4.8574.85<a class='mention' href='https://github.com/c'>@c</a>.o.nne.c.t.tn.tu<a class='mention' href='https://github.com/Go'>@Go</a>.O.gle.email.2.%25252525252525252525252525252525252525252525252525252525252525252525252525252525255C%25252525252525252525252525252525252525252525252525252525252525252525252525252525255Cn1<a class='mention' href='https://github.com/sarahjohnsonw'>@sarahjohnsonw</a>.estbrookbertrew.e.r<a class='mention' href='https://github.com/hu'>@hu</a>.fe.ng.k.Ua.ngniu.bi..uk41<a class='mention' href='https://github.com/Www'>@Www</a>.Zanele<a class='mention' href='https://github.com/silvia'>@silvia</a>.woodw.o.r.t.h<a class='mention' href='https://github.com/w'>@w</a>.anting.parentcrazyre.stfir.stdro<a class='mention' href='https://github.com/www'>@www</a>.mondaymorninginspiration<a class='mention' href='https://github.com/fidelia'>@fidelia</a>.commons<a class='mention' href='https://github.com/Hu'>@Hu</a>.Fen.Gk.Uang.Ni.U.B.I.Xn--.U.K.6.2<a class='mention' href='https://github.com/p'>@p</a>.a.r.a.ju.mp.e.r.sj.a.s.s.en20.14<a class='mention' href='https://github.com/Leanna'>@Leanna</a>.Langton<a class='mention' href='https://github.com/Your'>@Your</a>.Qwe.Aqmail<a class='mention' href='https://github.com/Sus'>@Sus</a>.Ta.I.N.J.Ex.K',
ERROR BulkTracking[2020-07-28 06:59:44 UTC] [6291e] 5 => '2020-07-27 05:16:49',

Obviously that horrendous "user id" is to blame. The associated log file entry is,

www.example.org 172.107.167.119 - %252525252525252525252525252525252525252525252525252525252525252525252525252525252528...%252525252525252525252525252525252525252525252525252525252525252525252525252525252529a.langton<a class='mention' href='https://github.com/Sus'>@Sus</a>.ta.i.n.j.ex.k<a class='mention' href='https://github.com/fen'>@fen</a>.Gku.an.gx.r.ku.ai8.xn%252525252525252525252525252525252525252525252525252525252525252525252525252525252520.xn%252525252525252525252525252525252525252525252525252525252525252525252525252525252520.u.k<a class='mention' href='https://github.com/Meli'>@Meli</a>.S.a.Ri.c.h4223<a class='mention' href='https://github.com/e'>@e</a>.xultan.tacoustic.sfat.lettuceerz<a class='mention' href='https://github.com/fault'>@fault</a>.ybeamdulltnderwearertwe.s.e<a class='mention' href='https://github.com/p'>@p</a>.laus.i.bleljh<a class='mention' href='https://github.com/r'>@r</a>.eces.si.v.e.x.g.z<a class='mention' href='https://github.com/leanna'>@leanna</a>.langton<a class='mention' href='https://github.com/WWW'>@WWW</a>.EMEKAOLISA<a class='mention' href='https://github.com/www'>@www</a>.karunakumari46<a class='mention' href='https://github.com/sh'>@sh</a>.jdus.h.a.i.j.5.8.7.4.8574.85<a class='mention' href='https://github.com/c'>@c</a>.o.nne.c.t.tn.tu<a class='mention' href='https://github.com/Go'>@Go</a>.O.gle.email.2.%25252525252525252525252525252525252525252525252525252525252525252525252525252525255C%25252525252525252525252525252525252525252525252525252525252525252525252525252525255Cn1<a class='mention' href='https://github.com/sarahjohnsonw'>@sarahjohnsonw</a>.estbrookbertrew.e.r<a class='mention' href='https://github.com/hu'>@hu</a>.fe.ng.k.Ua.ngniu.bi..uk41<a class='mention' href='https://github.com/Www'>@Www</a>.Zanele<a class='mention' href='https://github.com/silvia'>@silvia</a>.woodw.o.r.t.h<a class='mention' href='https://github.com/w'>@w</a>.anting.parentcrazyre.stfir.stdro<a class='mention' href='https://github.com/www'>@www</a>.mondaymorninginspiration<a class='mention' href='https://github.com/fidelia'>@fidelia</a>.commons<a class='mention' href='https://github.com/Hu'>@Hu</a>.Fen.Gk.Uang.Ni.U.B.I.Xn--.U.K.6.2<a class='mention' href='https://github.com/p'>@p</a>.a.r.a.ju.mp.e.r.sj.a.s.s.en20.14<a class='mention' href='https://github.com/Leanna'>@Leanna</a>.Langton<a class='mention' href='https://github.com/Your'>@Your</a>.Qwe.Aqmail<a class='mention' href='https://github.com/Sus'>@Sus</a>.Ta.I.N.J.Ex.K [27/Jul/2020:01:16:49 -0400] "GET /en/node/2265/track HTTP/1.1" 301 - "https://%25252525252525252525252525252525252525252525252525252525252525252525252525252525252528...%25252525252525252525252525252525252525252525252525252525252525252525252525252525252529a.langton<a class='mention' href='https://github.com/Sus'>@Sus</a>.ta.i.n.j.ex.k<a class='mention' href='https://github.com/fen'>@fen</a>.Gku.an.gx.r.ku.ai8.xn%25252525252525252525252525252525252525252525252525252525252525252525252525252525252520.xn%25252525252525252525252525252525252525252525252525252525252525252525252525252525252520.u.k<a class='mention' href='https://github.com/Meli'>@Meli</a>.S.a.Ri.c.h4223<a class='mention' href='https://github.com/e'>@e</a>.xultan.tacoustic.sfat.lettuceerz<a class='mention' href='https://github.com/fault'>@fault</a>.ybeamdulltnderwearertwe.s.e<a class='mention' href='https://github.com/p'>@p</a>.laus.i.bleljh<a class='mention' href='https://github.com/r'>@r</a>.eces.si.v.e.x.g.z<a class='mention' href='https://github.com/leanna'>@leanna</a>.langton<a class='mention' href='https://github.com/WWW'>@WWW</a>.EMEKAOLISA<a class='mention' href='https://github.com/www'>@www</a>.karunakumari46<a class='mention' href='https://github.com/sh'>@sh</a>.jdus.h.a.i.j.5.8.7.4.8574.85<a class='mention' href='https://github.com/c'>@c</a>.o.nne.c.t.tn.tu<a class='mention' href='https://github.com/Go'>@Go</a>.O.gle.email.2.%2525252525252525252525252525252525252525252525252525252525252525252525252525252525255C%2525252525252525252525252525252525252525252525252525252525252525252525252525252525255Cn1<a class='mention' href='https://github.com/sarahjohnsonw'>@sarahjohnsonw</a>.estbrookbertrew.e.r<a class='mention' href='https://github.com/hu'>@hu</a>.fe.ng.k.Ua.ngniu.bi..uk41<a class='mention' href='https://github.com/Www'>@Www</a>.Zanele<a class='mention' href='https://github.com/silvia'>@silvia</a>.woodw.o.r.t.h<a class='mention' href='https://github.com/w'>@w</a>.anting.parentcrazyre.stfir.stdro<a class='mention' href='https://github.com/www'>@www</a>.mondaymorninginspiration<a class='mention' href='https://github.com/fidelia'>@fidelia</a>.commons<a class='mention' href='https://github.com/Hu'>@Hu</a>.Fen.Gk.Uang.Ni.U.B.I.Xn--.U.K.6.2<a class='mention' href='https://github.com/p'>@p</a>.a.r.a.ju.mp.e.r.sj.a.s.s.en20.14<a class='mention' href='https://github.com/Leanna'>@Leanna</a>.Langton<a class='mention' href='https://github.com/Your'>@Your</a>.Qwe.Aqmail<a class='mention' href='https://github.com/Sus'>@Sus</a>.Ta.I.N.J.Ex.K<a class='mention' href='https://github.com/www'>@www</a>.example.org/" "Mozilla/5.0 (X11; OpenBSD i386) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/43.0.2357.125 Safari/537.36"

This is during an off-line mass log file import. I guess that column should be sanitized, within reason?

@tsteur commented on July 28th 2020 Member

Thanks for letting us know @orlitzky

I've created a PR for this in https://github.com/matomo-org/matomo/pull/16250 which should fix this and truncate the userId automatically to 200.

This Issue was closed on July 31st 2020
Powered by GitHub Issue Mirror