Food for thoughts:
SELECTuri_host, uri_path, uri_query, COUNT(1) as cFROM wmf.webrequestWHERE webrequest_source IN ('text', 'mobile') AND <YEAR/MONTH/DAY/HOUR restricted>AND is_pageview AND pageview_info['page_title'] = '-'GROUP BY uri_host, uri_path, uri_queryORDER BY c DESC LIMIT 100;en.wikipedia.org /w/index.php ?redirs=0&search=Calvary%20%2B%20Film%20%2B%20Brendan%20Gleeson%20%2B%202014&fulltext=Search&ns0=1 356en.wikipedia.org /w/index.php ?redirs=0&search=The%20Woman%20in%20Black%202%3A%20Angel%20of%20Death%20%2B%20Film%20%2B%20Jeremy%20Irvine%20%2B%202014&fulltext=Search&ns0=1 317en.wikipedia.org /w/index.php ?redirs=0&search=We%20Are%20the%20Best%21%20%2B%20Film%20%2B%20Vanja%20Engstr%C3%B6m%20%2B%202013&fulltext=Search&ns0=1 207en.wikipedia.org /w/index.php ?redirs=0&search=Manhattan%20%2B%20Film%20%2B%20John%20Benjamin%20Hickey%20%2B%202015&fulltext=Search&ns0=1 200en.wikipedia.org /w/index.php ?redirs=0&search=Angus,%20Thongs%20and%20Perfect%20Snogging%20%2B%20Film%20%2B%20Aaron%20Taylor-Johnson%20%2B%202008&fulltext=Search&ns0=1 145en.wikipedia.org /w/index.php ?redirs=0&search=Once%20Upon%20a%20Time%20%2B%20Film%20%2B%20Ginnifer%20Goodwin%20%2B%202015&fulltext=Search&ns0=1 69en.wikipedia.org /w/index.php ?redirs=0&search=The%20Birdcage%20%2B%20Film%20%2B%20Robin%20Williams%20%2B%201996&fulltext=Search&ns0=1 68en.wikipedia.org /w/index.php ?redirs=0&search=We%20Are%20What%20We%20Are%20%2B%20Film%20%2B%20Vonia%20Arslanian%20%2B%202013&fulltext=Search&ns0=1 51en.wikipedia.org /w/index.php ?redirs=0&search=We're%20the%20Millers%20%2B%20Film%20%2B%20Jason%20Sudeikis%20%2B%202013&fulltext=Search&ns0=1 48en.wikipedia.org /w/index.php ?redirs=0&search=Marina%20%2B%20Film%20%2B%20Matteo%20Simoni%20%2B%202013&fulltext=Search&ns0=1 41en.wikipedia.org /w/index.php ?redirs=0&search=Zurich%20%2B%20Film%20%2B%20Wende%20Snijders%20%2B%202015&fulltext=Search&ns0=1 37en.wikipedia.org /w/index.php ?redirs=0&search=Extraterrestre%20%2B%20Film%20%2B%20Juli%C3%A1n%20Villagr%C3%A1n%20%2B%202011&fulltext=Search&ns0=1 34en.wikipedia.org /w/index.php ?redirs=0&search=The%20Book%20Thief%20%2B%20Film%20%2B%20Sophie%20N%C3%A9lisse%20%2B%202013&fulltext=Search&ns0=1 32en.wikipedia.org /w/index.php ?redirs=0&search=Exodus%3A%20Gods%20and%20Kings%20%2B%20Film%20%2B%20Christian%20Bale%20%2B%202014&fulltext=Search&ns0=1 29en.wikipedia.org /w/index.php ?redirs=0&search=Event%20Horizon%20%2B%20Film%20%2B%20Laurence%20Fishburne%20%2B%201997&fulltext=Search&ns0=1 26en.wikipedia.org /w/index.php ?redirs=0&search=Hart's%20War%20%2B%20Film%20%2B%20Bruce%20Willis%20%2B%202002&fulltext=Search&ns0=1 24en.wikipedia.org /w/index.php ?redirs=0&search=Selma%20%2B%20Film%20%2B%20David%20Oyelowo%20%2B%202014&fulltext=Search&ns0=1 23en.wikipedia.org /w/index.php ?redirs=0&search=The%20Bridge%20%2B%20Film%20%2B%20Sofia%20Helin%20%2B%202011&fulltext=Search&ns0=1 23en.wikipedia.org /w/index.php ?redirs=0&search=Rob%20the%20Mob%20%2B%20Film%20%2B%20Michael%20Pitt%20%2B%202014&fulltext=Search&ns0=1 23
On Wed, Dec 2, 2015 at 6:16 PM, Oliver Keyes <okeyes@wikimedia.org> wrote:I mean, now I want to know how we can have a condition where there's
no page title but it registers as a pageview.
On 2 December 2015 at 12:14, Joseph Allemandou
<jallemandou@wikimedia.org> wrote:
> Double checked:
> https://github.com/wikimedia/analytics-refinery-source/blob/master/refinery-core/src/main/java/org/wikimedia/analytics/refinery/core/PageviewDefinition.java#L117
>
> This value is the default when no page title is found.
> I agree it's not very explicit.
> Any suggestion on changing it, or should we just make sure it is documented
> ?
>
> On Wed, Dec 2, 2015 at 6:10 PM, Oliver Keyes <okeyes@wikimedia.org> wrote:
>>
>> Can someone dig into it? We should really be excluding that (unless it
>> is the page on the dash ;p)
>>
>> On 2 December 2015 at 12:00, Dan Andreescu <dandreescu@wikimedia.org>
>> wrote:
>> > I always wonder about that. There's also an actual page that could
>> > theoretically be hit:
>> > https://en.wikipedia.org/w/index.php?title=-&redirect=no
>> >
>> > On Wed, Dec 2, 2015 at 11:58 AM, Gabriel Wicke <gwicke@wikimedia.org>
>> > wrote:
>> >>
>> >> Historically, I vaguely remember that we have used that title for user
>> >> script / style loading with action=raw. I think that's gone from the
>> >> skin code, but it's possible that user scripts still reference this
>> >> title.
>> >>
>> >> Gabriel
>> >>
>> >> On Wed, Dec 2, 2015 at 8:41 AM, Oliver Keyes <okeyes@wikimedia.org>
>> >> wrote:
>> >> > One of the most prominent top articles has no page; it's "-". What is
>> >> > this?
>> >> >
>> >> > --
>> >> > Oliver Keyes
>> >> > Count Logula
>> >> > Wikimedia Foundation
>> >> >
>> >> > _______________________________________________
>> >> > Analytics mailing list
>> >> > Analytics@lists.wikimedia.org
>> >> > https://lists.wikimedia.org/mailman/listinfo/analytics
>> >>
>> >>
>> >>
>> >> --
>> >> Gabriel Wicke
>> >> Principal Engineer, Wikimedia Foundation
>> >>
>> >> _______________________________________________
>> >> Analytics mailing list
>> >> Analytics@lists.wikimedia.org
>> >> https://lists.wikimedia.org/mailman/listinfo/analytics
>> >
>> >
>> >
>> > _______________________________________________
>> > Analytics mailing list
>> > Analytics@lists.wikimedia.org
>> > https://lists.wikimedia.org/mailman/listinfo/analytics
>> >
>>
>>
>>
>> --
>> Oliver Keyes
>> Count Logula
>> Wikimedia Foundation
>>
>> _______________________________________________
>> Analytics mailing list
>> Analytics@lists.wikimedia.org
>> https://lists.wikimedia.org/mailman/listinfo/analytics
>
>
>
>
> --
> Joseph Allemandou
> Data Engineer @ Wikimedia Foundation
> IRC: joal
>
> _______________________________________________
> Analytics mailing list
> Analytics@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/analytics
>
--
Oliver Keyes
Count Logula
Wikimedia Foundation
_______________________________________________
Analytics mailing list
Analytics@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics
--Joseph AllemandouData Engineer @ Wikimedia FoundationIRC: joal
_______________________________________________
Analytics mailing list
Analytics@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics