Ugh, this wasn't meant to go on-list
(obviously). Sorry!
-- Timo
On Wed, 19 Dec 2018 at 18:55, Timo Tijhof <ttijhof(a)wikimedia.org> wrote:
Hi Nuria,
As I understand it, in Safari and Chrome with the default settings,
there is a native browser feature that, when searching through the address
bar (Google powered) by default silently starts loading the url of the top
result shown below the address bar. Maybe there's a way we opted out, but I
think it applies to Wikipedia as well.
I'm replying privately because I didn't understand the last part of
your email, and maybe we are saying the same thing :)
-- Timo
On Wed, 19 Dec 2018 at 14:14, Nuria Ruiz <nuria(a)wikimedia.org> wrote:
> > I think that's for the Page Previews feature (i.e., when a user
> hovers over a link on desktop Wikipedia) or
> > its corresponding feature in the the Wikipedia for Android
> (triggered by default on link tap)
> The code that Fran pointed to only discounts "previews" by Android app
> as we stablished that convention a while back. Page previews (hovers over
> wikipedia links that display a short popup) are not counted as pageviews at
> all at this time.
>
> >By "prefetching", I meant X's Wikipedia page shows up in the
search
> results and the browser prefetches/preloads the search results but I do not
> click on X's Wikipedia page. If so, the >pageview data seem to over-count
> the number of visits to X's Wikipedia page.
> This functionality needs to be implemented by the client (it is not
> automagically implemented by the browser) and it is not implemented on
> Wikipedia. Searches trigger requests to the API, that return pageview urls,
> not pageview prefetches. You can follow these workflows on the dev tools of
> the browser you might be using. chrome://net-export/ is a new addition to
> the toolset that gives you a readable dump.
>
> >Are you saying that browser-based prefetch activity (e.g., with
> resource hinting like
https://www.w3.org/TR/resource-hints/ ) is also
> tagged the same way?
> No. Browser prefetches cannot be tagged, they are initiated by the
> browser. Wikipedia's pages do not do any prefetches and or pre-renderings
> of content using those directives as far as I can see. dns-prefetches are
> done for two domains: login and meta, neither of which counts as a pageview
> cause a dns prefetch does not receive an http response. Just instantiates a
> connection and resolves TLS if any. Prefetches might be indicated by a
> standard set of headers like "Link:" in the future (this would be
initiated
> by the browser) but that seems on the works.
>
> Thanks,
>
> Nuria
>
>
> On Wed, Dec 19, 2018 at 9:43 AM Adam Baso <abaso(a)wikimedia.org> wrote:
>
>> Fran, the preview to which you refer, I think that's for the Page
>> Previews feature (i.e., when a user hovers over a link on desktop
>> Wikipedia) or its corresponding feature in the the Wikipedia for Android
>> (triggered by default on link tap) and Wikipedia for iOS (force press)
>> apps, is that right? Are you saying that browser-based prefetch activity
>> (e.g., with resource hinting like
>>
https://www.w3.org/TR/resource-hints/ ) is also tagged the same way?
>>
>> Chenqi Zhu, I think what you're suggesting is the possibility that
>> browsers might be issuing HTTP prefetches for Wikimedia-hosted pages and
>> that could inflate pageviews. I'm not sure, but have you happened to
>> observe user agents making prefetches when resource hinting (
>>
https://www.w3.org/TR/resource-hints/ ) is absent? I'm not sure how
>> often, if at all, discovery platforms like search engines are actually
>> placing resource hints into markup (which is mostly deterministic as far as
>> browser behavior) for Wikimedia content...nor to what degree there might be
>> heuristics being used for prefetching independently of any resource hints.
>> Do you have any data or field observations to help clarify?
>>
>> Browser settings allude to this sort of behavior (e.g., in Chrome,
>> there's "Use a prediction service to load pages more quickly",
which is
>> described at
https://support.google.com/chrome/answer/114836 ),
>> although I think without digging into browser source code it's a bit hard
>> to know for certain what's going on "under the hood". We do use
preconnect
>> and prefetch semantics and the like in different contexts (cf.
>>
https://phabricator.wikimedia.org/search/query/G2tr8i0YZii9 ,
>>
https://phabricator.wikimedia.org/search/query/.dtx_hqaj3wj ,
>> ...there may be more).
>>
>> -Adam
>>
>>
>>
>> On Wed, Dec 19, 2018 at 11:00 AM Francisco Dans <fdans(a)wikimedia.org>
>> wrote:
>>
>>> Hi Chenqi,
>>>
>>> You can find out more about what constitutes a pageview in its
>>> Metawiki article:
>>>
>>>
https://meta.wikimedia.org/wiki/Research:Page_view#Definition
>>>
>>> As you can see, one of the conditions is whether the request being
>>> examined is a preview or not, in which case it is not counted as a page
>>> view. Hope this helps!
>>>
>>> Fran
>>>
>>> On Wed, Dec 19, 2018 at 5:30 PM Chenqi Zhu <czhu(a)stern.nyu.edu>
>>> wrote:
>>>
>>>> Hi everyone,
>>>>
>>>> I am trying to better understand the pageview data. I have a quick
>>>> question. I apologize if the question has been asked or it is so naive.
>>>>
>>>> If the web browser prefetches a Wikipedia page, does it count as
>>>> one pageview in the pageview data? By "prefetching", I meant
X's Wikipedia
>>>> page shows up in the search results and the browser prefetches/preloads
the
>>>> search results but I do not click on X's Wikipedia page. If so, the
>>>> pageview data seem to over-count the number of visits to X's
Wikipedia page.
>>>>
>>>> Thanks in advance for any insight.
>>>>
>>>>
>>>> Chenqi Zhu
>>>> New York University
>>>> 44 W 4th St., Suite 10-185(B),
>>>> New York, NY 10012, U.S.A.
>>>> _______________________________________________
>>>> Analytics mailing list
>>>> Analytics(a)lists.wikimedia.org
>>>>
https://lists.wikimedia.org/mailman/listinfo/analytics
>>>>
>>>
>>>
>>> --
>>> *Francisco Dans*
>>> Software Engineer, Analytics Team
>>> Wikimedia Foundation
>>> _______________________________________________
>>> Analytics mailing list
>>> Analytics(a)lists.wikimedia.org
>>>
https://lists.wikimedia.org/mailman/listinfo/analytics
>>>
>> _______________________________________________
>> Analytics mailing list
>> Analytics(a)lists.wikimedia.org
>>
https://lists.wikimedia.org/mailman/listinfo/analytics
>>
> _______________________________________________
> Analytics mailing list
> Analytics(a)lists.wikimedia.org
>
https://lists.wikimedia.org/mailman/listinfo/analytics
>
_______________________________________________