Hey Gilles,
Thanks much for getting this done. We hope to announce the media file requests dump any moment now.
The initial release will be without this new data, but I hope we can incorporate it asap.
Cheers,
Erik
From: Nuria Ruiz [mailto:nuria@wikimedia.org] Sent: Thursday, March 19, 2015 17:47 To: A mailing list for the Analytics Team at WMF and everybody who has an interest in Wikipedia and analytics. Cc: Erik Zachte Subject: Re: [Analytics] Virtual file view hack for Media Viewer views
What happens after that hasn't been configured and is starting to be way beyond the scope of what the Multimedia team should be involved with.
True, but we need to know on our end whether some special treatment needs to be applied to this data.
For example, if these have to be counted as pageviews we should know. By Erik Z's reply early on we assume these are not to be counted as pageviews but rather "media files views" so they will "sit" on the refined tables with is_Pageview=0 and some code will harvest those to count them as "media file request counts".
Thanks,
Nuria
On Thu, Mar 19, 2015 at 6:35 AM, Andrew Otto aotto@wikimedia.org wrote:
Any webrequest is picked up by varnishkafka and goes into the webrequest logs in the cluster. If you want special treatment of your request, say different formatting or different logs, we’ll have to do something else :)
On Mar 19, 2015, at 04:45, Gilles Dubuc gilles@wikimedia.org wrote:
Hi Nuria,
As far as I'm aware, it only goes to the varnish logs at the moment. What happens after that hasn't been configured and is starting to be way beyond the scope of what the Multimedia team should be involved with.
CCing Ori who knows whether the hits to the beacon URI are picked up by varnishkafka or not at the moment, since he set up the endpoint.
On Wed, Mar 18, 2015 at 3:42 PM, Nuria Ruiz nuria@wikimedia.org wrote:
Gilles:
And we know this data is coming via varnishkafka into the cluster, right? Did we checked that?
Thanks,
Nuria
On Wed, Mar 18, 2015 at 2:02 AM, Gilles Dubuc gilles@wikimedia.org wrote:
Our solution for this is now live.
Here's an example of a media beacon hit:
http://bits.wikimedia.org/beacon/media?duration=3709 http://bits.wikimedia.org/beacon/media?duration=3709&uri=http%3A%2F%2Fupload.beta.wmflabs.org%2Fwikipedia%2Fen%2Fthumb%2Fb%2Fb0%2FSunrise_over_fishing_boats_in_Kerala.jpg%2F640px-Sunrise_over_fishing_boats_in_Kerala.jpg &uri=http%3A%2F%2Fupload.beta.wmflabs.org%2Fwikipedia%2Fen%2Fthumb%2Fb%2Fb0%2FSunrise_over_fishing_boats_in_Kerala.jpg%2F640px-Sunrise_over_fishing_boats_in_Kerala.jpg
Beta is currently hitting that endpoint and production wikis will start doing the same once they start running 1.25wmf22
All views coming from Media Viewer will be hitting that endpoint. Note that there might be some loss of hits on browsers that don't support sendBeacon, since our fallback is a simple async AJAX request (we haven't tried to go beyond that with local storage and replaying the event, etc.) and this event might be fired in situations of tab/browser close as well as navigating away from the page. Thus keep in mind that a steady small increase of those hits over a long period of time might simply be the natural process of people upgrading their browsers to more modern ones.
On Fri, Feb 6, 2015 at 4:46 PM, Nuria Ruiz nuria@wikimedia.org wrote:
A dummy image request seems rather reasonable. (I assume varnish can handle such load of "atypical" requests.)
Right, the filtering for beacons is already in place in vcl and responses are sent right away so as far as I know there is no better place than varnish for this code. See example:
https://github.com/wikimedia/operations-puppet/blob/production/templates/var...
On Thu, Feb 5, 2015 at 10:38 PM, Federico Leva (Nemo) nemowiki@gmail.com wrote:
Gergo Tisza, 04/02/2015 21:00:
Do you see any fundamental problem with this?
A dummy image request seems rather reasonable. (I assume varnish can handle such load of "atypical" requests.) Making additional requests is ugly, but until we get SPDY our articles typically make dozens or hundreds requests, so the effect looks negligible.
Nemo
_______________________________________________ Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
_______________________________________________ Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
_______________________________________________ Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
_______________________________________________ Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
_______________________________________________ Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
_______________________________________________ Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics