agreed. Many of these articles will see spikes in traffic during the test (as the sample includes many celebrity articles) but the historical volume of traffic for the whole sample should give us a decent estimate of the throughput.

I also wouldn’t worry about any events other than MobileWebWikiGrok.page-impression and the events in the error log: all other events require user interaction.

Dario

On Jan 7, 2015, at 7:08 AM, Aaron Halfaker <ahalfaker@wikimedia.org> wrote:

Leila,

It might be worthwhile to merge that article set with the webrequest data we have in order to get a sense for how many pageloads/second to expect.  

-Aaron

On Tue, Jan 6, 2015 at 7:50 PM, Ryan Kaldari <rkaldari@wikimedia.org> wrote:
The highest volume events we are going to log will be:
1. For each of the 166,000 articles, one event when the page loads
2. For each of the 166,000 articles, one event when the WikiGrok widget enters the viewport (about half as often as #1)

These will be active for all mobile users, logged in and logged out, including many high pageview articles.

Given that information, do you have any idea if we are in danger of overloading EventLogging? If so, do you have recommendations on sampling? So far, everyone has said not to worry about it, but it would be good to get a sanity check for this test specifically.

Kaldari

On Tue, Jan 6, 2015 at 4:57 PM, Nuria Ruiz <nuria@wikimedia.org> wrote:
(cc-ing mobile-tech)

Since we do not the details of how wikigrok is used and its throughput of requests we can not "estimate" sampling ourselves. I imagine wikigrok is been deployed to a number of users and it is with that usage the mobile team could estimate the total throughput expected, with this throughput we can recommend sampling ratios. 


Thanks for asking about this without before deploying!


On Tue, Jan 6, 2015 at 4:55 PM, Ryan Kaldari <rkaldari@wikimedia.org> wrote:
I can elaborate on this after I finished the SWAT deployment.... Gimme 30 minutes or so.

On Tue, Jan 6, 2015 at 4:51 PM, Leila Zia <leila@wikimedia.org> wrote:
Hi,

  The mobile team is planning to switch WikiGrok on for non-logged in users next week (2014-01-12). The widget will be on on 166,029 article pages in enwiki. There are two EventLogging schema that may collect data heavily and we want to make sure EL can handle the influx of data.

and the list of pages affected is in:
wgq_page in enwiki.wikigrok_questions.

   It would be great if someone from the dev side let us know whether we will need sampling.

Thanks,
Leila


_______________________________________________
Analytics mailing list
Analytics@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics



_______________________________________________
Analytics mailing list
Analytics@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics



_______________________________________________
Analytics mailing list
Analytics@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics


_______________________________________________
Analytics mailing list
Analytics@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics