Is there any way you can look at traffic for particular pages? If so,
you could look at traffic to something like Special:Watchlist or
Special:UserLogin on a representative sample of wikis - anyone using
these two pages is very likely to represent a logged-in user, and
traffic numbers to them are high enough you might get useful data even
with the sampling limits.
Andrew.
- Andrew Gray
On 15 October 2014 21:50, Oliver Keyes <okeyes@wikimedia.org> wrote:
> Darnit. Ah well! Okay; finished building the code to retrieve this data.
> Takes ~400 seconds to handle a day of logs, so take into account
> parallelisation and I should (should!) have something to show in a couple of
> hours for the first 3 Qs. The fourth, it seems, is beyond our ken.
>
> On 15 October 2014 15:54, Max Semenik <maxsem.wiki@gmail.com> wrote:
>>
>> There's no data for IE6 in EventLogging because IE6 gets no JS these days.
>> Maybe, if there's old enough data...
>>
>> On Wed, Oct 15, 2014 at 12:46 PM, Oliver Keyes <okeyes@wikimedia.org>
>> wrote:
>>>
>>> Update: Yuvi's pointed me towards a login attempts schema. All 4 are
>>> doable. Data tomorrow morning EST at the latest.
>>>
>>> On 15 October 2014 15:19, Oliver Keyes <okeyes@wikimedia.org> wrote:
>>>>
>>>> (With "jiffy" read "a day"; even with sampling, big logs are big, and I
>>>> imagine we probably want ~30 days of data.)
>>>>
>>>> On 15 October 2014 15:18, Oliver Keyes <okeyes@wikimedia.org> wrote:
>>>>>
>>>>> First three are pretty trivial; last one is a bit of a pain, but doable
>>>>> if someone wants to poke me on IRC (/query Ironholds) and chat about what an
>>>>> unambiguous successful login action would look like in terms of requests.
>>>>> But I can do the first three in a jiffy.
>>>>>
>>>>> On 15 October 2014 13:32, Brandon Black <bblack@wikimedia.org> wrote:
>>>>>>
>>>>>>
>>>>>>
>>>>>> On Wed, Oct 15, 2014 at 3:43 PM, Oliver Keyes <okeyes@wikimedia.org>
>>>>>> wrote:
>>>>>>>
>>>>>>> You invoked my name!
>>>>>>>
>>>>>>> Emphasis is "logged-in". If you guys want more solid overall numbers,
>>>>>>> I can get those in short order; this seems like a pretty critical question
>>>>>>> to have data on, fast. Lemme know.
>>>>>>
>>>>>>
>>>>>> If you can source some good reliable numbers, probably what we care
>>>>>> about (all of which have been estimated to some degree in this thread
>>>>>> already, I think?) is:
>>>>>>
>>>>>> % of all requests from IE6
>>>>>> % of all https requests from IE6
>>>>>> % of all text/html https requests from IE6 (not so important IMHO, if
>>>>>> it's difficult)
>>>>>> % of all logged-in https requests (or alternatively, % of all
>>>>>> successful https login attempts) from IE6.
>>>>>>
>>>>>
>>>>>
>>>>>
>>>>> --
>>>>> Oliver Keyes
>>>>> Research Analyst
>>>>> Wikimedia Foundation
>>>>
>>>>
>>>>
>>>>
>>>> --
>>>> Oliver Keyes
>>>> Research Analyst
>>>> Wikimedia Foundation
>>>
>>>
>>>
>>>
>>> --
>>> Oliver Keyes
>>> Research Analyst
>>> Wikimedia Foundation
>>>
>>> _______________________________________________
>>> Analytics mailing list
>>> Analytics@lists.wikimedia.org
>>> https://lists.wikimedia.org/mailman/listinfo/analytics
>>>
>>
>>
>>
>> --
>> Best regards,
>> Max Semenik ([[User:MaxSem]])
>>
>> _______________________________________________
>> Analytics mailing list
>> Analytics@lists.wikimedia.org
>> https://lists.wikimedia.org/mailman/listinfo/analytics
>>
>
>
>
> --
> Oliver Keyes
> Research Analyst
> Wikimedia Foundation
>
> _______________________________________________
> Analytics mailing list
> Analytics@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/analytics
>
--
andrew.gray@dunelm.org.uk
_______________________________________________
Analytics mailing list
Analytics@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics