If historical data is okay, there's already a released dataset (https://figshare.com/articles/Activity_Sessions_datasets/1291033) that was designed specifically to answer questions about how best to calculate session length on Wikipedia (http://arxiv.org/abs/1411.2878).

On Tue, Jun 28, 2016 at 3:42 PM, Marc Miquel <marcmiquel@gmail.com> wrote:
Hello!

I was thinking about user sessions, yes, so this would mean aggregating the pageviews a user makes within a short period of time (I should check the cutoff, but it could be around an hour or less).

I am particularly interested in understanding the order in which pages are seen (start, end), duration, etc. 
I wouldn't need data from a long period either, but I think data from multiple languages would be helpful.
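
To make the kind of aggregation described above concrete, here is a minimal sketch of inactivity-based sessionization. It assumes hypothetical input records of the form (user_id, timestamp, page); no such per-reader data is published, so the record layout, the function names `sessionize` and `summarize`, and the one-hour cutoff (the rough figure mentioned above, not a validated threshold) are all illustrative assumptions.

```python
from datetime import datetime, timedelta
from itertools import groupby
from operator import itemgetter

# Assumed inactivity cutoff; the message above only says "around an hour or less".
CUTOFF = timedelta(hours=1)


def sessionize(pageviews, cutoff=CUTOFF):
    """Group hypothetical (user_id, timestamp, page) tuples into sessions.

    A new session starts whenever the gap between two consecutive
    pageviews by the same user exceeds `cutoff`.
    Returns a list of (user_id, [(timestamp, page), ...]) tuples.
    """
    sessions = []
    # Sort by user, then by time, so each user's views are contiguous and ordered.
    ordered = sorted(pageviews, key=itemgetter(0, 1))
    for user, views in groupby(ordered, key=itemgetter(0)):
        current = []
        for _, ts, page in views:
            if current and ts - current[-1][0] > cutoff:
                sessions.append((user, current))
                current = []
            current.append((ts, page))
        sessions.append((user, current))
    return sessions


def summarize(session):
    """Return the first page, last page, duration, and view count of one session."""
    user, views = session
    return {
        "user": user,
        "start_page": views[0][1],
        "end_page": views[-1][1],
        "duration": views[-1][0] - views[0][0],
        "n_views": len(views),
    }
```

With this definition a single-pageview session has a duration of zero, since the time spent on the last page of a session cannot be observed from pageview timestamps alone.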

I imagined reader data could be privacy-sensitive, but would an NDA with my university and some form of data encoding help with this? As I said, it is for a scientific purpose.

Thanks,

Marc

On Tue, Jun 28, 2016 at 21:09, Nuria Ruiz (<nuria@wikimedia.org>) wrote:

Hello!

>I am considering studying reader engagement for different article topics in different languages. Because of this, I would like to know if there is any plan to make pageview dumps available that detail the activity log at session level per user, in a similar way to editor sessions.

Are you thinking of "all-pageviews-visited-by-a-certain-user"? If so, no: we do not have any projects to provide that data because, due to privacy concerns, we neither have nor keep that information.

Thanks, 

Nuria



On Tue, Jun 28, 2016 at 6:55 PM, Leila Zia <leila@wikimedia.org> wrote:
+ Analytics


On Tue, Jun 28, 2016 at 6:36 AM, Marc Miquel <marcmiquel@gmail.com> wrote:

Hello, 

I have a question for you regarding pageview data dumps.

I am considering studying reader engagement for different article topics in different languages. Because of this, I would like to know if there is any plan to make pageview dumps available that detail the activity log at session level per user, in a similar way to editor sessions.

Since this would be for a research project for which I might request funding, I would like to know whether I could count on that data, what the nature of the available data is, what the procedure to obtain it would be, and whether there would be any implications due to privacy concerns.

Thank you very much!

Best,

Marc Miquel


_______________________________________________
Wiki-research-l mailing list
Wiki-research-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/wiki-research-l



_______________________________________________
Analytics mailing list
Analytics@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics

