Hi Joseph. Even these approximations could already give me valuable information. If it is possible to distinguish between mobile and PC visits, I could filter out the mobile visits and keep the more reliable PC-based data.
That is all I needed to know for now to prepare my project. If I need to take it further, I will contact you. Thank you very much for the answer.
Cheers, Marc
On Wed, 29 June 2016 at 10:24, Joseph Allemandou (<jallemandou@wikimedia.org>) wrote:
Hi Marc,
The information you're after is not available in the data we collect, for at least two reasons:
- We don't collect data that would allow us to detect user sessions (no ID cookie or other identifier)
- We don't collect time spent on page
Approximations could be made using fingerprinting techniques as a proxy for sessions (with a significant error on mobile due to IP pooling), and successive events as boundaries for time spent on page. These approximations would in any case require an NDA.
Cheers, Joseph
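The approximation Joseph describes could be sketched roughly as follows. This is only an illustration under stated assumptions: the record layout (IP, user agent, timestamp in seconds, page title), the choice of hashing IP plus user agent as the fingerprint, and the function names are all hypothetical, not the way Wikimedia stores or processes its request data.

```python
import hashlib
from collections import defaultdict


def fingerprint(ip, user_agent):
    """Hypothetical session proxy: a hash of IP address and user agent.

    Many mobile users can share one IP (IP pooling), so distinct readers
    may collapse into a single fingerprint.
    """
    return hashlib.sha256(f"{ip}|{user_agent}".encode("utf-8")).hexdigest()


def time_on_page(events):
    """Approximate time spent on a page from successive events.

    `events` is an iterable of (ip, user_agent, timestamp_seconds, page)
    tuples (an assumed layout). The time on a page is taken to be the gap
    until the next request from the same fingerprint. The last request of
    each fingerprint has no successor, so it is left out.
    """
    by_actor = defaultdict(list)
    for ip, user_agent, ts, page in events:
        by_actor[fingerprint(ip, user_agent)].append((ts, page))

    durations = []
    for actor, hits in by_actor.items():
        hits.sort()  # order each actor's requests by timestamp
        for (ts, page), (next_ts, _next_page) in zip(hits, hits[1:]):
            durations.append((actor, page, next_ts - ts))
    return durations
```

Note that this estimate cannot tell reading from an idle tab, and it says nothing about the last page of a visit.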
On Wed, Jun 29, 2016 at 9:16 AM, Marc Miquel marcmiquel@gmail.com wrote:
Thanks for the answer, Oliver. But I am not sure it answers my questions. I'd like to study aspects like how much time is spent on certain pages, as a proxy for how content is approached, read and understood. I'd be happy with the time of entering the page and the time of leaving it. This is not entirely centered on 'user activity', but I put it that way because I imagined the data would be stored in a similar way to editor sessions, or in a database, and that I would need to do the time calculations myself.
Cheers,
Marc
On Wed, 29 June 2016 at 03:11, Oliver Keyes ironholds@gmail.com wrote:
If historic data is okay, there's already a dataset released (https://figshare.com/articles/Activity_Sessions_datasets/1291033) that was designed specifically to answer questions around how best to calculate session length with regard to Wikipedia (http://arxiv.org/abs/1411.2878).
On Tue, Jun 28, 2016 at 3:42 PM, Marc Miquel marcmiquel@gmail.com wrote:
Hello!
I was thinking about user sessions, yes, so this would mean aggregating the pageviews visited by a user during a short amount of time (I should check the cutoff, but it could be around an hour or less).
I am particularly interested in understanding the order in which pages are seen (start, end), duration, etc. I wouldn't need data from a long period either, but I think data from multiple languages would be helpful.
I imagined reader data could be privacy-sensitive, but would an NDA with my university and some sort of data encoding help with this? As I said, it is for a scientific purpose.
Thanks,
Marc
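The session aggregation Marc describes (grouping one visitor's pageviews by an inactivity cutoff, then reading off start page, end page and duration) could look something like the sketch below. The one-hour default only mirrors the "around an hour or less" guess in the message, and the input layout and function names are assumptions made for illustration.

```python
def split_sessions(hits, cutoff_seconds=3600):
    """Group one visitor's pageviews into sessions.

    `hits` is a list of (timestamp_seconds, page) tuples (assumed layout).
    A new session starts whenever the gap since the previous pageview
    exceeds `cutoff_seconds`.
    """
    sessions = []
    for ts, page in sorted(hits):
        if sessions and ts - sessions[-1][-1][0] <= cutoff_seconds:
            sessions[-1].append((ts, page))
        else:
            sessions.append([(ts, page)])
    return sessions


def summarize(session):
    """Return start page, end page, duration and pageview count of a session."""
    return {
        "start_page": session[0][1],
        "end_page": session[-1][1],
        "duration_seconds": session[-1][0] - session[0][0],
        "pageviews": len(session),
    }
```

The duration here runs from the first to the last pageview, so a session with a single pageview has a duration of zero.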
On Tue, 28 June 2016 at 21:09, Nuria Ruiz (nuria@wikimedia.org) wrote:
Hello!
> I am considering studying reader engagement for different article topics in different languages. Because of this, I would like to know if there is any plan to make available pageviews dumps detailing activity log at session level per user - in a similar way to editor sessions.
Are you thinking of "all-pageviews-visited-by-a-certain-user"? If so, no: we do not have any projects to provide that data because, due to privacy concerns, we neither have nor keep that information.
Thanks,
Nuria
On Tue, Jun 28, 2016 at 6:55 PM, Leila Zia leila@wikimedia.org wrote:
- Analytics
On Tue, Jun 28, 2016 at 6:36 AM, Marc Miquel marcmiquel@gmail.com wrote:
> Hello,
>
> I have a question for you regarding pageviews datadumps.
>
> I am considering studying reader engagement for different article
> topics in different languages. Because of this, I would like to know if
> there is any plan to make available pageviews dumps detailing activity log
> at session level per user - in a similar way to editor sessions.
>
> Since this would be for a research project I might ask for funding for,
> I would like to know if I could count on that, what the nature of the
> available data is, what the procedure to obtain this data would be, and
> whether there would be any implications because of privacy concerns.
>
> Thank you very much!
>
> Best,
>
> Marc Miquel
>
> _______________________________________________
> Wiki-research-l mailing list
> Wiki-research-l@lists.wikimedia.org
> https://lists.wikimedia.org/mailman/listinfo/wiki-research-l
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
--
*Joseph Allemandou*
Data Engineer @ Wikimedia Foundation
IRC: joal