Thanks Taha for pointing that out. I have added the note to the blog post and am hope to start a conversation on what can we do to make the analysis and curation of Wikipedia traffic data more useful and meaningful both for research and policies.
BTW, very interesting phenomena of "sleep depth" for different languages in Taha Yasseri, Robert Sumi, János Kertész's paper. It provides insights into the time distribution of Wikipedia labour across working hours and working days. To certain extent, it shows us the current utility of the global "cognitive surplus" by the Wikipedia projects. Virtual labour is still conditioned by the diverse working environments across the world, as mentioned by the authors in the quote: below:
"
For example, the daily pattern of Asian languages (e.g., Japanese, Chinese and Korean) show higher activity during evenings and nights along with high level of activity at weekends. This can be related partly to the lengths of working hours in corresponding countries. This general image, which holds partially for Turkey and Russia and Israel too, could be in close relation with the high average working hours per day in those countries (more than 40 hours in all the mentioned cases, according to the dataset of The Organization for Economic Co-operation and Development:http://stats.oecd.org). Furthermore, among European countries, we also see the same tendency; in the countries with rather larger working times, edits are mostly done in later times in evenings."
Note also that the difference in the timeframes, what I have done by the infographics based on the Wikimedia's Squid reports (not the original traffic data) shows yearly changes. This is in contrast to the Yasseri et al.'s analysis of circadian *daily* and *weekly* patterns. Both have different angles and thus different needs from the Wikimedia Foundation for its traffic data.
Thus, we might want to share what has been done and what could be done regarding the current traffic data provided by the Wikimedia Foundation while acknowledging the sensitivity of the traffic data release,
Best,
han-teng liao