(including analytics@ public list)
Rafael:
As I think we have mention before please be so kind to e-mail analytics@ rather than individual people.
Again, we are exclusively looking for the absolute number of Wikipedia
updates per year per county. I think you mean "edits" to wikipedia. If so we are currently working on a project that sets up to deliver an estimate (likely an interval) of edits happening a country. we do not have an ETA for such a deliverable but the major project that you can follow is this one:
https://phabricator.wikimedia.org/T130256
You can ping us again by the end of next quarter (April 2017) by which we can probably give you more specific information.
Thanks,
Nuria
On Wed, Nov 2, 2016 at 12:53 PM, Rafael Escalona Reynoso re32@cornell.edu wrote:
Dear Dan,
I hope you are doing fine and that you remember me. I am the lead researcher at The Global Innovation Index (GII). I contacted you last year searching for data on Wikipedia uploads per country. I believe that this request got assigned a task number at some point. Here is what I know:
Can you please let me know of this request’s status?
Also, if any legal issues seem to be obstructing the compilation of this data, can you please refer us to someone from your legal department to explore the possibility of tailoring a contract/confidentiality agreement between the GII and Wikimedia?
Again, we are exclusively looking for the absolute number of Wikipedia updates per year per county. These used to be available via Wikimedia here:
https://stats.wikimedia.org/wikimedia/squids/
Hope to hear from you soon.
Sincerely,
Rafael Escalona Reynoso, PhD, MPA.
Lead Researcher at The Global Innovation Index
Samuel Curtis Johnson Graduate School of Management
207 Sage Hall Cornell University Ithaca, NY 14853-6201
Phone: +1 (607) 262-0983
Email: re32@cornell.edu soumitra.dutta@cornell.edu
http://www.johnson.cornell.edu
[image: cid:image001.png@01CB662F.A467E740]
On Mon, Apr 4, 2016 at 10:58 PM, Dan Andreescu dandreescu@wikimedia.org wrote:
Yes, once this data is properly anonymized, it should continue to be released in the same shape. We just have to make sure it's properly safe first.
*From: *Rafael Escalona Reynoso
*Sent: *Monday, April 4, 2016 19:21
*To: *Dan Andreescu
*Cc: *Jordan Litner; sacha.wunschvincent@wipo.int
*Subject: *RE: On Wikipedia edits archive per county.
Dan,
Thank you for the update. This is kind of what we were expecting. I have a follow-up question: Would the data be collected in the same fashion for subsequent years (2016, 2017, etc.)? Or will this be a single time exercise? Do let me know whenever you can.
Best,
Rafael Escalona Reynoso, PhD, MPA.
Lead Researcher at The Global Innovation Index
Samuel Curtis Johnson Graduate School of Management
207 Sage Hall
Cornell University
Ithaca, NY 14853-6201
Phone 1: +1 (607) 262-0983
Phone 2: +1 (607) 255-9245
Email: re32@cornell.edu
http://www.johnson.cornell.edu
[image: cid:image001.png@01CB662F.A467E740]
See www.globalinnovationindex.org
*From:* Dan Andreescu [mailto:dandreescu@wikimedia.org] *Sent:* Monday, April 04, 2016 6:13 PM
*To:* Rafael Escalona Reynoso *Cc:* Jordan Litner; sacha.wunschvincent@wipo.int *Subject:* Re: On Wikipedia edits archive per county.
Hey Rafael,
We haven't been able to prioritize this work yet. It's been moved here:
https://phabricator.wikimedia.org/T131280
It has two stakeholders but no resources to get it done due to privacy issues. So we won't be able to get the 2015 data cleaned up before your deadline. But we're meeting about it again tomorrow and we will still do it so you can have this data for either next year's report or an amendment.
On Mon, Apr 4, 2016 at 11:49 AM, Rafael Escalona Reynoso re32@cornell.edu wrote:
Dear Dan,
Hope you are doing fine. Just is quick note to follow up on the Wikipedia data. When do you think this data will be available? We are about to close the model and would very much like to have 2015 data included. Let me know.
Best,
Rafael Escalona Reynoso, PhD, MPA.
Lead Researcher at The Global Innovation Index
Samuel Curtis Johnson Graduate School of Management
207 Sage Hall
Cornell University
Ithaca, NY 14853-6201
Phone 1: +1 (607) 262-0983
Phone 2: +1 (607) 255-9245
Email: re32@cornell.edu
http://www.johnson.cornell.edu
[image: cid:image001.png@01CB662F.A467E740]
See www.globalinnovationindex.org
*From:* Dan Andreescu [mailto:dandreescu@wikimedia.org] *Sent:* Thursday, February 18, 2016 8:19 PM
*To:* Rafael Escalona Reynoso *Cc:* Jordan Litner; sacha.wunschvincent@wipo.int *Subject:* Re: On Wikipedia edits archive per county.
That's perfect, I added it to the request: https://phabricator. wikimedia.org/T127409
On Thursday, February 18, 2016, Rafael Escalona Reynoso re32@cornell.edu wrote:
Dan,
These were my thoughts exactly. Let me then elaborate on the value of the report, the index and why we feel that Wikipedia data is essential.
The report is co-published by Cornell University, INSEAD, and the World Intellectual Property Organization (WIPO, a specialized agency of the United Nations), with the collaboration of three Knowledge Partners: the Confederation of Indian Industry, du, and A.T. Kearney and IMP³rove – European Innovation Management Academy. Now in its ninth edition, the report has established itself as a premier reference among innovation metrics and as a tool to facilitate public-private dialogue and evidence-based policymaking.
The Global Innovation Index (GII) is a ranking of 141 economies in terms of their innovation capabilities and results. A total of 79 metrics in the form of data-based indicators are at its core. These rich metrics can be used —on the level of the index, the sub-indices, or as individual variables—to monitor performance over time and to benchmark developments against their peers. These can also help study country profiles over time, and to identify their relative strengths and weaknesses from the rich and unique GII dataset.
Each year the GII results are presented within the framework of a top-level international event:
• 2013 Geneva, Switzerland at the Opening Session of the United Nations Economic and Social Council (ECOSOC) High-Level Segment, organized by WIPO;
• 2014 Sydney, Australia in the context of the B20/G20 preparations; and
• 2015 London, United Kingdom before the Minister of Innovation and Industry.
This year the launch is scheduled for the summer in Beijing, China preceding the preparations for the 2016 G20 summit.
Recognizing the need for a broad horizontal vision of innovation applicable to developed and emerging economies alike, the GII includes indicators that go beyond the traditional measures such as expenditure in research and development. That said, an area that is of great relevance and limited to the GII is that of creative outputs. Within it, *Wikipedia monthly page edits (per million population 15-69 y/o)* is a key metric. This indicator, along with others that measure the number of generic top-level and country-code top-level domains and video uploads in YouTube, helps capture what we define as online creativity.
Lastly, we believe that the GII can be an important vehicle to signal that Wikipedia is a critical lever to innovation and a factor contributing to a new understanding of the digital information landscape and innovation globally.
Based on all the above, we would like to request that our petition to collect data on Wikipedia monthly page edits per country, reported quarterly per year be given priority within your tasks.
Sincerely,
Rafael Escalona Reynoso, PhD, MPA.
Lead Researcher at The Global Innovation Index
Samuel Curtis Johnson Graduate School of Management
207 Sage Hall
Cornell University
Ithaca, NY 14853-6201
Phone 1: +1 (607) 262-0983
Phone 2: +1 (607) 255-9245
Email: re32@cornell.edu
http://www.johnson.cornell.edu
[image: cid:image001.png@01CB662F.A467E740]
See www.globalinnovationindex.org
*From:* Dan Andreescu [mailto:dandreescu@wikimedia.org dandreescu@wikimedia.org] *Sent:* Thursday, February 18, 2016 11:17 AM *To:* Rafael Escalona Reynoso *Cc:* Jordan Litner; sacha.wunschvincent@wipo.int *Subject:* Re: On Wikipedia edits archive per county.
Where the request is coming from, with all due respect, does not matter. We aim to be neutral in how we make knowledge available (namely, we try to make it available to everyone, for free).
But, we have to prioritize somehow, and that process definitely takes into consideration the value our work has to the world. So, if you tell me more about what this data could help you accomplish, we could use that to argue that prioritizing your request might save lives, serve the mission of open knowledge, etc.
But to answer your other question directly, yes, a letter to Jimmy Wales would not have any effect on this priority process and might be seen by the community we serve as an attempt to circumvent our planning process.
On Thu, Feb 18, 2016 at 10:54 AM, Rafael Escalona Reynoso < re32@cornell.edu> wrote:
Dan,
Let me share with you the following thought. I just had a call with the Dean at the business school here at Cornell, who is the creator of the Global Innovation Index (and my direct boss). I explained the situation with the Wikipedia uploads data and how methodological changes are now making it impossible for us to collect it in the fashion that we were used to. He mentioned that he is an acquaintance with Jimmy Wales and offered to direct him a letter explaining what we need and the importance of the indicator for our index. My notion here is that the issue has more to do with a shortage of labor hand and quite large a backlog than with where the request is coming from. Also, I do not want the letter to come across as an imposition or to give the wrong message. Based on the above, would this letter help prioritize the collection of this data?
Let me know what you think.
Best,
Rafael Escalona Reynoso, PhD, MPA.
Lead Researcher at The Global Innovation Index
Samuel Curtis Johnson Graduate School of Management
207 Sage Hall
Cornell University
Ithaca, NY 14853-6201
Phone 1: +1 (607) 262-0983
Phone 2: +1 (607) 255-9245
Email: re32@cornell.edu
http://www.johnson.cornell.edu
[image: cid:image001.png@01CB662F.A467E740]
See www.globalinnovationindex.org
*From:* Dan Andreescu [mailto:dandreescu@wikimedia.org dandreescu@wikimedia.org] *Sent:* Wednesday, February 17, 2016 2:57 PM *To:* Rafael Escalona Reynoso *Cc:* Jordan Litner; sacha.wunschvincent@wipo.int *Subject:* Re: On Wikipedia edits archive per county.
As much as I love to help a fellow Cornellian, we are too small of a team to create one-off solutions like that. We either publish it for everyone or no-one. But even if we did that, we'd still have a lot of work to check whether cross-referencing that data with other data wouldn't hurt privacy.
What would help is if you filed a task in Phabricator and tagged it with the "Analytics" project, and described very precisely what data you need, at what time granularity, and what you need it for. We'll use that as proof that we need to prioritize the work sooner than later.
On Wed, Feb 17, 2016 at 2:45 PM, Rafael Escalona Reynoso re32@cornell.edu wrote:
Dan,
One last thing. We also report scaled data from Google on YouTube uploads and, as you mention, they have to protect privacy. However, we prepare for them an Excel sheet where they simply need to upload the totals for each country (which we never get to see) and they report back to us exclusively the normalized scores (0-100) and rankings for all countries we request information for. Using this procedure it becomes impossible to reverse-engineer the raw values used to obtain these totals and – again – we never get to see the actual data. Is there a chance that we could establish a similar type of arrangement with Wikimedia? Let me know.
Best,
Rafael.
*From:* Dan Andreescu [mailto:dandreescu@wikimedia.org dandreescu@wikimedia.org] *Sent:* Wednesday, February 17, 2016 2:23 PM *To:* Rafael Escalona Reynoso *Subject:* Re: On Wikipedia edits archive per county.
Thank you for this, again. Sorry to pester you with this again but, do you know of any other data (from a source different from Google) where online activity could be measured? Any leads would be quite appreciated.
mmm, not geolocated that I know of, and it's unlikely that you'd find that. Because either
an organization is for-profit, in which case they would sell that data
an organization is non-profit, in which case they'd likely need to
protect their users and bump up against the same *hard* problems we did
But I could be wrong, good luck in your search and do report back if you find any, and especially if you find approaches that help to protect privacy.
Dear Nuria, First of all, happy holidays. I write you directly for the following reason: as you know we use edits to Wikipedia per country per year to define an indicator in the Global Innovation Index (GII). This year were hoping to collect this data by March 31- first week of April 2017, the latest.
Reviewing the project link you provided for such data availability it looks like it will not be at hand in the time and form required from us. In addition, the project is not marked as a priority. From previous exchanges I believe also that resources for this project are also an issue.
Based on the above, is there a possibility that the provision of financial assistance can help expedite the production of this data in the time and form needed? As mentioned before, the indicator produced using this information is key in our creative outputs pillar and the index itself is an internationally recognized study showcasing data globally. If we accomplish to include new Wikipedia data this year it will be a win-win situation for both of us.
Please let me know at your soonest convenience of this possibility.
Sincerely,
Rafael Escalona Reynoso.
On Wed, Nov 2, 2016 at 5:31 PM Nuria Ruiz nuria@wikimedia.org wrote:
(including analytics@ public list)
Rafael:
As I think we have mention before please be so kind to e-mail analytics@ rather than individual people.
Again, we are exclusively looking for the absolute number of Wikipedia
updates per year per county. I think you mean "edits" to wikipedia. If so we are currently working on a project that sets up to deliver an estimate (likely an interval) of edits happening a country. we do not have an ETA for such a deliverable but the major project that you can follow is this one:
https://phabricator.wikimedia.org/T130256
You can ping us again by the end of next quarter (April 2017) by which we can probably give you more specific information.
Thanks,
Nuria
On Wed, Nov 2, 2016 at 12:53 PM, Rafael Escalona Reynoso <re32@cornell.edu
wrote:
Dear Dan,
I hope you are doing fine and that you remember me. I am the lead researcher at The Global Innovation Index (GII). I contacted you last year searching for data on Wikipedia uploads per country. I believe that this request got assigned a task number at some point. Here is what I know:
Can you please let me know of this request’s status?
Also, if any legal issues seem to be obstructing the compilation of this data, can you please refer us to someone from your legal department to explore the possibility of tailoring a contract/confidentiality agreement between the GII and Wikimedia?
Again, we are exclusively looking for the absolute number of Wikipedia updates per year per county. These used to be available via Wikimedia here:
https://stats.wikimedia.org/wikimedia/squids/
Hope to hear from you soon.
Sincerely,
Rafael Escalona Reynoso, PhD, MPA.
Lead Researcher at The Global Innovation Index
Samuel Curtis Johnson Graduate School of Management
207 Sage Hall Cornell University Ithaca, NY 14853-6201
Phone: +1 (607) 262-0983
Email: re32@cornell.edu soumitra.dutta@cornell.edu
http://www.johnson.cornell.edu
[image: cid:image001.png@01CB662F.A467E740]
On Mon, Apr 4, 2016 at 10:58 PM, Dan Andreescu dandreescu@wikimedia.org wrote:
Yes, once this data is properly anonymized, it should continue to be released in the same shape. We just have to make sure it's properly safe first.
*From: *Rafael Escalona Reynoso
*Sent: *Monday, April 4, 2016 19:21
*To: *Dan Andreescu
*Cc: *Jordan Litner; sacha.wunschvincent@wipo.int
*Subject: *RE: On Wikipedia edits archive per county.
Dan,
Thank you for the update. This is kind of what we were expecting. I have a follow-up question: Would the data be collected in the same fashion for subsequent years (2016, 2017, etc.)? Or will this be a single time exercise? Do let me know whenever you can.
Best,
Rafael Escalona Reynoso, PhD, MPA.
Lead Researcher at The Global Innovation Index
Samuel Curtis Johnson Graduate School of Management
207 Sage Hall
Cornell University
Ithaca, NY 14853-6201
Phone 1: +1 (607) 262-0983
Phone 2: +1 (607) 255-9245
Email: re32@cornell.edu
http://www.johnson.cornell.edu
[image: cid:image001.png@01CB662F.A467E740]
See www.globalinnovationindex.org
*From:* Dan Andreescu [mailto:dandreescu@wikimedia.org] *Sent:* Monday, April 04, 2016 6:13 PM
*To:* Rafael Escalona Reynoso *Cc:* Jordan Litner; sacha.wunschvincent@wipo.int *Subject:* Re: On Wikipedia edits archive per county.
Hey Rafael,
We haven't been able to prioritize this work yet. It's been moved here:
https://phabricator.wikimedia.org/T131280
It has two stakeholders but no resources to get it done due to privacy issues. So we won't be able to get the 2015 data cleaned up before your deadline. But we're meeting about it again tomorrow and we will still do it so you can have this data for either next year's report or an amendment.
On Mon, Apr 4, 2016 at 11:49 AM, Rafael Escalona Reynoso re32@cornell.edu wrote:
Dear Dan,
Hope you are doing fine. Just is quick note to follow up on the Wikipedia data. When do you think this data will be available? We are about to close the model and would very much like to have 2015 data included. Let me know.
Best,
Rafael Escalona Reynoso, PhD, MPA.
Lead Researcher at The Global Innovation Index
Samuel Curtis Johnson Graduate School of Management
207 Sage Hall
Cornell University
Ithaca, NY 14853-6201
Phone 1: +1 (607) 262-0983
Phone 2: +1 (607) 255-9245
Email: re32@cornell.edu
http://www.johnson.cornell.edu
[image: cid:image001.png@01CB662F.A467E740]
See www.globalinnovationindex.org
*From:* Dan Andreescu [mailto:dandreescu@wikimedia.org] *Sent:* Thursday, February 18, 2016 8:19 PM
*To:* Rafael Escalona Reynoso *Cc:* Jordan Litner; sacha.wunschvincent@wipo.int *Subject:* Re: On Wikipedia edits archive per county.
That's perfect, I added it to the request: https://phabricator.wikimedia.org/T127409
On Thursday, February 18, 2016, Rafael Escalona Reynoso re32@cornell.edu wrote:
Dan,
These were my thoughts exactly. Let me then elaborate on the value of the report, the index and why we feel that Wikipedia data is essential.
The report is co-published by Cornell University, INSEAD, and the World Intellectual Property Organization (WIPO, a specialized agency of the United Nations), with the collaboration of three Knowledge Partners: the Confederation of Indian Industry, du, and A.T. Kearney and IMP³rove – European Innovation Management Academy. Now in its ninth edition, the report has established itself as a premier reference among innovation metrics and as a tool to facilitate public-private dialogue and evidence-based policymaking.
The Global Innovation Index (GII) is a ranking of 141 economies in terms of their innovation capabilities and results. A total of 79 metrics in the form of data-based indicators are at its core. These rich metrics can be used —on the level of the index, the sub-indices, or as individual variables—to monitor performance over time and to benchmark developments against their peers. These can also help study country profiles over time, and to identify their relative strengths and weaknesses from the rich and unique GII dataset.
Each year the GII results are presented within the framework of a top-level international event:
• 2013 Geneva, Switzerland at the Opening Session of the United Nations Economic and Social Council (ECOSOC) High-Level Segment, organized by WIPO;
• 2014 Sydney, Australia in the context of the B20/G20 preparations; and
• 2015 London, United Kingdom before the Minister of Innovation and Industry.
This year the launch is scheduled for the summer in Beijing, China preceding the preparations for the 2016 G20 summit.
Recognizing the need for a broad horizontal vision of innovation applicable to developed and emerging economies alike, the GII includes indicators that go beyond the traditional measures such as expenditure in research and development. That said, an area that is of great relevance and limited to the GII is that of creative outputs. Within it, *Wikipedia monthly page edits (per million population 15-69 y/o)* is a key metric. This indicator, along with others that measure the number of generic top-level and country-code top-level domains and video uploads in YouTube, helps capture what we define as online creativity.
Lastly, we believe that the GII can be an important vehicle to signal that Wikipedia is a critical lever to innovation and a factor contributing to a new understanding of the digital information landscape and innovation globally.
Based on all the above, we would like to request that our petition to collect data on Wikipedia monthly page edits per country, reported quarterly per year be given priority within your tasks.
Sincerely,
Rafael Escalona Reynoso, PhD, MPA.
Lead Researcher at The Global Innovation Index
Samuel Curtis Johnson Graduate School of Management
207 Sage Hall
Cornell University
Ithaca, NY 14853-6201
Phone 1: +1 (607) 262-0983
Phone 2: +1 (607) 255-9245
Email: re32@cornell.edu
http://www.johnson.cornell.edu
[image: cid:image001.png@01CB662F.A467E740]
See www.globalinnovationindex.org
*From:* Dan Andreescu [mailto:dandreescu@wikimedia.org dandreescu@wikimedia.org] *Sent:* Thursday, February 18, 2016 11:17 AM *To:* Rafael Escalona Reynoso *Cc:* Jordan Litner; sacha.wunschvincent@wipo.int *Subject:* Re: On Wikipedia edits archive per county.
Where the request is coming from, with all due respect, does not matter. We aim to be neutral in how we make knowledge available (namely, we try to make it available to everyone, for free).
But, we have to prioritize somehow, and that process definitely takes into consideration the value our work has to the world. So, if you tell me more about what this data could help you accomplish, we could use that to argue that prioritizing your request might save lives, serve the mission of open knowledge, etc.
But to answer your other question directly, yes, a letter to Jimmy Wales would not have any effect on this priority process and might be seen by the community we serve as an attempt to circumvent our planning process.
On Thu, Feb 18, 2016 at 10:54 AM, Rafael Escalona Reynoso < re32@cornell.edu> wrote:
Dan,
Let me share with you the following thought. I just had a call with the Dean at the business school here at Cornell, who is the creator of the Global Innovation Index (and my direct boss). I explained the situation with the Wikipedia uploads data and how methodological changes are now making it impossible for us to collect it in the fashion that we were used to. He mentioned that he is an acquaintance with Jimmy Wales and offered to direct him a letter explaining what we need and the importance of the indicator for our index. My notion here is that the issue has more to do with a shortage of labor hand and quite large a backlog than with where the request is coming from. Also, I do not want the letter to come across as an imposition or to give the wrong message. Based on the above, would this letter help prioritize the collection of this data?
Let me know what you think.
Best,
Rafael Escalona Reynoso, PhD, MPA.
Lead Researcher at The Global Innovation Index
Samuel Curtis Johnson Graduate School of Management
207 Sage Hall
Cornell University
Ithaca, NY 14853-6201
Phone 1: +1 (607) 262-0983
Phone 2: +1 (607) 255-9245
Email: re32@cornell.edu
http://www.johnson.cornell.edu
[image: cid:image001.png@01CB662F.A467E740]
See www.globalinnovationindex.org
*From:* Dan Andreescu [mailto:dandreescu@wikimedia.org dandreescu@wikimedia.org] *Sent:* Wednesday, February 17, 2016 2:57 PM *To:* Rafael Escalona Reynoso *Cc:* Jordan Litner; sacha.wunschvincent@wipo.int *Subject:* Re: On Wikipedia edits archive per county.
As much as I love to help a fellow Cornellian, we are too small of a team to create one-off solutions like that. We either publish it for everyone or no-one. But even if we did that, we'd still have a lot of work to check whether cross-referencing that data with other data wouldn't hurt privacy.
What would help is if you filed a task in Phabricator and tagged it with the "Analytics" project, and described very precisely what data you need, at what time granularity, and what you need it for. We'll use that as proof that we need to prioritize the work sooner than later.
On Wed, Feb 17, 2016 at 2:45 PM, Rafael Escalona Reynoso re32@cornell.edu wrote:
Dan,
One last thing. We also report scaled data from Google on YouTube uploads and, as you mention, they have to protect privacy. However, we prepare for them an Excel sheet where they simply need to upload the totals for each country (which we never get to see) and they report back to us exclusively the normalized scores (0-100) and rankings for all countries we request information for. Using this procedure it becomes impossible to reverse-engineer the raw values used to obtain these totals and – again – we never get to see the actual data. Is there a chance that we could establish a similar type of arrangement with Wikimedia? Let me know.
Best,
Rafael.
*From:* Dan Andreescu [mailto:dandreescu@wikimedia.org dandreescu@wikimedia.org] *Sent:* Wednesday, February 17, 2016 2:23 PM *To:* Rafael Escalona Reynoso *Subject:* Re: On Wikipedia edits archive per county.
Thank you for this, again. Sorry to pester you with this again but, do you know of any other data (from a source different from Google) where online activity could be measured? Any leads would be quite appreciated.
mmm, not geolocated that I know of, and it's unlikely that you'd find that. Because either
an organization is for-profit, in which case they would sell that data
an organization is non-profit, in which case they'd likely need to
protect their users and bump up against the same *hard* problems we did
But I could be wrong, good luck in your search and do report back if you find any, and especially if you find approaches that help to protect privacy.
Note that the last email in the thread was blocked by the mailing list for 7 days, because the sender was not subscribed to it.
On Mon, Dec 26, 2016 at 5:05 PM, Rafael Escalona Reynoso re32@cornell.edu wrote:
Dear Nuria, First of all, happy holidays. I write you directly for the following reason: as you know we use edits to Wikipedia per country per year to define an indicator in the Global Innovation Index (GII). This year were hoping to collect this data by March 31- first week of April 2017, the latest.
Reviewing the project link you provided for such data availability it looks like it will not be at hand in the time and form required from us. In addition, the project is not marked as a priority. From previous exchanges I believe also that resources for this project are also an issue.
Based on the above, is there a possibility that the provision of financial assistance can help expedite the production of this data in the time and form needed? As mentioned before, the indicator produced using this information is key in our creative outputs pillar and the index itself is an internationally recognized study showcasing data globally. If we accomplish to include new Wikipedia data this year it will be a win-win situation for both of us.
Please let me know at your soonest convenience of this possibility.
Sincerely,
Rafael Escalona Reynoso.
On Wed, Nov 2, 2016 at 5:31 PM Nuria Ruiz nuria@wikimedia.org wrote:
(including analytics@ public list)
Rafael:
As I think we have mention before please be so kind to e-mail analytics@ rather than individual people.
Again, we are exclusively looking for the absolute number of Wikipedia
updates per year per county. I think you mean "edits" to wikipedia. If so we are currently working on a project that sets up to deliver an estimate (likely an interval) of edits happening a country. we do not have an ETA for such a deliverable but the major project that you can follow is this one:
https://phabricator.wikimedia.org/T130256
You can ping us again by the end of next quarter (April 2017) by which we can probably give you more specific information.
Thanks,
Nuria
On Wed, Nov 2, 2016 at 12:53 PM, Rafael Escalona Reynoso < re32@cornell.edu> wrote:
Dear Dan,
I hope you are doing fine and that you remember me. I am the lead researcher at The Global Innovation Index (GII). I contacted you last year searching for data on Wikipedia uploads per country. I believe that this request got assigned a task number at some point. Here is what I know:
Can you please let me know of this request’s status?
Also, if any legal issues seem to be obstructing the compilation of this data, can you please refer us to someone from your legal department to explore the possibility of tailoring a contract/confidentiality agreement between the GII and Wikimedia?
Again, we are exclusively looking for the absolute number of Wikipedia updates per year per county. These used to be available via Wikimedia here:
https://stats.wikimedia.org/wikimedia/squids/
Hope to hear from you soon.
Sincerely,
Rafael Escalona Reynoso, PhD, MPA.
Lead Researcher at The Global Innovation Index
Samuel Curtis Johnson Graduate School of Management
207 Sage Hall Cornell University Ithaca, NY 14853-6201
Phone: +1 (607) 262-0983
Email: re32@cornell.edu soumitra.dutta@cornell.edu
http://www.johnson.cornell.edu
[image: cid:image001.png@01CB662F.A467E740]
On Mon, Apr 4, 2016 at 10:58 PM, Dan Andreescu dandreescu@wikimedia.org wrote:
Yes, once this data is properly anonymized, it should continue to be released in the same shape. We just have to make sure it's properly safe first.
*From: *Rafael Escalona Reynoso
*Sent: *Monday, April 4, 2016 19:21
*To: *Dan Andreescu
*Cc: *Jordan Litner; sacha.wunschvincent@wipo.int
*Subject: *RE: On Wikipedia edits archive per county.
Dan,
Thank you for the update. This is kind of what we were expecting. I have a follow-up question: Would the data be collected in the same fashion for subsequent years (2016, 2017, etc.)? Or will this be a single time exercise? Do let me know whenever you can.
Best,
Rafael Escalona Reynoso, PhD, MPA.
Lead Researcher at The Global Innovation Index
Samuel Curtis Johnson Graduate School of Management
207 Sage Hall
Cornell University
Ithaca, NY 14853-6201
Phone 1: +1 (607) 262-0983
Phone 2: +1 (607) 255-9245
Email: re32@cornell.edu
http://www.johnson.cornell.edu
[image: cid:image001.png@01CB662F.A467E740]
See www.globalinnovationindex.org
*From:* Dan Andreescu [mailto:dandreescu@wikimedia.org] *Sent:* Monday, April 04, 2016 6:13 PM
*To:* Rafael Escalona Reynoso *Cc:* Jordan Litner; sacha.wunschvincent@wipo.int *Subject:* Re: On Wikipedia edits archive per county.
Hey Rafael,
We haven't been able to prioritize this work yet. It's been moved here:
https://phabricator.wikimedia.org/T131280
It has two stakeholders but no resources to get it done due to privacy issues. So we won't be able to get the 2015 data cleaned up before your deadline. But we're meeting about it again tomorrow and we will still do it so you can have this data for either next year's report or an amendment.
On Mon, Apr 4, 2016 at 11:49 AM, Rafael Escalona Reynoso < re32@cornell.edu> wrote:
Dear Dan,
Hope you are doing fine. Just is quick note to follow up on the Wikipedia data. When do you think this data will be available? We are about to close the model and would very much like to have 2015 data included. Let me know.
Best,
Rafael Escalona Reynoso, PhD, MPA.
Lead Researcher at The Global Innovation Index
Samuel Curtis Johnson Graduate School of Management
207 Sage Hall
Cornell University
Ithaca, NY 14853-6201
Phone 1: +1 (607) 262-0983
Phone 2: +1 (607) 255-9245
Email: re32@cornell.edu
http://www.johnson.cornell.edu
[image: cid:image001.png@01CB662F.A467E740]
See www.globalinnovationindex.org
*From:* Dan Andreescu [mailto:dandreescu@wikimedia.org] *Sent:* Thursday, February 18, 2016 8:19 PM
*To:* Rafael Escalona Reynoso *Cc:* Jordan Litner; sacha.wunschvincent@wipo.int *Subject:* Re: On Wikipedia edits archive per county.
That's perfect, I added it to the request: https://phabricator. wikimedia.org/T127409
On Thursday, February 18, 2016, Rafael Escalona Reynoso re32@cornell.edu wrote:
Dan,
These were my thoughts exactly. Let me then elaborate on the value of the report, the index and why we feel that Wikipedia data is essential.
The report is co-published by Cornell University, INSEAD, and the World Intellectual Property Organization (WIPO, a specialized agency of the United Nations), with the collaboration of three Knowledge Partners: the Confederation of Indian Industry, du, and A.T. Kearney and IMP³rove – European Innovation Management Academy. Now in its ninth edition, the report has established itself as a premier reference among innovation metrics and as a tool to facilitate public-private dialogue and evidence-based policymaking.
The Global Innovation Index (GII) is a ranking of 141 economies in terms of their innovation capabilities and results. A total of 79 metrics in the form of data-based indicators are at its core. These rich metrics can be used —on the level of the index, the sub-indices, or as individual variables—to monitor performance over time and to benchmark developments against their peers. These can also help study country profiles over time, and to identify their relative strengths and weaknesses from the rich and unique GII dataset.
Each year the GII results are presented within the framework of a top-level international event:
• 2013 Geneva, Switzerland at the Opening Session of the United Nations Economic and Social Council (ECOSOC) High-Level Segment, organized by WIPO;
• 2014 Sydney, Australia in the context of the B20/G20 preparations; and
• 2015 London, United Kingdom before the Minister of Innovation and Industry.
This year the launch is scheduled for the summer in Beijing, China preceding the preparations for the 2016 G20 summit.
Recognizing the need for a broad horizontal vision of innovation applicable to developed and emerging economies alike, the GII includes indicators that go beyond the traditional measures such as expenditure in research and development. That said, an area that is of great relevance and limited to the GII is that of creative outputs. Within it, *Wikipedia monthly page edits (per million population 15-69 y/o)* is a key metric. This indicator, along with others that measure the number of generic top-level and country-code top-level domains and video uploads in YouTube, helps capture what we define as online creativity.
Lastly, we believe that the GII can be an important vehicle to signal that Wikipedia is a critical lever to innovation and a factor contributing to a new understanding of the digital information landscape and innovation globally.
Based on all the above, we would like to request that our petition to collect data on Wikipedia monthly page edits per country, reported quarterly per year be given priority within your tasks.
Sincerely,
Rafael Escalona Reynoso, PhD, MPA.
Lead Researcher at The Global Innovation Index
Samuel Curtis Johnson Graduate School of Management
207 Sage Hall
Cornell University
Ithaca, NY 14853-6201
Phone 1: +1 (607) 262-0983
Phone 2: +1 (607) 255-9245
Email: re32@cornell.edu
http://www.johnson.cornell.edu
[image: cid:image001.png@01CB662F.A467E740]
See www.globalinnovationindex.org
*From:* Dan Andreescu [mailto:dandreescu@wikimedia.org dandreescu@wikimedia.org] *Sent:* Thursday, February 18, 2016 11:17 AM *To:* Rafael Escalona Reynoso *Cc:* Jordan Litner; sacha.wunschvincent@wipo.int *Subject:* Re: On Wikipedia edits archive per county.
Where the request is coming from, with all due respect, does not matter. We aim to be neutral in how we make knowledge available (namely, we try to make it available to everyone, for free).
But, we have to prioritize somehow, and that process definitely takes into consideration the value our work has to the world. So, if you tell me more about what this data could help you accomplish, we could use that to argue that prioritizing your request might save lives, serve the mission of open knowledge, etc.
But to answer your other question directly, yes, a letter to Jimmy Wales would not have any effect on this priority process and might be seen by the community we serve as an attempt to circumvent our planning process.
On Thu, Feb 18, 2016 at 10:54 AM, Rafael Escalona Reynoso < re32@cornell.edu> wrote:
Dan,
Let me share with you the following thought. I just had a call with the Dean at the business school here at Cornell, who is the creator of the Global Innovation Index (and my direct boss). I explained the situation with the Wikipedia uploads data and how methodological changes are now making it impossible for us to collect it in the fashion that we were used to. He mentioned that he is an acquaintance with Jimmy Wales and offered to direct him a letter explaining what we need and the importance of the indicator for our index. My notion here is that the issue has more to do with a shortage of labor hand and quite large a backlog than with where the request is coming from. Also, I do not want the letter to come across as an imposition or to give the wrong message. Based on the above, would this letter help prioritize the collection of this data?
Let me know what you think.
Best,
Rafael Escalona Reynoso, PhD, MPA.
Lead Researcher at The Global Innovation Index
Samuel Curtis Johnson Graduate School of Management
207 Sage Hall
Cornell University
Ithaca, NY 14853-6201
Phone 1: +1 (607) 262-0983
Phone 2: +1 (607) 255-9245
Email: re32@cornell.edu
http://www.johnson.cornell.edu
[image: cid:image001.png@01CB662F.A467E740]
See www.globalinnovationindex.org
*From:* Dan Andreescu [mailto:dandreescu@wikimedia.org dandreescu@wikimedia.org] *Sent:* Wednesday, February 17, 2016 2:57 PM *To:* Rafael Escalona Reynoso *Cc:* Jordan Litner; sacha.wunschvincent@wipo.int *Subject:* Re: On Wikipedia edits archive per county.
As much as I love to help a fellow Cornellian, we are too small of a team to create one-off solutions like that. We either publish it for everyone or no-one. But even if we did that, we'd still have a lot of work to check whether cross-referencing that data with other data wouldn't hurt privacy.
What would help is if you filed a task in Phabricator and tagged it with the "Analytics" project, and described very precisely what data you need, at what time granularity, and what you need it for. We'll use that as proof that we need to prioritize the work sooner than later.
On Wed, Feb 17, 2016 at 2:45 PM, Rafael Escalona Reynoso < re32@cornell.edu> wrote:
Dan,
One last thing. We also report scaled data from Google on YouTube uploads and, as you mention, they have to protect privacy. However, we prepare for them an Excel sheet where they simply need to upload the totals for each country (which we never get to see) and they report back to us exclusively the normalized scores (0-100) and rankings for all countries we request information for. Using this procedure it becomes impossible to reverse-engineer the raw values used to obtain these totals and – again – we never get to see the actual data. Is there a chance that we could establish a similar type of arrangement with Wikimedia? Let me know.
Best,
Rafael.
*From:* Dan Andreescu [mailto:dandreescu@wikimedia.org dandreescu@wikimedia.org] *Sent:* Wednesday, February 17, 2016 2:23 PM *To:* Rafael Escalona Reynoso *Subject:* Re: On Wikipedia edits archive per county.
Thank you for this, again. Sorry to pester you with this again but, do you know of any other data (from a source different from Google) where online activity could be measured? Any leads would be quite appreciated.
mmm, not geolocated that I know of, and it's unlikely that you'd find that. Because either
an organization is for-profit, in which case they would sell that data
an organization is non-profit, in which case they'd likely need to
protect their users and bump up against the same *hard* problems we did
But I could be wrong, good luck in your search and do report back if you find any, and especially if you find approaches that help to protect privacy.
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
Again, we are exclusively looking for the absolute number of Wikipedia updates per year per county.
https://stats.wikimedia.org/wikimedia/squids/ has data on 23 quarters. To get the absolute number, you can multiply the percentage by the totals at https://stats.wikimedia.org/EN/TablesDatabaseEdits.htm .
These are HTML tables, but nothing LibreOffice Calc can't digest.
Nemo
Thank you for your reply. While reviewing both tables, the process to define the total number of edits per country is still a bit ambiguous. For example, in the case of Spanish, edits can be allocated to various countries. In addition not all quarters for 2015 are available.
Would it be possible to obtain instead the overall number of edits per country for the year 2015?
Perhaps a call will be easier. Can you provide me with a number to call to follow up with these inquiries?
Sincerely,
Rafael Escalona Reynoso, PhD, MPA. Lead Researcher at The Global Innovation Index Cornell College of Business Cornell University Ithaca, NY 14853
Phone: +1 (607) 262-0983 Email: re32@cornell.edu www.globalinnovationindex.org
-----Original Message----- From: Federico Leva (Nemo) [mailto:nemowiki@gmail.com] Sent: Monday, January 02, 2017 8:59 AM To: A mailing list for the Analytics Team at WMF and everybody who has an interest in Wikipedia and analytics.; Rafael Escalona Reynoso Cc: Jordan Litner; Wunsch-Vincent, Sacha Subject: Re: [Analytics] On Wikipedia edits archive per county.
Again, we are exclusively looking for the absolute number of Wikipedia updates per year per county.
https://stats.wikimedia.org/wikimedia/squids/ has data on 23 quarters. To get the absolute number, you can multiply the percentage by the totals at https://stats.wikimedia.org/EN/TablesDatabaseEdits.htm .
These are HTML tables, but nothing LibreOffice Calc can't digest.
Nemo
Probably someone at WMF would be more appropriate for a call; I can only share information which is online.
Rafael Escalona Reynoso, 02/01/2017 17:40:
Thank you for your reply. While reviewing both tables, the process to define the total number of edits per country is still a bit ambiguous. For example, in the case of Spanish, edits can be allocated to various countries.
Sorry, I should have been clearer. What you need is the language breakdown, like this (which was last calculated in 2013, it seems): https://stats.wikimedia.org/wikimedia/squids/SquidReportPageEditsPerLanguage...
If you only consider the numbers above 3 % or so, these tend to be rather stable, so the 2013 data is still a good approximation.
In addition not all quarters for 2015 are available.
Would it be possible to obtain instead the overall number of edits per country for the year 2015?
Ah sorry, I thought you wanted historical data more than recent data. Updates have been stalled by WMF while they work on a potential replacement, so I suspect you can only adapt older data or wait.
Nemo
Dear Federico, Thank you for your reply. I have been in touch with the Analytics team making inquiries about this data. Is there a particular person I should forward my request within WMF? Can you please provide me with a contact there?
Best,
Rafael Escalona Reynoso, PhD, MPA. Lead Researcher at The Global Innovation Index Cornell College of Business Cornell University Ithaca, NY 14853
Phone: +1 (607) 262-0983 Email: re32@cornell.edu www.globalinnovationindex.org
-----Original Message----- From: Federico Leva (Nemo) [mailto:nemowiki@gmail.com] Sent: Monday, January 02, 2017 12:07 PM To: Rafael Escalona Reynoso; A mailing list for the Analytics Team at WMF and everybody who has an interest in Wikipedia and analytics. Cc: Jordan Litner; Wunsch-Vincent, Sacha Subject: Re: [Analytics] On Wikipedia edits archive per county.
Probably someone at WMF would be more appropriate for a call; I can only share information which is online.
Rafael Escalona Reynoso, 02/01/2017 17:40:
Thank you for your reply. While reviewing both tables, the process to define the total number of edits per country is still a bit ambiguous. For example, in the case of Spanish, edits can be allocated to various countries.
Sorry, I should have been clearer. What you need is the language breakdown, like this (which was last calculated in 2013, it seems): https://stats.wikimedia.org/wikimedia/squids/SquidReportPageEditsPerLanguage...
If you only consider the numbers above 3 % or so, these tend to be rather stable, so the 2013 data is still a good approximation.
In addition not all quarters for 2015 are available.
Would it be possible to obtain instead the overall number of edits per country for the year 2015?
Ah sorry, I thought you wanted historical data more than recent data. Updates have been stalled by WMF while they work on a potential replacement, so I suspect you can only adapt older data or wait.
Nemo
Rafael,
The most recent data we have on this regard publicy available has already been linked on this thread. Sorry we do not have more current data, as I pointed out couple times before we will likely have newer data by April 2017 when our ongoing project regarding edit data reconstruction (we expect) would be mostly completed.
Thanks,
Nuria
On Mon, Jan 2, 2017 at 9:35 AM, Rafael Escalona Reynoso re32@cornell.edu wrote:
Dear Federico, Thank you for your reply. I have been in touch with the Analytics team making inquiries about this data. Is there a particular person I should forward my request within WMF? Can you please provide me with a contact there?
Best,
Rafael Escalona Reynoso, PhD, MPA. Lead Researcher at The Global Innovation Index Cornell College of Business Cornell University Ithaca, NY 14853
Phone: +1 (607) 262-0983 Email: re32@cornell.edu www.globalinnovationindex.org
-----Original Message----- From: Federico Leva (Nemo) [mailto:nemowiki@gmail.com] Sent: Monday, January 02, 2017 12:07 PM To: Rafael Escalona Reynoso; A mailing list for the Analytics Team at WMF and everybody who has an interest in Wikipedia and analytics. Cc: Jordan Litner; Wunsch-Vincent, Sacha Subject: Re: [Analytics] On Wikipedia edits archive per county.
Probably someone at WMF would be more appropriate for a call; I can only share information which is online.
Rafael Escalona Reynoso, 02/01/2017 17:40:
Thank you for your reply. While reviewing both tables, the process to define the total number of edits per country is still a bit ambiguous. For example, in the case of Spanish, edits can be allocated to various countries.
Sorry, I should have been clearer. What you need is the language breakdown, like this (which was last calculated in 2013, it seems): https://stats.wikimedia.org/wikimedia/squids/ SquidReportPageEditsPerLanguageBreakdown.htm
If you only consider the numbers above 3 % or so, these tend to be rather stable, so the 2013 data is still a good approximation.
In addition not all quarters for 2015 are available.
Would it be possible to obtain instead the overall number of edits per country for the year 2015?
Ah sorry, I thought you wanted historical data more than recent data. Updates have been stalled by WMF while they work on a potential replacement, so I suspect you can only adapt older data or wait.
Nemo
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
Hi Rafael,
The exact data you're looking for, as mentioned by Nuria, is not readily available for sharing. If you find it useful, I'm happy to set up a call with you to understand what your needs are (I've read the details of https://phabricator.wikimedia.org/T131889 already) and brainstorm with you about other innovative ways you can use Wikipedia/Wikimedia data for the purposes of these reports. If that's of interest, please ping me off-list.
Best, Leila
Leila Zia Senior Research Scientist Wikimedia Foundation
On Tue, Jan 17, 2017 at 8:51 AM, Nuria Ruiz nuria@wikimedia.org wrote:
Rafael,
The most recent data we have on this regard publicy available has already been linked on this thread. Sorry we do not have more current data, as I pointed out couple times before we will likely have newer data by April 2017 when our ongoing project regarding edit data reconstruction (we expect) would be mostly completed.
Thanks,
Nuria
On Mon, Jan 2, 2017 at 9:35 AM, Rafael Escalona Reynoso re32@cornell.edu wrote:
Dear Federico, Thank you for your reply. I have been in touch with the Analytics team making inquiries about this data. Is there a particular person I should forward my request within WMF? Can you please provide me with a contact there?
Best,
Rafael Escalona Reynoso, PhD, MPA. Lead Researcher at The Global Innovation Index Cornell College of Business Cornell University Ithaca, NY 14853
Phone: +1 (607) 262-0983 Email: re32@cornell.edu www.globalinnovationindex.org
-----Original Message----- From: Federico Leva (Nemo) [mailto:nemowiki@gmail.com] Sent: Monday, January 02, 2017 12:07 PM To: Rafael Escalona Reynoso; A mailing list for the Analytics Team at WMF and everybody who has an interest in Wikipedia and analytics. Cc: Jordan Litner; Wunsch-Vincent, Sacha Subject: Re: [Analytics] On Wikipedia edits archive per county.
Probably someone at WMF would be more appropriate for a call; I can only share information which is online.
Rafael Escalona Reynoso, 02/01/2017 17:40:
Thank you for your reply. While reviewing both tables, the process to define the total number of edits per country is still a bit ambiguous. For example, in the case of Spanish, edits can be allocated to various countries.
Sorry, I should have been clearer. What you need is the language breakdown, like this (which was last calculated in 2013, it seems): https://stats.wikimedia.org/wikimedia/squids/SquidReportPage EditsPerLanguageBreakdown.htm
If you only consider the numbers above 3 % or so, these tend to be rather stable, so the 2013 data is still a good approximation.
In addition not all quarters for 2015 are available.
Would it be possible to obtain instead the overall number of edits per country for the year 2015?
Ah sorry, I thought you wanted historical data more than recent data. Updates have been stalled by WMF while they work on a potential replacement, so I suspect you can only adapt older data or wait.
Nemo
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
Dear Leila, Thank you for your reply. Can we schedule a call for tomorrow or Monday morning? Let me know what times work for you best.
Sincerely,
Rafael Escalona Reynoso.
On Tue, Jan 17, 2017 at 12:21 PM Leila Zia leila@wikimedia.org wrote:
Hi Rafael,
The exact data you're looking for, as mentioned by Nuria, is not readily available for sharing. If you find it useful, I'm happy to set up a call with you to understand what your needs are (I've read the details of https://phabricator.wikimedia.org/T131889 already) and brainstorm with you about other innovative ways you can use Wikipedia/Wikimedia data for the purposes of these reports. If that's of interest, please ping me off-list.
Best, Leila
Leila Zia Senior Research Scientist Wikimedia Foundation
On Tue, Jan 17, 2017 at 8:51 AM, Nuria Ruiz nuria@wikimedia.org wrote:
Rafael,
The most recent data we have on this regard publicy available has already been linked on this thread. Sorry we do not have more current data, as I pointed out couple times before we will likely have newer data by April 2017 when our ongoing project regarding edit data reconstruction (we expect) would be mostly completed.
Thanks,
Nuria
On Mon, Jan 2, 2017 at 9:35 AM, Rafael Escalona Reynoso re32@cornell.edu wrote:
Dear Federico,
Thank you for your reply. I have been in touch with the Analytics team
making inquiries about this data. Is there a particular person I should
forward my request within WMF? Can you please provide me with a contact
there?
Best,
Rafael Escalona Reynoso, PhD, MPA.
Lead Researcher at The Global Innovation Index
Cornell College of Business
Cornell University
Ithaca, NY 14853
Phone: +1 (607) 262-0983
Email: re32@cornell.edu
www.globalinnovationindex.org
-----Original Message-----
From: Federico Leva (Nemo) [mailto:nemowiki@gmail.com]
Sent: Monday, January 02, 2017 12:07 PM
To: Rafael Escalona Reynoso; A mailing list for the Analytics Team at WMF
and everybody who has an interest in Wikipedia and analytics.
Cc: Jordan Litner; Wunsch-Vincent, Sacha
Subject: Re: [Analytics] On Wikipedia edits archive per county.
Probably someone at WMF would be more appropriate for a call; I can only
share information which is online.
Rafael Escalona Reynoso, 02/01/2017 17:40:
Thank you for your reply. While reviewing both tables, the process to
define the total number of edits per country is still a bit ambiguous.
For example, in the case of Spanish, edits can be allocated to various
countries.
Sorry, I should have been clearer. What you need is the language breakdown,
like this (which was last calculated in 2013, it seems):
https://stats.wikimedia.org/wikimedia/squids/SquidReportPageEditsPerLanguage...
If you only consider the numbers above 3 % or so, these tend to be rather
stable, so the 2013 data is still a good approximation.
In
addition not all quarters for 2015 are available.
Would it be possible to obtain instead the overall number of edits per
country for the year 2015?
Ah sorry, I thought you wanted historical data more than recent data.
Updates have been stalled by WMF while they work on a potential replacement,
so I suspect you can only adapt older data or wait.
Nemo
Analytics mailing list
Analytics@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics
Analytics mailing list
Analytics@lists.wikimedia.org
Note that last email in this thread was retained by the mailing list filter for 6 days, because the sender was not subscribed to the list.
On Thu, Jan 19, 2017 at 4:34 PM, Rafael Escalona Reynoso re32@cornell.edu wrote:
Dear Leila, Thank you for your reply. Can we schedule a call for tomorrow or Monday morning? Let me know what times work for you best.
Sincerely,
Rafael Escalona Reynoso.
On Tue, Jan 17, 2017 at 12:21 PM Leila Zia leila@wikimedia.org wrote:
Hi Rafael,
The exact data you're looking for, as mentioned by Nuria, is not readily available for sharing. If you find it useful, I'm happy to set up a call with you to understand what your needs are (I've read the details of https://phabricator.wikimedia.org/T131889 already) and brainstorm with you about other innovative ways you can use Wikipedia/Wikimedia data for the purposes of these reports. If that's of interest, please ping me off-list.
Best, Leila
Leila Zia Senior Research Scientist Wikimedia Foundation
On Tue, Jan 17, 2017 at 8:51 AM, Nuria Ruiz nuria@wikimedia.org wrote:
Rafael,
The most recent data we have on this regard publicy available has already been linked on this thread. Sorry we do not have more current data, as I pointed out couple times before we will likely have newer data by April 2017 when our ongoing project regarding edit data reconstruction (we expect) would be mostly completed.
Thanks,
Nuria
On Mon, Jan 2, 2017 at 9:35 AM, Rafael Escalona Reynoso <re32@cornell.edu
wrote:
Dear Federico,
Thank you for your reply. I have been in touch with the Analytics team
making inquiries about this data. Is there a particular person I should
forward my request within WMF? Can you please provide me with a contact
there?
Best,
Rafael Escalona Reynoso, PhD, MPA.
Lead Researcher at The Global Innovation Index
Cornell College of Business
Cornell University
Ithaca, NY 14853
Phone: +1 (607) 262-0983
Email: re32@cornell.edu
www.globalinnovationindex.org
-----Original Message-----
From: Federico Leva (Nemo) [mailto:nemowiki@gmail.com]
Sent: Monday, January 02, 2017 12:07 PM
To: Rafael Escalona Reynoso; A mailing list for the Analytics Team at WMF
and everybody who has an interest in Wikipedia and analytics.
Cc: Jordan Litner; Wunsch-Vincent, Sacha
Subject: Re: [Analytics] On Wikipedia edits archive per county.
Probably someone at WMF would be more appropriate for a call; I can only
share information which is online.
Rafael Escalona Reynoso, 02/01/2017 17:40:
Thank you for your reply. While reviewing both tables, the process to
define the total number of edits per country is still a bit ambiguous.
For example, in the case of Spanish, edits can be allocated to various
countries.
Sorry, I should have been clearer. What you need is the language breakdown,
like this (which was last calculated in 2013, it seems):
https://stats.wikimedia.org/wikimedia/squids/ SquidReportPageEditsPerLanguageBreakdown.htm
If you only consider the numbers above 3 % or so, these tend to be rather
stable, so the 2013 data is still a good approximation.
In
addition not all quarters for 2015 are available.
Would it be possible to obtain instead the overall number of edits per
country for the year 2015?
Ah sorry, I thought you wanted historical data more than recent data.
Updates have been stalled by WMF while they work on a potential replacement,
so I suspect you can only adapt older data or wait.
Nemo
Analytics mailing list
Analytics@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics
Analytics mailing list
Analytics@lists.wikimedia.org
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
Thanks, Marcel.
Rafael, Jordan and I, and later Dan, Nuria, and I met 2 days ago and next steps are followed on https://phabricator.wikimedia.org/T131889. I will update this list if something beyond that ticket's activity should be reported on my end.
Best, Leila
On Wed, Jan 25, 2017 at 6:23 AM, Marcel Ruiz Forns mforns@wikimedia.org wrote:
Note that last email in this thread was retained by the mailing list filter for 6 days, because the sender was not subscribed to the list.
On Thu, Jan 19, 2017 at 4:34 PM, Rafael Escalona Reynoso <re32@cornell.edu
wrote:
Dear Leila, Thank you for your reply. Can we schedule a call for tomorrow or Monday morning? Let me know what times work for you best.
Sincerely,
Rafael Escalona Reynoso.
On Tue, Jan 17, 2017 at 12:21 PM Leila Zia leila@wikimedia.org wrote:
Hi Rafael,
The exact data you're looking for, as mentioned by Nuria, is not readily available for sharing. If you find it useful, I'm happy to set up a call with you to understand what your needs are (I've read the details of https://phabricator.wikimedia.org/T131889 already) and brainstorm with you about other innovative ways you can use Wikipedia/Wikimedia data for the purposes of these reports. If that's of interest, please ping me off-list.
Best, Leila
Leila Zia Senior Research Scientist Wikimedia Foundation
On Tue, Jan 17, 2017 at 8:51 AM, Nuria Ruiz nuria@wikimedia.org wrote:
Rafael,
The most recent data we have on this regard publicy available has already been linked on this thread. Sorry we do not have more current data, as I pointed out couple times before we will likely have newer data by April 2017 when our ongoing project regarding edit data reconstruction (we expect) would be mostly completed.
Thanks,
Nuria
On Mon, Jan 2, 2017 at 9:35 AM, Rafael Escalona Reynoso < re32@cornell.edu> wrote:
Dear Federico,
Thank you for your reply. I have been in touch with the Analytics team
making inquiries about this data. Is there a particular person I should
forward my request within WMF? Can you please provide me with a contact
there?
Best,
Rafael Escalona Reynoso, PhD, MPA.
Lead Researcher at The Global Innovation Index
Cornell College of Business
Cornell University
Ithaca, NY 14853
Phone: +1 (607) 262-0983
Email: re32@cornell.edu
www.globalinnovationindex.org
-----Original Message-----
From: Federico Leva (Nemo) [mailto:nemowiki@gmail.com]
Sent: Monday, January 02, 2017 12:07 PM
To: Rafael Escalona Reynoso; A mailing list for the Analytics Team at WMF
and everybody who has an interest in Wikipedia and analytics.
Cc: Jordan Litner; Wunsch-Vincent, Sacha
Subject: Re: [Analytics] On Wikipedia edits archive per county.
Probably someone at WMF would be more appropriate for a call; I can only
share information which is online.
Rafael Escalona Reynoso, 02/01/2017 17:40:
Thank you for your reply. While reviewing both tables, the process to
define the total number of edits per country is still a bit ambiguous.
For example, in the case of Spanish, edits can be allocated to various
countries.
Sorry, I should have been clearer. What you need is the language breakdown,
like this (which was last calculated in 2013, it seems):
https://stats.wikimedia.org/wikimedia/squids/SquidReportPage EditsPerLanguageBreakdown.htm
If you only consider the numbers above 3 % or so, these tend to be rather
stable, so the 2013 data is still a good approximation.
In
addition not all quarters for 2015 are available.
Would it be possible to obtain instead the overall number of edits per
country for the year 2015?
Ah sorry, I thought you wanted historical data more than recent data.
Updates have been stalled by WMF while they work on a potential replacement,
so I suspect you can only adapt older data or wait.
Nemo
Analytics mailing list
Analytics@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics
Analytics mailing list
Analytics@lists.wikimedia.org
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
-- *Marcel Ruiz Forns* Analytics Developer Wikimedia Foundation
Hello, Just a quick follow up on this issue. Still have not heard back from someone at WMF with whom I can schedule a call.
Looking for data in this form for all 4 quarters of 2015:
https://stats.wikimedia.org/wikimedia/squids/SquidReportPageEditsPerCountryO...
This data was requested a year ago for its use in The Global Innovation Index, a United Nations report published every year. More information on the index here: https://www.globalinnovationindex.org/
Your reply is very appreciated and time-sensitive.
Best,
Rafael Escalona Reynoso, PhD, MPA. Lead Researcher at The Global Innovation Index Cornell College of Business Cornell University Ithaca, NY 14853
Phone: +1 (607) 262-0983 Email: re32@cornell.edu www.globalinnovationindex.org
-----Original Message----- From: Federico Leva (Nemo) [mailto:nemowiki@gmail.com] Sent: Monday, January 02, 2017 12:07 PM To: Rafael Escalona Reynoso; A mailing list for the Analytics Team at WMF and everybody who has an interest in Wikipedia and analytics. Cc: Jordan Litner; Wunsch-Vincent, Sacha Subject: Re: [Analytics] On Wikipedia edits archive per county.
Probably someone at WMF would be more appropriate for a call; I can only share information which is online.
Rafael Escalona Reynoso, 02/01/2017 17:40:
Thank you for your reply. While reviewing both tables, the process to define the total number of edits per country is still a bit ambiguous. For example, in the case of Spanish, edits can be allocated to various countries.
Sorry, I should have been clearer. What you need is the language breakdown, like this (which was last calculated in 2013, it seems): https://stats.wikimedia.org/wikimedia/squids/SquidReportPageEditsPerLanguage...
If you only consider the numbers above 3 % or so, these tend to be rather stable, so the 2013 data is still a good approximation.
In addition not all quarters for 2015 are available.
Would it be possible to obtain instead the overall number of edits per country for the year 2015?
Ah sorry, I thought you wanted historical data more than recent data. Updates have been stalled by WMF while they work on a potential replacement, so I suspect you can only adapt older data or wait.
Nemo