Hi All,
I am a summer research intern with the Commonwealth Scientific and Industrial Research Organisation (CSIRO) in Australia. I am studying a statistics degree and so I don't really have skills in the type of data collection required to access the Wiki data for my research. I was wondering if someone might be able to give me a hand (by pointing me in the right direction)?
I have a list of pest species that I wish to find the total number of page views via stats.grok.se or https://dumps.wikimedia.org/other/pagecounts-raw/ . There must be a good method to go through and pick out page views by name rather than by hand (which obviously isn't feasible)? I'd also need to be able to find the total number of page views for each period in order to standardize the response to account for the increase in traffic over the years.
We are in the process of gathering similar data through a Plant Pest database but due to privacy concerns, the organisation is arranging to reconcile the data on our behalf and so I do not have a part in that.
Any help would be really appreciated!
Kind regards, Caitlin Gardner
Hi Caitlin,
Do you have a list of relevant articles on pest species that you want to examine?
Cheers, Craig
On 14 December 2015 at 09:04, Caitlin.Gardner@csiro.au wrote:
Hi All,
I am a summer research intern with the Commonwealth Scientific and Industrial Research Organisation (CSIRO) in Australia. I am studying a statistics degree and so I don’t really have skills in the type of data collection required to access the Wiki data for my research. I was wondering if someone might be able to give me a hand (by pointing me in the right direction)?
I have a list of pest species that I wish to find the total number of page views via stats.grok.se or https://dumps.wikimedia.org/other/pagecounts-raw/ . There must be a good method to go through and pick out page views by name rather than by hand (which obviously isn’t feasible)? I’d also need to be able to find the total number of page views for each period in order to standardize the response to account for the increase in traffic over the years.
We are in the process of gathering similar data through a Plant Pest database but due to privacy concerns, the organisation is arranging to reconcile the data on our behalf and so I do not have a part in that.
Any help would be really appreciated!
Kind regards,
Caitlin Gardner
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
Hi Caitlin --
The Analytics team recently rolled out the page view API. Docs are here:
https://wikitech.wikimedia.org/wiki/Analytics/AQS/Pageview_API
There's a list of clients at the bottom of the page.
-Toby
On Sun, Dec 13, 2015 at 3:34 PM, Craig Franklin cfranklin@halonetwork.net wrote:
Hi Caitlin,
Do you have a list of relevant articles on pest species that you want to examine?
Cheers, Craig
On 14 December 2015 at 09:04, Caitlin.Gardner@csiro.au wrote:
Hi All,
I am a summer research intern with the Commonwealth Scientific and Industrial Research Organisation (CSIRO) in Australia. I am studying a statistics degree and so I don’t really have skills in the type of data collection required to access the Wiki data for my research. I was wondering if someone might be able to give me a hand (by pointing me in the right direction)?
I have a list of pest species that I wish to find the total number of page views via stats.grok.se or https://dumps.wikimedia.org/other/pagecounts-raw/ . There must be a good method to go through and pick out page views by name rather than by hand (which obviously isn’t feasible)? I’d also need to be able to find the total number of page views for each period in order to standardize the response to account for the increase in traffic over the years.
We are in the process of gathering similar data through a Plant Pest database but due to privacy concerns, the organisation is arranging to reconcile the data on our behalf and so I do not have a part in that.
Any help would be really appreciated!
Kind regards,
Caitlin Gardner
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
Hi Caitlin,
If you have a list of relevant articles and understanding what time period you would like to research, contact me of the list and I probably can help you. Also my advise: have a look at wikipediatrends.com or stats.grok.se and try some of your queries to get a better undestanding of possible results. Best wishes,
On Mon, Dec 14, 2015 at 12:04 AM, Caitlin.Gardner@csiro.au wrote:
Hi All,
I am a summer research intern with the Commonwealth Scientific and Industrial Research Organisation (CSIRO) in Australia. I am studying a statistics degree and so I don’t really have skills in the type of data collection required to access the Wiki data for my research. I was wondering if someone might be able to give me a hand (by pointing me in the right direction)?
I have a list of pest species that I wish to find the total number of page views via stats.grok.se or https://dumps.wikimedia.org/other/pagecounts-raw/ . There must be a good method to go through and pick out page views by name rather than by hand (which obviously isn’t feasible)? I’d also need to be able to find the total number of page views for each period in order to standardize the response to account for the increase in traffic over the years.
We are in the process of gathering similar data through a Plant Pest database but due to privacy concerns, the organisation is arranging to reconcile the data on our behalf and so I do not have a part in that.
Any help would be really appreciated!
Kind regards,
Caitlin Gardner
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
Hi Caitlin,
Here is a breakdown of categories within Phytopathology on English wikipedia: http://ow.ly/VQNVL
and the articles within those categories ranked by page view for Oct 2015 : http://ow.ly/VQNCv
I can run similar reports for earlier months.
Cheers,
Erik
From: Analytics [mailto:analytics-bounces@lists.wikimedia.org] On Behalf Of Alex Druk Sent: Monday, December 14, 2015 10:44 To: A mailing list for the Analytics Team at WMF and everybody who has an interest in Wikipedia and analytics. Subject: Re: [Analytics] Data collection
Hi Caitlin,
If you have a list of relevant articles and understanding what time period you would like to research, contact me of the list and I probably can help you.
Also my advise: have a look at wikipediatrends.com or stats.grok.se and try some of your queries to get a better undestanding of possible results.
Best wishes,
On Mon, Dec 14, 2015 at 12:04 AM, Caitlin.Gardner@csiro.au wrote:
Hi All,
I am a summer research intern with the Commonwealth Scientific and Industrial Research Organisation (CSIRO) in Australia. I am studying a statistics degree and so I don’t really have skills in the type of data collection required to access the Wiki data for my research. I was wondering if someone might be able to give me a hand (by pointing me in the right direction)?
I have a list of pest species that I wish to find the total number of page views via stats.grok.se or https://dumps.wikimedia.org/other/pagecounts-raw/ . There must be a good method to go through and pick out page views by name rather than by hand (which obviously isn’t feasible)? I’d also need to be able to find the total number of page views for each period in order to standardize the response to account for the increase in traffic over the years.
We are in the process of gathering similar data through a Plant Pest database but due to privacy concerns, the organisation is arranging to reconcile the data on our behalf and so I do not have a part in that.
Any help would be really appreciated!
Kind regards,
Caitlin Gardner
_______________________________________________ Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics
Erik Zachte, 14/12/2015 14:14:
I can run similar reports for earlier months.
Thanks for publishing that code too! https://github.com/wikimedia/analytics-wikistats/tree/master/dammit.lt/bash
Nemo