Hi Nima,
It should be possible, and it is interesting to merge geodata with pageviews. Newer pageview data may be easier to work with: https://dumps.wikimedia.org/other/analytics/
I wonder if the timing when GPS data became available in an article has any impact on pageviews. It may be easier to assume that is not the case so you don't have to look at article's history as well.
Wikidata will also be an easy way to query for GPS data. Check out this mapping of data with coordinates: https://ddll.inf.tu-dresden.de/web/Wikidata/Maps-06-2015/en
On Tue, Apr 5, 2016 at 4:07 AM, Nima Dashtban nima.dashtban@gmail.com wrote:
Hi there,
Hope my email finds you well. My name is Nima Dashtban and I'm a student of computer science in Ca'foscari University of Venice / Italy.
I am investigating these access logs of wikipedia pages: https://dumps.wikimedia.org/other/pagecounts-raw/
In particular I would like to build up an DB of the time series of accesses to (Italian) pages of wikipedia that have a GPS position, i.e. wikipedia page that refer to geographical point of interests. I think that such data could be useful as predictive signal of interest of potential visitors of such geographical places.
Any help of you whether you say it is possible or not would be huge for me.
Sincerely and Regards, Nima Dashtban
Analytics mailing list Analytics@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/analytics