Nuria/Kevin,
If I understand the request correctly, it seems to be asking for data of
this form for pages in the Italian wikipedia that are about places of
interest.
*Timestamp, Geo data of the request - Country, city etc obtained from
geolocating the IP, Page Title.*
Nima, if I understand this right, we have this data available in the
internal webrequest logs - however it is highly sensitive and I don't think
can be published as a dataset publicly. Getting access to work with this
type of data (involving geo-data) usually involves an NDA process etc -
which I'm not an expert on and will let others who know better help with.
On Thu, Apr 7, 2016 at 9:16 AM, Kevin Leduc <kevin(a)wikimedia.org> wrote:
Hi Nima,
It should be possible, and it is interesting to merge geodata with
pageviews. Newer pageview data may be easier to work with:
https://dumps.wikimedia.org/other/analytics/
I wonder if the timing when GPS data became available in an article has
any impact on pageviews. It may be easier to assume that is not the case
so you don't have to look at article's history as well.
Wikidata will also be an easy way to query for GPS data. Check out this
mapping of data with coordinates:
https://ddll.inf.tu-dresden.de/web/Wikidata/Maps-06-2015/en
On Tue, Apr 5, 2016 at 4:07 AM, Nima Dashtban <nima.dashtban(a)gmail.com>
wrote:
Hi there,
Hope my email finds you well. My name is Nima Dashtban and I'm a student
of computer science in Ca'foscari University of Venice / Italy.
I am investigating these access logs of wikipedia pages:
https://dumps.wikimedia.org/other/pagecounts-raw/
In particular I would like to build up an DB of the time series of
accesses to (Italian) pages of wikipedia that have a GPS position, i.e.
wikipedia page that refer to geographical point of interests. I think that
such data could be useful as predictive signal of interest of potential
visitors of such geographical places.
Any help of you whether you say it is possible or not would be huge for
me.
Sincerely and Regards,
Nima Dashtban
_______________________________________________
Analytics mailing list
Analytics(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics
_______________________________________________
Analytics mailing list
Analytics(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics
--
--Madhu :)