Is there a way to find what are the most popular articles per country?
Finding the most popular articles per language is easy with the Pageviews
tool, but languages and countries are of course not the same.
One thing I tried is going to Turnilo, webrequest_sampled_128, and
filtering by country. But here it gets troublesome:
* Splitting can be done by Uri host, which is *more or less* the project,
or by Uri path, which is *more or less* the article (but see below), and I
couldn't find a convenient way to combine them.
* Mobile (.m.) and desktop hosts are separate. It may actually sometimes be
useful to see differences (or lack thereof) between desktop and mobile, but
combining them is often useful, too. This can probably be done with regular
expressions, but this brings us to the biggest problem:
* Filtering by Uri path would be useful if it didn't have so many paths for
images, beacons, etc. Filtering using the regular expression "\/wiki\/.+"
may be the right thing functionally, but in practice it's very slow or
doesn't work at all.
* I don't know what exactly is logged in webrequest_sampled_128, but the
name hints that it doesn't include everything. A sample may be OK for
countries with a lot of traffic like U.S. or Spain, but for countries with
smaller traffic this may start being a problem.
Any better ideas?
Amir Elisha Aharoni · אָמִיר אֱלִישָׁע אַהֲרוֹנִי
“We're living in pieces,
I want to live in peace.” – T. Moore