It would be nice to have the page counts split at least by language.
(and it would reduce the load on your machine)
please let me know If I could help with the code
cheers,
Diego
On Mon, Jan 7, 2013 at 9:55 AM, Ed Summers <ehs(a)pobox.com> wrote:
Have you seen the Page View Statistics data that is
available?
http://dumps.wikimedia.org/other/pagecounts-raw/
The page counts are not broken out by category, or country, but they
do include the project and language. So in theory you can do something
like this:
curl
http://dumps.wikimedia.org/other/pagecounts-raw/2013/2013-01/pagecounts-201…
| zcat - | egrep '^hi '
To see the Hindi Wikipedia page views for 2013-01-07 08:00. I say in
theory because the download server seems to be somewhat slow at the
moment (100K/s) so I didn't see it actually work :-)
//Ed
_______________________________________________
Analytics mailing list
Analytics(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/analytics
--
Computers are useless. They can only give you answers.
(Pablo Picasso)
_______________
Diego Ceccarelli
High Performance Computing Laboratory
Information Science and Technologies Institute (ISTI)
Italian National Research Council (CNR)
Via Moruzzi, 1
56124 - Pisa - Italy
Phone: +39 050 315 3055
Fax: +39 050 315 2040
________________________________________