Forwarding a note from Ashok Rao (cc’ed), can anyone comment on the dumps server returning 503s?
Ashok – we don’t have yet an in-house API to retrieve pageview data, but the Analytics team is working on one: see this thread https://phabricator.wikimedia.org/T44259#1341010. Depending on what you’re doing, http://stats.grok.se/ http://stats.grok.se/ may also come in handy.
Best, Dario
Begin forwarded message:
From: Ashok Rao raoashok@seas.upenn.edu Subject: Wikipedia Page views access Date: June 18, 2015 at 5:53:12 PM GMT+2 To: dario@wikimedia.org
Hi Dario,
Good morning. I'm a student at the University of Pennsylvania and I've been trying to perform a few analyses based on Wikipedia page views data. I've written a script that grabs data from the main dump site – https://dumps.wikimedia.org/other/pagecounts-raw/ https://dumps.wikimedia.org/other/pagecounts-raw/ – but run into many sporadic 503 errors (sometimes with the download link, other times with the main page itself). I noticed some of this data might be available directly on Wikimedia servers that can be utilized for research purposes.
I was hoping I could get access to this and appreciate your help.
Best, Ashok
-- Ashok M. Rao The Rajendra and Neera Singh Program in Market and Social Systems Engineering School of Engineering and Applied Sciences University of Pennsylvania | Class of '17