Please update it hourly, to avoid duplicate downloads.
2009/7/3 Frederic Schutz schutz@mathgen.ch
On Fri, Jul 3, 2009 at 1:02 PM, emijrp emijrp@gmail.com wrote:
To update a template similar to {{Popular articles}} on the English Wikipedia.
Right now I'm downloading one .gz file (40 MB) each hour, which wouldn't be necessary if this directory were updated in "real time".
If nothing has changed since last time I checked, it is one of my cron jobs that does the update, and I am happy to run it every hour if needed (and if that is not a problem).
By the way, the directory has grown quite a bit and is getting difficult to use (even an "ls" takes ages to run), so I should probably change the layout a bit (e.g. having subdirectories for archives). At some point, we may have to delete the older files, or compress them (that's what Erik Zachte does for the "official" statistics), but I think there is enough space for now (let me know if any of you, especially ts-admins, think otherwise).
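As a rough illustration of the subdirectory idea above, here is a minimal Python sketch that moves files from one flat directory into per-month subdirectories. The filename pattern (pagecounts-YYYYMMDD-HHMMSS.gz), the YYYY/MM layout, and the function names are assumptions made for the example; none of them are taken from the actual setup on the toolserver.

```python
import re
import shutil
from pathlib import Path

# Assumed filename pattern (hypothetical, not confirmed in this thread):
# pagecounts-YYYYMMDD-HHMMSS.gz
NAME_RE = re.compile(r"^pagecounts-(\d{4})(\d{2})\d{2}-\d{6}\.gz$")


def archive_subdir(filename):
    """Return the 'YYYY/MM' subdirectory for a matching filename, else None."""
    m = NAME_RE.match(filename)
    if m is None:
        return None
    return f"{m.group(1)}/{m.group(2)}"


def archive_files(root):
    """Move matching files directly under root into root/YYYY/MM/.

    Returns the number of files moved. Files that do not match the
    assumed pattern, and existing subdirectories, are left untouched.
    """
    root = Path(root)
    moved = 0
    # Snapshot the listing first so that moving files does not disturb iteration.
    for entry in list(root.iterdir()):
        if not entry.is_file():
            continue
        sub = archive_subdir(entry.name)
        if sub is None:
            continue
        target_dir = root / sub
        target_dir.mkdir(parents=True, exist_ok=True)
        shutil.move(str(entry), str(target_dir / entry.name))
        moved += 1
    return moved
```

Run once from a cron job after each download, something like this would keep the top-level directory small enough for "ls" to stay usable; anyone fetching the files would then need to know the new paths.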
One short-term plan is, instead of simply downloading the files, to replicate part of the infrastructure set up by Erik (provide compressed and/or processed files) so that it is easier to use the data on the toolserver. Well, it was a short-term plan in January and then I was kept away from this work by other commitments...
Frédéric
Toolserver-l mailing list Toolserver-l@lists.wikimedia.org https://lists.wikimedia.org/mailman/listinfo/toolserver-l