Please, update hourly, to avoid duplicated downloads.
On Fri, Jul 3, 2009 at 1:02 PM, emijrp<emijrp@gmail.com> wrote:If nothing has changed since last time I checked, it is one of my cron
> To update a template similar to {{Popular articles}} of English Wikipedia.
> Now, I'm downloading one .gz (40 MB) each hour, so, it wouldn't be neccesary
> if this directory is updated in "real time".
jobs that does the update, and I am happy to run it every hour if
needed (and if that is not a problem).
By the way, the directory has grown quite a bit and is getting
difficult to use (even an "ls" takes ages to run), so I should
probably change the layout a bit (e.g. having subdirectories for
archives). At some point, we may have to delete the older files, or
compress them (that's what Erik Zachte does for the "official"
statistics), but I think there is enough space for now (let me know if
any of you, especially ts-admins, think otherwise).
One short-term plan is, instead of simply downloading the files, to
replicate part of the infrastucture set up by Erik (provide compressed
and/or processed files) so that it is easier to use the data on the
toolserver. Well, it was a short-term plan in January and then I was
kept away from this work by other comitments...
Frédéric
_______________________________________________
Toolserver-l mailing list
Toolserver-l@lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/toolserver-l