Please, update hourly, to avoid duplicated downloads.
2009/7/3 Frederic Schutz <schutz(a)mathgen.ch>
On Fri, Jul 3, 2009 at 1:02 PM,
emijrp<emijrp(a)gmail.com> wrote:
To update a template similar to {{Popular
articles}} of English
Wikipedia.
Now, I'm downloading one .gz (40 MB) each
hour, so, it wouldn't be
neccesary
if this directory is updated in "real
time".
If nothing has changed since last time I checked, it is one of my cron
jobs that does the update, and I am happy to run it every hour if
needed (and if that is not a problem).
By the way, the directory has grown quite a bit and is getting
difficult to use (even an "ls" takes ages to run), so I should
probably change the layout a bit (e.g. having subdirectories for
archives). At some point, we may have to delete the older files, or
compress them (that's what Erik Zachte does for the "official"
statistics), but I think there is enough space for now (let me know if
any of you, especially ts-admins, think otherwise).
One short-term plan is, instead of simply downloading the files, to
replicate part of the infrastucture set up by Erik (provide compressed
and/or processed files) so that it is easier to use the data on the
toolserver. Well, it was a short-term plan in January and then I was
kept away from this work by other comitments...
Frédéric
_______________________________________________
Toolserver-l mailing list
Toolserver-l(a)lists.wikimedia.org
https://lists.wikimedia.org/mailman/listinfo/toolserver-l