Kate Turner wrote:
I'd like to do some statistics based on the database dumps, so I'd need
a lot more disk quota (50GB would be fine, or at least 2GB for the
cur-dumps :-)
i've increased your quota to 52428800 blocks (50Gbytes).
It's better to only mirror dump files at one place, so I created the
directory
/u01/u/voj/dumps
yes, it would make sense to only have one copy of the dumps. since yours is
already there, would you like to become the official dump copier? ;-)
At the moment I don't have time to write and test scripts to easily
copy dumps on demand or with a cron job. For now it's probably enough
to collect all copied dumps in the same way and in the same place: if
you'd like to get some dumps, switch to the corresponding directory and
wget them.
what particular problem did you have with it? you can see my .htaccess
at /u01/u/kate/public_html/.htaccess which seems to work okay.
Thanks - now it works. And you can browse
http://tools.wikimedia.de/~voj/dumps/
Please don't uncompress files if you don't have to.
For instance you can read file.gz this way:
gzip -dc file.gz |
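As a minimal sketch of that idea, the function below counts the lines of a compressed file by streaming it through a pipe, so no uncompressed copy is ever written to disk. The name count_lines_gz is a hypothetical helper made up for illustration, and file.gz stands for whichever dump you are reading; any other filter could take the place of wc.

```shell
# Hypothetical helper (not an existing tool): count the lines of a
# gzip-compressed file without creating an uncompressed copy on disk.
count_lines_gz() {
    # gzip -dc decompresses to stdout; wc -l counts lines on the stream;
    # tr strips the padding that some wc implementations add.
    gzip -dc "$1" | wc -l | tr -d ' '
}
```

Usage would look like `count_lines_gz file.gz`; the only disk space used is that of the compressed file itself.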
btw, i was using /u01/wikipedia/dumps/ before. i've made that writable
by users, if you feel like moving them.
I renamed the files and sorted them into /u01/u/voj/dumps/. If you want,
you can also move everything somewhere else and make it accessible via
http://tools.wikimedia.de/dumps/
The filenames in .htaccess need to be changed when moving.
It's a pity that I did not have this server 2 months ago. I have almost
finished my master's thesis with Wikipedia statistics on smaller
portions of the dumps, and now I could analyse all of them!
Greetings,
Jakob