Kate Turner wrote:
I'd like to do some statistics based on the database dumps, so I'd need a lot more disk quota (50GB would be fine, or at least 2GB for the cur-dumps :-)
i've increased your quota to 52428800 blocks (50Gbytes).
It's better to mirror dump files in only one place, so I created the directory
/u01/u/voj/dumps
yes, it would make sense to only have one copy of the dumps. since yours is already there, would you like to become the official dump copier? ;-)
At the moment I don't have time to write and test scripts to easily copy dumps on demand or with a cron job. For now it's probably enough to collect all copied dumps in the same way in the same place, so if you'd like to get a dump, you switch to the corresponding directory and wget it.
what particular problem did you have with it? you can see my .htaccess at /u01/u/kate/public_html/.htaccess which seems to work okay.
Thanks - now it works. And you can browse http://tools.wikimedia.de/~voj/dumps/
Please don't uncompress files if you don't have to. For instance you can read file.gz this way:
gzip -dc file.gz |
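For anyone doing the statistics in a script rather than a shell pipeline, the same idea can be sketched in Python. This is only a minimal illustration, assuming the dump is gzip-compressed text; the function name count_lines and the choice of UTF-8 with replacement of undecodable bytes are assumptions for the example, not something taken from the dumps themselves.

```python
import gzip


def count_lines(path):
    """Count the lines of a gzip-compressed text file without unpacking it to disk."""
    count = 0
    # "rt" opens the compressed file as a text stream; data is decompressed
    # chunk by chunk as it is read, so no uncompressed copy is written.
    with gzip.open(path, "rt", encoding="utf-8", errors="replace") as stream:
        for _ in stream:
            count += 1
    return count
```

Calling count_lines("file.gz") reads the file one line at a time, so memory use stays small even for a large dump, and the quota is only charged for the compressed file.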
btw, i was using /u01/wikipedia/dumps/ before. i've made that writable by users, if you feel like moving them.
I renamed the files and sorted them into /u01/u/voj/dumps/. If you want, you can also move everything somewhere else and make it accessible via
http://tools.wikimedia.de/dumps/
The filenames in .htaccess need to be changed when moving.
It's a pity that I didn't have this server 2 months ago. I have almost finished my master's thesis with Wikipedia statistics on smaller portions of the dumps, and now I could analyse all of them!
Greetings, Jakob