On Tue, 29 Apr 2003, Lee Daniel Crocker wrote:
(Nick Reinking nick@twoevils.org): I notice that /usr (/dev/sda2) is at 96%... Perhaps somebody can remove a bunch of the packages that are installed that we don't use?
The server is very clean in terms of software. The big culprits for disk usage are MySQL's Innodb transaction data (currently a single 10Gb file!), and logfiles from MySQL and Apache.
Innodb keeps most of its goodies in that one big file, which can expand but cannot contract. Certain operations (like altering the table structure) involve making a complete duplicate of the database, altering it, then replacing the old one; so it's taking up nearly twice the space it actually _needs_ on a regular basis. On the plus side, it gives us room to grow. :)
There's also the www-bin.### files, which are the binary log. These track changes made to the database, and are rotated at 1 gigabyte or when the server is restarted. These are mainly useful for database replication, which we don't do _yet_ but will do in the future. For now, I just periodically delete the old ones. It can be disabled somehow, but we'll likely want them in the future so I've not bothered.
Now, here's the space used by the actual wiki files under /usr/local/apache:
2017344 htdocs 387056 logs 302384 htdocs-fr 183700 htdocs-sv 172480 htdocs-de 133512 htdocs-meta 129772 htdocs-eo 103748 htdocs-pl 98588 htdocs-es 97832 htdocs-ja 88128 htdocs-nl 71080 htdocs-da 31276 htdocs-zh 30896 htdocs-test 17036 htdocs-wiktionary 10868 htdocs-ko 7788 htdocs-ru 6784 htdocs-cs 4056 htdocs-bs 3256 htdocs-ms 3020 htdocs-el 2960 htdocs-tr 2788 htdocs-sh 2788 htdocs-ml 2772 htdocs-sr 2772 htdocs-hr 2740 htdocs-sep11
These include the php files, uploaded images, backup tarballs, webalizer stuff, and TeX-generated images. I've deleted saved log files from prior to one week ago (and those that are retained are gzipped).
Further breakdown on the English wiki: 1306996 tarballs 440728 upload 120176 stats 92972 tmp 28472 math 16796 images 4444 w ... some other small smidgens of files...
I do think we could lighten the load on Apache log files considerably now, to save both disk space and gain some performance. For instance, we logged user agents and referrers to get some stats, but I don't think we really need that anymore.
Oh, I think it's quite useful to get that information, otherwise I wouldn't know about *($%@^&*$%@# Grub.
Anyway, I cleaned out a few things and moved some of the older tarballs over to the archives in the home partition, and we're down to 85% usage on /usr.
-- brion vibber (brion @ pobox.com)