Ashar Voultoiz wrote:
Yes. Not enough cpu / disk space to generate statistics. iirc the last attempt resulted in a total block of the whole cluster. Hopefully one server will be dedicated to handling logs in a near future and webalizer stats will then be build again.
I strongly support the idea of a machine for handling logs, statistics, and research into those things. I think we could learn a lot about how the community really functions by studying logs, and the traffic stats are important for us to understand and project our growth patterns.
My question to wikitech-l: what sort of machine should we use for this? I am thinking that a fully loaded dual Opteron is overkill, but a typical apache is too small?
Probably we'd want to have a lot of disk space, possibly RAID 5 as the best balance between storage space, redundancy, and speed. (We don't want to lose a huge chunk of data to a bad hard drive, but on the other hand we don't need absolute speed either.)
Probably we'd want a decent CPU, but it doesn't have to be top-notch, since these are batch jobs and the machine should not be doing anything else anyway.
Your thoughts?
--Jimbo