On Dec 25, 2003, at 08:00, Jimmy Wales wrote:
From now on when we think about how to spend money, we should think about
this morning, and think about redundancy, which is expensive but probably necessary as we get bigger and bigger.
When Geoffrin comes back online, I intend to move the live database back to Pliny. Geoffrin is unsuitable for production use until it is thoroughly tested for defects and rebootable in case of future crashes. In just three weeks we've had two hard crashes and two instances of file corruption.
A RAM test needs to be done; that's a highly likely source of errors (over 32 billion bits of memory; if just _one_ is defective, it can cause data corruption or a crash).
We *really should* get database replication going, if at some point we have two working machines with enough disk space to handle the database, so a dead database server can be taken over by the slave.
-- brion vibber (brion @ pobox.com)