Hi!
Right now most stuff seems to be back up and running: viewing, editing, images, IRC feeds.
Editing-time outage: 6 hours, 15 minutes.
Now that we have enough money to perform better, we can start planning.
First of all, for 24/7 operation we'd need 9 fully qualified system administrators (for every 8h timeslot, with redundancy, leave/sick coverage, etc.) (payroll costs ~$100,000 each).
Of course, as these guys would be working on site operations, they wouldn't have any time for development. So we need >100 guys outsourced to India (payroll costs ~$1,000 each).
Sure, as open source technologies are not that suitable for round-the-clock operation, we might have to start using either Java (enterprise!!!) or .NET/C# (on the Microsoft platform, sure, enterprise again!!!).
Then, datacenters. To have a reasonably working read-only standby datacenter, we'd have to invest ~$250k in it (with current load).
We should of course have 3rd-party consultants to help set this up (+$100,000 for a one-time, one-week gig).
Moreover, we should invest in security measures (as wikis are often hacked). We'd need to screen traffic, ensure highly secure AAA subsystems, uh oh.
I guess other developers (especially those with experience in high-availability environments) could add more requirements for smooth operation!
So... let's start rolling!!!
Domas