[Crossposted to Wikitech-l]
Hello, all!
As a reminder, the planned partial downtime of the Labs network
filesystem for maintenance is today (January 15) at 18:00 UTC and is
scheduled to last up to 24 hours - but I have high hopes that it will be
much shorter than that.
In addition to the expected impact during the window (copied below),
there will be a brief (less than 5 minutes) complete suspension of
service to the affected filesystems at the very beginning of the process.
This should not cause errors, but it may cause many services to stall briefly.
-- Marc
On 14-12-31 12:11 PM, Marc A. Pelletier wrote:
The expected impact is:
* Starting at the beginning of the window, /home and /data/project will
switch to read-only mode; any attempt to write to files in those trees
will result in EROFS errors being thrown. Reading from those
filesystems will still work as expected, as will writing to other
filesystems;
* Read performance may degrade noticeably as the disk subsystem will be
loaded to capacity;
* It will not be possible to manipulate the gridengine queue -
specifically, starting or stopping jobs will not work; and
* At the end of the window, when the operation is complete, the "old"
file system will go away and be replaced by the new one - this will
cause any access to files or directories that were previously opened
(including working directories) on the affected filesystems to error out
with ESTALE. Reopening files by name will access the new copy, which is
identical to the old one as it was when the filesystems became read-only.
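
For tool maintainers who want their code to cope with the two error
conditions described above, here is a minimal Python sketch. It is only
an illustration: the helper names (read_with_reopen, try_write) and the
retry count and delay are assumptions of mine, not part of any Labs
tooling. It reopens a file by name when ESTALE is raised, and reports
rather than crashes when a write hits EROFS.

```python
import errno
import time


def read_with_reopen(path, retries=3, delay=1.0):
    """Read a file by name, reopening it if the old handle has gone stale.

    Hypothetical helper: retries and delay are illustrative defaults.
    """
    for attempt in range(retries):
        try:
            # Opening by name each time picks up the new filesystem copy.
            with open(path, "rb") as handle:
                return handle.read()
        except OSError as exc:
            # Only ESTALE is retried; anything else is re-raised as-is.
            if exc.errno != errno.ESTALE or attempt == retries - 1:
                raise
            time.sleep(delay)


def try_write(path, data):
    """Append data to path; return False if the filesystem is read-only."""
    try:
        with open(path, "ab") as handle:
            handle.write(data)
        return True
    except OSError as exc:
        if exc.errno == errno.EROFS:
            # Expected during the maintenance window on the affected trees.
            return False
        raise
```

A long-running job could call try_write() and, on False, buffer its
output or write to a filesystem that is not affected until the window
ends. Note that a process whose working directory is on an affected
filesystem will also need to change back into it by name afterwards.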