[Crossposted to Wikitech-l]
Hello, all!
As a reminder, the planned partial downtime of the Labs network filesystem for maintenance is today (January 15) at 18:00 UTC and is scheduled to last up to 24 hours - but I have high hopes that it will be much shorter than that.
In addition to the expected impact during the window (copied below) there will be a brief (less than 5 minutes) complete suspension of service to the affected filesystems at the very beginning of the process that should not cause errors but may cause many services to stall briefly.
-- Marc
On 14-12-31 12:11 PM, Marc A. Pelletier wrote:
The expecte impacts is:
- Starting at the beginning of the window, /home and /data/project will
switch to readonly mode; any attempt to write to files to those trees will result in EROFS errors being thrown. Reading from those filesystems will still work as expected, so would writing to other filesystems;
- Read performance may degrade noticably as the disk subsystem will be
loaded to capacity;
- It will not be possible to manipulate the gridengine queue -
specifically, starting or stopping jobs will not work; and
- At the end of the window, when the operation is complete, the "old"
file system will go away and be replaced by the new one - this will cause any access to files or directories that were previously opened (including working directories) on the affected filesystems to error out with ESTALE. Reopening files by name will access the new copy identical to the one at the time the filesystems became readonly.
wikitech-l@lists.wikimedia.org