Hi all,
I’m in the middle of a (slow) upgrade process for the Hadoop cluster. Currently, we are running CDH 5.0.2, and would like to upgrade to CDH 5.3. There are several steps to this process, the first of which is upgrading our OS to Ubuntu Trusty.
Along the way, I’m replacing our current NameNodes with different hardware. I am ready to do this now. I don’t see much opportunity to schedule this over the next couple of weeks, due to All-Hands travel, so I’d like to schedule this for tomorrow morning (Friday January 16th).
I expect this to be relatively simple downtime, that will only take a few minutes. Just in case, I’d like to reserve 2 hours of time.
So, unless there are serious objections, plan for Hadoop to be offline from
2015-01-16 15:45 - 17:45 UTC
Also, please don’t start jobs before this time slot that you think will take a long time. If there are running jobs, I either can’t shut down the cluster, or I will have to kill the jobs. If I see running jobs, I’ll try to reach out to you before I kill anything.
If anyone is interested in a rough migration plan, it is here:
https://wikitech.wikimedia.org/wiki/Analytics/Cluster/Hadoop/Administration#... https://wikitech.wikimedia.org/wiki/Analytics/Cluster/Hadoop/Administration#Migrating_to_new_HA_NameNodes
Thanks all!
-Ao
I am starting this now.
On Jan 15, 2015, at 11:34, Andrew Otto aotto@wikimedia.org wrote:
Hi all,
I’m in the middle of a (slow) upgrade process for the Hadoop cluster. Currently, we are running CDH 5.0.2, and would like to upgrade to CDH 5.3. There are several steps to this process, the first of which is upgrading our OS to Ubuntu Trusty.
Along the way, I’m replacing our current NameNodes with different hardware. I am ready to do this now. I don’t see much opportunity to schedule this over the next couple of weeks, due to All-Hands travel, so I’d like to schedule this for tomorrow morning (Friday January 16th).
I expect this to be relatively simple downtime, that will only take a few minutes. Just in case, I’d like to reserve 2 hours of time.
So, unless there are serious objections, plan for Hadoop to be offline from
2015-01-16 15:45 - 17:45 UTC
Also, please don’t start jobs before this time slot that you think will take a long time. If there are running jobs, I either can’t shut down the cluster, or I will have to kill the jobs. If I see running jobs, I’ll try to reach out to you before I kill anything.
If anyone is interested in a rough migration plan, it is here:
https://wikitech.wikimedia.org/wiki/Analytics/Cluster/Hadoop/Administration#... https://wikitech.wikimedia.org/wiki/Analytics/Cluster/Hadoop/Administration#Migrating_to_new_HA_NameNodes
Thanks all!
-Ao
Done! analytics1001 and analytics1002 are now the Hadoop NameNodes, and the analytics1001 is the YARN master.
Thanks all!
-Ao
On Jan 16, 2015, at 10:46, Andrew Otto aotto@wikimedia.org wrote:
I am starting this now.
On Jan 15, 2015, at 11:34, Andrew Otto <aotto@wikimedia.org mailto:aotto@wikimedia.org> wrote:
Hi all,
I’m in the middle of a (slow) upgrade process for the Hadoop cluster. Currently, we are running CDH 5.0.2, and would like to upgrade to CDH 5.3. There are several steps to this process, the first of which is upgrading our OS to Ubuntu Trusty.
Along the way, I’m replacing our current NameNodes with different hardware. I am ready to do this now. I don’t see much opportunity to schedule this over the next couple of weeks, due to All-Hands travel, so I’d like to schedule this for tomorrow morning (Friday January 16th).
I expect this to be relatively simple downtime, that will only take a few minutes. Just in case, I’d like to reserve 2 hours of time.
So, unless there are serious objections, plan for Hadoop to be offline from
2015-01-16 15:45 - 17:45 UTC
Also, please don’t start jobs before this time slot that you think will take a long time. If there are running jobs, I either can’t shut down the cluster, or I will have to kill the jobs. If I see running jobs, I’ll try to reach out to you before I kill anything.
If anyone is interested in a rough migration plan, it is here:
https://wikitech.wikimedia.org/wiki/Analytics/Cluster/Hadoop/Administration#... https://wikitech.wikimedia.org/wiki/Analytics/Cluster/Hadoop/Administration#Migrating_to_new_HA_NameNodes
Thanks all!
-Ao
Not that if you use ssh tunnels to access any of the Hadoop GUIs that were previously on analytics1010, you should now use analytics1001 instead.
On Jan 16, 2015, at 11:47, Andrew Otto aotto@wikimedia.org wrote:
Done! analytics1001 and analytics1002 are now the Hadoop NameNodes, and the analytics1001 is the YARN master.
Thanks all!
-Ao
On Jan 16, 2015, at 10:46, Andrew Otto <aotto@wikimedia.org mailto:aotto@wikimedia.org> wrote:
I am starting this now.
On Jan 15, 2015, at 11:34, Andrew Otto <aotto@wikimedia.org mailto:aotto@wikimedia.org> wrote:
Hi all,
I’m in the middle of a (slow) upgrade process for the Hadoop cluster. Currently, we are running CDH 5.0.2, and would like to upgrade to CDH 5.3. There are several steps to this process, the first of which is upgrading our OS to Ubuntu Trusty.
Along the way, I’m replacing our current NameNodes with different hardware. I am ready to do this now. I don’t see much opportunity to schedule this over the next couple of weeks, due to All-Hands travel, so I’d like to schedule this for tomorrow morning (Friday January 16th).
I expect this to be relatively simple downtime, that will only take a few minutes. Just in case, I’d like to reserve 2 hours of time.
So, unless there are serious objections, plan for Hadoop to be offline from
2015-01-16 15:45 - 17:45 UTC
Also, please don’t start jobs before this time slot that you think will take a long time. If there are running jobs, I either can’t shut down the cluster, or I will have to kill the jobs. If I see running jobs, I’ll try to reach out to you before I kill anything.
If anyone is interested in a rough migration plan, it is here:
https://wikitech.wikimedia.org/wiki/Analytics/Cluster/Hadoop/Administration#... https://wikitech.wikimedia.org/wiki/Analytics/Cluster/Hadoop/Administration#Migrating_to_new_HA_NameNodes
Thanks all!
-Ao