Hello all,
yesterday Merlissimo and I successfully tested the installation of the SGE- version for the toolserver. The last step is now to install the new version on the live-system. For that, the SGE-service needs to stop completely on the cluster, the old version has to be removed and the new one has to be installed. We plan to to this on
Thursday 5. July between 17:30 and 22:30 UTC.
During this time no SGE will work. There will be no restarting (and no migration) of stopped things after the update.
After the update is done, we will start to use the 2 Linux-boxes for tools too (I will send details than).
Sincerely, DaB.
Hello all, At Friday 06 July 2012 02:40:11 DaB. wrote:
We plan to to this on
Thursday 5. July between 17:30 and 22:30 UTC.
we had to extend the timeframe, but now the main-system is working again (with the new version!). More details tomorrow after our slumber. One important thing: If you run sge-task from the command-line, you have to logout and login one time (on each server) to get the new environment- variables.
Sincerely, DaB.
Submitting jobs from cronie on submit.ts is not working for me:
Cron bryan@hawthorn cronsub -l TsLogBot $HOME/projects/TsLogBot/TsLogBot.sh error: commlib error: can't connect to service (Connection refused) error: unable to send message to qmaster using port 444 on host "turnera-bge0": got send error error: commlib error: can't connect to service (Connection refused) Unable to run job: unable to send message to qmaster using port 444 on host "turnera-bge0": got send error. Exiting.
On Fri, Jul 6, 2012 at 2:51 AM, DaB. WP@daniel.baur4.info wrote:
Hello all, At Friday 06 July 2012 02:40:11 DaB. wrote:
We plan to to this on
Thursday 5. July between 17:30 and 22:30 UTC.
we had to extend the timeframe, but now the main-system is working again (with the new version!). More details tomorrow after our slumber. One important thing: If you run sge-task from the command-line, you have to logout and login one time (on each server) to get the new environment- variables.
Sincerely, DaB.
-- Userpage: [[:w:de:User:DaB.]] — PGP: 2B255885
Toolserver-l mailing list (Toolserver-l@lists.wikimedia.org) https://lists.wikimedia.org/mailman/listinfo/toolserver-l Posting guidelines for this list: https://wiki.toolserver.org/view/Mailing_list_etiquette
On Fri, Jul 6, 2012 at 1:12 PM, DaB. WP@daniel.baur4.info wrote:
Hello, At Friday 06 July 2012 13:12:33 DaB. wrote:
Submitting jobs from cronie on submit.ts is not working for me:
should be fixed now.
Yes it is. Thank you!
Hello, At Sunday 15 July 2012 14:15:11 DaB. wrote:
we had to extend the timeframe, but now the main-system is working again (with the new version!). More details tomorrow after our slumber.
there was never a detail-email, for which I'm sorry. So now some details: -SGE moved from /sge62 to /sge(/GE). So if you have /sge62 in your PATH, in your (login-)scripts or somewhere else you have to change that (/sge62 will vanish somewhen in near future). -You can use SGE now under linux too. -There is "-l arch=lx" now which will run your task at a linux-host. -There is also "-l arch='*'" that will run your task on linux or Solaris. -We created a shadow-master that should help if the HA-nodes are away for some reasons. -We will soon send mails if your task has used more resources than announced. -Under Linux the SGE-jobs run in a cgroup (one for each job). -Hawthorn and Clematis are not longer (available) submit-hosts. -Munin-graphs for sge are working again and can be found in the turnera- section at the moment. -SGE under linux is now handled by puppet. -qcronsub has now some colorful help-output (no sure if that is new). -The wiki-page for SGE [1] was updated.
If there are any other questions, please use the mailinglist. If you find a problem, please open a JIRA-ticket. Thanks for your patience.
Sincerely, DaB.
[1] https://wiki.toolserver.org/view/Job_scheduling
On 15/07/12 14:39, DaB. wrote:
-There is "-l arch=lx" now which will run your task at a linux-host.
Wouldn't a naming of arch=linux be preferable? Being simple to understand seems more important than the 3 characters saved.
-Hawthorn and Clematis are not longer (available) submit-hosts.
Connecting to submit.toolserver.org I still arrive at clematis...
Hello, At Sunday 15 July 2012 20:27:52 DaB. wrote:
-There is "-l arch=lx" now which will run your task at a linux-host.
Wouldn't a naming of arch=linux be preferable? Being simple to understand seems more important than the 3 characters saved.
I guess that is from SGE, but Merlissimo should have details.
-Hawthorn and Clematis are not longer (available) submit-hosts.
Connecting to submit.toolserver.org I still arrive at clematis...
Yes, you are right. I choose the wrong term. There not longer sge-execution- hosts.
Sincerely, DaB.
Am 15/07/12 20:32, DaB. schrieb:
-Hawthorn and Clematis are not longer (available) submit-hosts.
Connecting to submit.toolserver.org I still arrive at clematis...
Yes, you are right. I choose the wrong term. There not longer sge-execution- hosts.
That makes much more sense. Thanks.
On 2012-07-15 08:39, DaB. wrote:
-SGE moved from /sge62 to /sge(/GE). So if you have /sge62 in your PATH, in your (login-)scripts or somewhere else you have to change that (/sge62 will vanish somewhen in near future).
Great to learn. NOT! My weekly cron script failed because of this, so my data will be incomplete. The administration of the toolserver is a total disaster. Others be warned!
-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1
On 15.07.2012 14:39, DaB. wrote:
-There is "-l arch=lx" now which will run your task at a linux-host. -There is also "-l arch='*'" that will run your task on linux or Solaris.
...just wondering, what about running a job on NON-linux-hosts only? Something like "-l arch=bsd" would be useful too. Or is this useless since Solaris hosts will soon be gone anyway...?
Greetings DrTrigon
On 22 July 2012 10:29, Dr. Trigon dr.trigon@surfeu.ch wrote:
...just wondering, what about running a job on NON-linux-hosts only? Something like "-l arch=bsd" would be useful too. Or is this useless since Solaris hosts will soon be gone anyway...?
SGE will take sol as arch as well: -l arch=sol
-- Josh
toolserver-l@lists.wikimedia.org