I have a cronietab job that now gives this error message:
error: commlib error: can't connect to service (Connection refused) Unable to run job: unable to send message to qmaster using port 444 on host "damiana": got send error. Exiting.
What does that mean, and how can I fix it? I have no idea what commlib or damiana are, but perhaps they are related to qsub?
The command I tried to run (from cronie) was:
qsub -N links-s7 -l sql-s7-user-readonly=1 -l h_rt=11:00:00 -l virtual_free=250M -l user_slot=5 linkstats.sh 7
Where linkstats.sh is a shell script in ~la2/
On 05/02/2013 04:48 PM, Lars Aronsson wrote:
I have a cronietab job that now gives this error message:
error: commlib error: can't connect to service (Connection refused) Unable to run job: unable to send message to qmaster using port 444 on host "damiana": got send error. Exiting.
Now it has changed to:
error: commlib error: can't connect to service (Connection refused) Unable to run job: unable to send message to qmaster using port 444 on host "turnera-bge0": got send error. Exiting.
On 05/03/2013 11:16 PM, Lars Aronsson wrote:
On 05/02/2013 04:48 PM, Lars Aronsson wrote:
I have a cronietab job that now gives this error message:
error: commlib error: can't connect to service (Connection refused) Unable to run job: unable to send message to qmaster using port 444 on host "damiana": got send error. Exiting.
Now it has changed to:
error: commlib error: can't connect to service (Connection refused) Unable to run job: unable to send message to qmaster using port 444 on host "turnera-bge0": got send error. Exiting.
Here is a third variant, that I got today:
error: commlib error: can't connect to service (Connection refused) Unable to run job: unable to send message to qmaster using port 444 on host "clematis.toolserver.org": got send error. Exiting.
Can someone please explain how I should submit a cron/cronie job?
Lars Aronsson lars@aronsson.se wrote:
I have a cronietab job that now gives this error message:
error: commlib error: can't connect to service (Connection refused) Unable to run job: unable to send message to qmaster using port 444 on host "damiana": got send error. Exiting.
Now it has changed to:
error: commlib error: can't connect to service (Connection refused) Unable to run job: unable to send message to qmaster using port 444 on host "turnera-bge0": got send error. Exiting.
Here is a third variant, that I got today:
error: commlib error: can't connect to service (Connection refused) Unable to run job: unable to send message to qmaster using port 444 on host "clematis.toolserver.org": got send error. Exiting.
Can someone please explain how I should submit a cron/cronie job?
You shouldn't change anything. There have been some tran- sient errors in connection with the outage (NFS/SGE/LDAP failure), and these are artifacts of those. At the moment, SGE is up and running.
Tim
toolserver-l@lists.wikimedia.org