-----BEGIN PGP SIGNED MESSAGE-----
Hash: SHA1
Hello all
I got some strange info by mail tonight, first:
Unable to run job: error: no suitable queues. Exiting.
then later
Job 1601224 (subster_ar) Set in error state
 Exit Status = -1
 Signal      = unknown signal
 User        = drtrigon
 Queue       = short@ortelius.toolserver.org
 Host        = ortelius.toolserver.org
 Start Time  = <unknown>
 End Time    = <unknown>
 CPU         = NA
 Max vmem    = NA
failed assumedly before job because:
can't get password entry for user "drtrigon". Either the user does not exist or NIS error!
Use "qmod -c <jobid>" to clear job error state once the problem is fixed.

(see the attachments)
I think these were reactions to 2 of my cron jobs (the other 2 ran as usual). Now the strange thing is that I have a job in my SGE queue which I can no longer delete with 'qdel':
job-ID   prior    name        user      state  submit/start at      queue                          slots  ja-task-ID
1601224  0.00250  subster_ar  drtrigon  dt     02/12/2012 12:35:55  all.q@ortelius.toolserver.org  1
What should I do now? Or what am I doing wrong? I just want to delete this job, since it obviously crashed (though for strange reasons), and then start it again.
The cron(ie)tab entries are:
30 0 * * * cronsub -s subster_frr $HOME/pywikipedia/bot_control.py -subster -cron -lang:frr
0 1 * * * cronsub -s subster_en $HOME/pywikipedia/bot_control.py -subster -cron -lang:en
30 1 * * * cronsub -s subster_ar $HOME/pywikipedia/bot_control.py -subster -cron -lang:ar
(the job at 1:00 had no problems...)
So essentially there are 2 questions:
1.) How to remove this job from the queue (in order to restart it)?
2.) Why did this happen? As you can see, the job was started on 'ortelius'... is this the usual behaviour, or was another server down?
Maybe someone can give me a hint?
Thanks in advance and greetings!
DrTrigon