I'm on willow, and I'm not getting any error emails, but it looks like my cron jobs are being very seriously delayed. My one bot is supposed to run every two hours on the hour, but the interval between runs is increasing steadily.
----
User:Hersfold
hersfoldwiki@gmail.com
On 2/3/2013 2:32 PM, Alchimista wrote:
I've been having the same problem for two days, and from what I can see, it's happening only on hawthorn.

Alchimista


2013/2/3 Bryan Tong Minh <bryan.tongminh@gmail.com>
Hello,


SGE is spewing out various JSV errors, included below. Does anybody have an idea what is going on?


Bryan

Cron <bryan@hawthorn> qcronsub -l h_rt=00:10:00 -l arch=* -l virtual_free=200M -e $HOME/replag-plot.err -N replag-plot /home/bryan/public_html/stats/replag/make-plot.sh

error: JSV stderr: Traceback (most recent call last):
error: JSV stderr: File "/sge/GE/bin/sol-amd64/qjobtest", line 108, in <module>
error: JSV stderr: dom = minidom.parse(child_stdout)
error: JSV stderr: File "/opt/ts/python/2.7/lib/python2.7/site-packages/_xmlplus/dom/minidom.py", line 1915, in parse
error: JSV stderr: return expatbuilder.parse(file)
error: JSV stderr: File "/opt/ts/python/2.7/lib/python2.7/site-packages/_xmlplus/dom/expatbuilder.py", line 930, in parse
error: JSV stderr: result = builder.parseFile(file)
error: JSV stderr: File "/opt/ts/python/2.7/lib/python2.7/site-packages/_xmlplus/dom/expatbuilder.py", line 207, in parseFile
error: JSV stderr: parser.Parse(buffer, 0)
error: JSV stderr: xml.parsers.expat.ExpatError: syntax error: line 1, column 0
Unable to run job: JSV stderr: Traceback (most recent call last):
JSV stderr: File "/sge/GE/bin/sol-amd64/qjobtest", line 108, in <module>
JSV stderr: dom = minidom.parse(child_stdout)
JSV stderr: File "/opt/ts/python/2.7/lib/python2.7/site-packages/_xmlplus/dom/minidom.py", line 1915, in parse
JSV stderr: return expatbuilder.parse(file)
JSV stderr: File "/opt/ts/python/2.7/lib/python2.7/site-packages/_xmlplus/dom/expatbuilder.py", line 930, in parse
JSV stderr: result = builder.parseFile(file)
JSV stderr: File "/opt/ts/python/2.7/lib/python2.7/site-packages/_xmlplus/dom/expatbuilder.py", line 207, in parseFile
JSV stderr: parser.Parse(buffer, 0)
JSV stderr: xml.parsers.expat.ExpatError: syntax error: line 1, column 0
JSV stderr is - xml.parsers.expat.ExpatError: syntax error: line 1, column 0.
Exiting.

Cron <bryan@hawthorn> qcronsub -l h_rt=00:10:00 -l arch=* -l virtual_free=200M -e $HOME/replag-plot.err -N replag-plot /home/bryan/public_html/stats/replag/make-plot.sh

Unable to run job: unable to send message to qmaster using port 536 on host "damiana": got send error.
Exiting.
error: commlib error: can't connect to service (Connection refused)


Cron <bryan@hawthorn> qcronsub -l h_rt=INFINITY -l arch=sol -l virtual_free=200M -e $HOME/flickr_index.err -N flickr_index $HOME/projects/flickr_old/index.sh

error: JSV stderr: Traceback (most recent call last):
error: JSV stderr: File "/sge/GE/bin/sol-amd64/qjobtest", line 108, in <module>
Unable to run job: JSV stderr: Traceback (most recent call last):
JSV stderr: File "/sge/GE/bin/sol-amd64/qjobtest", line 108, in <module>
JSV stderr is - File "/sge/GE/bin/sol-amd64/qjobtest", line 108, in <module>.
Exiting.

Cron <bryan@hawthorn> qcronsub -l h_rt=00:10:00 -l arch=* -l virtual_free=200M -e $HOME/replag-plot.err -N replag-plot /home/bryan/public_html/stats/replag/make-plot.sh

error: JSV stderr: Traceback (most recent call last):
error: JSV stderr: File "/sge/GE/bin/sol-amd64/qjobtest", line 105, in <module>
error: JSV stderr: exitStatus = p.wait()
error: JSV stderr: File "/opt/ts/python/2.7/lib/python2.7/subprocess.py", line 1243, in wait
error: JSV stderr: pid, sts = _eintr_retry_call(os.waitpid, self.pid, 0)
error: JSV stderr: File "/opt/ts/python/2.7/lib/python2.7/subprocess.py", line 471, in _eintr_retry_call
error: JSV stderr: return func(*args)
error: JSV stderr: File "/sge/GE/bin/sol-amd64/qjobtest", line 89, in handleAlarm
error: JSV stderr: sys.exit(nagiosStateUnknown)
error: JSV stderr: NameError: global name 'nagiosStateUnknown' is not defined
Unable to run job: JSV stderr: Traceback (most recent call last):
JSV stderr: File "/sge/GE/bin/sol-amd64/qjobtest", line 105, in <module>
JSV stderr: exitStatus = p.wait()
JSV stderr: File "/opt/ts/python/2.7/lib/python2.7/subprocess.py", line 1243, in wait
JSV stderr: pid, sts = _eintr_retry_call(os.waitpid, self.pid, 0)
JSV stderr: File "/opt/ts/python/2.7/lib/python2.7/subprocess.py", line 471, in _eintr_retry_call
JSV stderr: return func(*args)
JSV stderr: File "/sge/GE/bin/sol-amd64/qjobtest", line 89, in handleAlarm
JSV stderr: sys.exit(nagiosStateUnknown)
JSV stderr: NameError: global name 'nagiosStateUnknown' is not defined
JSV stderr is - NameError: global name 'nagiosStateUnknown' is not defined.
Exiting.





_______________________________________________
Toolserver-l mailing list (Toolserver-l@lists.wikimedia.org)
https://lists.wikimedia.org/mailman/listinfo/toolserver-l
Posting guidelines for this list: https://wiki.toolserver.org/view/Mailing_list_etiquette



--
Alchimista
http://pt.wikipedia.org/wiki/Utilizador:Alchimista

