-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1
On 24.11.2012 21:15, Merlissimo wrote:
At 20:32 on Nov 23th sge on turnera stopped and was started at damiana. The qmaster thread started successfully because it responses pings and so on. But the scheduler thread seems not to work. qconf -tsm does not show any status information (which whould be written to logs when is send this command). That's why no new jobs are send to execution clients.
So the switch over on the ha-cluster failed.
...so is it supposed to be working now...?
@All: If you are working on big files please copy them to local temp first (on sge $TMP contains an individual temp dir for the job). E.g. piping big files to other slow programs causes much nfs load because data must be read in small packages which cause high load on servers. That's why sge cannot schedule new jobs on nightshade since days.
What is a big file? Is it ok if the file is in user-home?
Thanks and greetings DrTrigon