A Toolforge job I'm running is stuck. I started it last Friday after it had
stopped once before for no apparent reason. When I checked today I got:
> tools.commonsdb-registry@tools-bastion-15:~$ toolforge jobs show make-declarations-wlm-2013
> +---------------+-----------------------------------------------------------------------------------------------------+
> | Job name:     | make-declarations-wlm-2013                                                                          |
> +---------------+-----------------------------------------------------------------------------------------------------+
> | Command:      | src/make_declaration.py -t mem:1.0Gi -t cpu:3.0 -t version:0.1.11 --verbose batch:category-26152515 |
> +---------------+-----------------------------------------------------------------------------------------------------+
> | Job type:     | one-off                                                                                             |
> +---------------+-----------------------------------------------------------------------------------------------------+
> | Image:        | tool-commonsdb-registry/tool-commonsdb-registry:latest                                              |
> +---------------+-----------------------------------------------------------------------------------------------------+
> | Port:         | none                                                                                                |
> +---------------+-----------------------------------------------------------------------------------------------------+
> | File log:     | yes                                                                                                 |
> +---------------+-----------------------------------------------------------------------------------------------------+
> | Output log:   | /data/project/commonsdb-registry/make-declarations-wlm-2013.out                                     |
> +---------------+-----------------------------------------------------------------------------------------------------+
> | Error log:    | /data/project/commonsdb-registry/make-declarations-wlm-2013.err                                     |
> +---------------+-----------------------------------------------------------------------------------------------------+
> | Emails:       | all                                                                                                 |
> +---------------+-----------------------------------------------------------------------------------------------------+
> | Resources:    | mem: 1.0Gi, cpu: 3.0                                                                                |
> +---------------+-----------------------------------------------------------------------------------------------------+
> | Replicas:     |                                                                                                     |
> +---------------+-----------------------------------------------------------------------------------------------------+
> | Mounts:       | all                                                                                                 |
> +---------------+-----------------------------------------------------------------------------------------------------+
> | Retry:        | no                                                                                                  |
> +---------------+-----------------------------------------------------------------------------------------------------+
> | Timeout:      | no                                                                                                  |
> +---------------+-----------------------------------------------------------------------------------------------------+
> | Health check: | none                                                                                                |
> +---------------+-----------------------------------------------------------------------------------------------------+
> | Status:       | Running for 3d23m                                                                                   |
> +---------------+-----------------------------------------------------------------------------------------------------+
> | Hints:        | Run not attempted yet. Pod in 'Pending' phase.                                                      |
> +---------------+-----------------------------------------------------------------------------------------------------+
As far as I can tell, that means it hasn't actually started. It's still
waiting for something, but I can't tell what. I've restarted it a couple of
times today, but it never gets past that stage.
When I've run similar jobs in the past, they sometimes took a while to
start. They always started eventually, though, so I thought maybe they were
waiting for resources or something.
Can anyone help me get this job started, or explain whether I've
misunderstood something?
*Sebastian Berlin*
Utvecklare/*Developer*
Wikimedia Sverige (WMSE)
E-post/*E-Mail*: sebastian.berlin(a)wikimedia.se
Telefon/*Phone*: (+46) 0707 - 92 03 84