Hello Everyone I have the following question: I ...
# ask-metaflow
a
Hello Everyone I have the following question: I am working on an on-prem cluster and I want to schedule 50 pods in an
foreach
loop. I call the process from cli using
python flow.py run --max-workers 50
the processes are being started but shortly after the following error message appears
Copy code
2024-10-22 12:12:38.075 [1437/process_batches/10681 (pid 114822)] File "/tmp/tmp_momd9jb/metaflow/plugins/kubernetes/kubernetes_job.py", line 319, in execute
2024-10-22 12:12:38.075 [1437/process_batches/10681 (pid 114822)] raise KubernetesJobException(
2024-10-22 12:12:38.075 [1437/process_batches/10681 (pid 114822)] metaflow.plugins.kubernetes.kubernetes_job.KubernetesJobException: Unable to launch Kubernetes job.
2024-10-22 12:12:38.075 [1437/process_batches/10681 (pid 114822)] Timeout: request did not complete within requested timeout - context deadline exceeded
Does anybody have an idea how to set timeout to something bigger or any other idea whats happening here?