Hi all, I'm seeking for some help with the followi...
# ask-metaflow
n
Hi all, I'm seeking for some help with the following setup. I have the following flow:
Copy code
...
@step
def schedule_workers(self):
    worker_inputs = [...]
    self.next(self.run_worker, foreach='worker_inputs')

@catch(var='timeout')
@retry(times=0)
@timeout(seconds=3600)
@step
def run_worker(self):
    ...

...
Flows are deployed to aws step functions. I want the run to succeed in case run_worker tasks time out. However, the problem that I'm experiencing is that as soon as one of the run_worker tasks time out, other run_worker tasks are aborted, and the run fails (exception is not caught). Anyone has any suggestions how to fix this? Let me know if you need more explanation. Thanks!