quick-lighter-52296
01/20/2023, 6:13 AM--with retry
so that they are retried only in case of platform/infra errors and not user-level errors? IIRC metaflow retries both in case of user raised exceptions and platform errors, this is good for production but for our staging/test environments we want the flows to fail fast and fail early. If we remove all retries then the issue is that the flow sometimes fail due to infra issues / provisioning timeouts from ECS, etc. that we still want to avoid