Hi, I am hitting some problems in aws batch. The ...
# ask-metaflow
p
Hi, I am hitting some problems in aws batch. The batch Job says its state is RUNNING, but nothing happens (no further logs, CPU usage is zero %) Metaflow reports that the task has failed. I see logs from the pip decorator we use before each step and then nothing. Any idea how to debug? Basically the error 'seems' to happen (more?) when using step functions. This is a parallel step so perhaps 30 out of 100 steps succeeded. Doing another run, I notice that Metaflow reports runs failed even though they later succeed (eg when i refresh the ui)
👋 2