bumpy-piano-21462
11/07/2024, 5:00 PM2024-11-07 08:40:17.452 [44/train_model/155 (pid 1995363)] RandomizedSearchCV initialized.
2024-11-07 08:40:17.452 [44/train_model/155 (pid 1995363)] Proceeding to the evaluation step.
2024-11-07 08:41:59.554 1 task is running: train_model (1 running; 0 done).
2024-11-07 08:41:59.554 No tasks are waiting in the queue.
2024-11-07 08:41:59.554 2 steps have not started: end, evaluate.
2024-11-07 08:46:59.851 1 task is running: train_model (1 running; 0 done).
2024-11-07 08:46:59.851 No tasks are waiting in the queue.
2024-11-07 08:46:59.851 2 steps have not started: end, evaluate.
2024-11-07 08:48:33.830 [44/train_model/155 (pid 1995363)] Internal error:
2024-11-07 08:48:33.831 [44/train_model/155 (pid 1995363)] ('Connection aborted.', TimeoutError('The write operation timed out'))
2024-11-07 08:48:40.879 [44/train_model/155 (pid 1995363)]
2024-11-07 08:48:41.211 [44/train_model/155 (pid 1995363)] Task failed.
the Proceeding to the evaluation step log is the last print statment i have in the task before calling self.next() in the workflow, this hangs forever and never comes back. I feel like there might be a time out in writting to the data store but i cannot confirm this, is there anywhere i can see more detailed logs from metaflow to see what is going on?ancient-application-36103
11/11/2024, 10:25 AMbumpy-piano-21462
11/12/2024, 8:46 AM