We're trying to justify using Metaflow on Batch ov...
# ask-metaflow
a
We're trying to justify using Metaflow on Batch over Sagemaker to our users, but one thing we noticed whilst training their model was that Sagemaker is slightly faster. I looked at the average time per epoch (100 epochs), and Batch was 10.67 minutes versus 9.09 minutes for Sagemaker training. The Batch training job on Metaflow is using the
p2.xlarge
instance type and the Sagemaker training job is using the
ml.p2.xlarge
instance type. Configuration for both was set the same. Any clue as to why the Sagemaker training job is slightly faster?
1