Hi all,
Curious about something - if I enable checkpoints in Sagemaker training, and I prematurely stop the training tasks, would Metaflow's
resume
command be able to carry on the training task and load the model from the previous saved checkpoint? Or would it start a new sagemaker training instance? Thanks in advance!