hallowed-soccer-94479
04/28/2025, 8:48 PM@checkpoint
decorator when using @parallel
steps? The docs I found here say TODO
https://github.com/outerbounds/metaflow-checkpoint-examples/blob/master/documentation/checkpoint_deco/checkpoint_usage.md#saving--loa[…]rallel-stepshallowed-soccer-94479
04/28/2025, 8:52 PMray.train.RunConfig
https://docs.ray.io/en/latest/train/api/doc/ray.train.RunConfig.htmlancient-application-36103
04/28/2025, 9:23 PMhallowed-glass-14538
04/28/2025, 9:47 PMcurrent.checkpoint.save
). If you are fully using Ray then you don't even need the decorator since you can let ray fully control the checkpoint storage/loading. Example :
from metaflow.metaflow_config import DATASTORE_SYSROOT_S3
from metaflow import current
import os
path_to_checkpoints = os.path.join(
DATASTORE_SYSROOT_S3,
"mf.ray_checkpoints",
current.flow_name,
current.run_id,
current.step_name,
current.parallel.control_task_id,
)
from ray.train import RunConfig
RunConfig(
storage_path = path_to_checkpoints
# ...
)
This way every execution will store checkpoints in a different location. You can set the value of path_to_checkpoints
to self
to make it a data artifact.hallowed-soccer-94479
04/28/2025, 9:49 PM