clean-monitor-54487
07/19/2024, 3:04 PMforeach
steps?victorious-lawyer-58417
07/19/2024, 5:34 PMvictorious-lawyer-58417
07/19/2024, 5:37 PMclean-monitor-54487
07/20/2024, 7:12 AM@batch
on the fly depending on cli args?
Alternatively, is it trivial to tell from current
whether the current method is running @batch
or not?clean-monitor-54487
07/20/2024, 7:30 AM--max-num-splits
being set to 100 by defaults lets me think that I should not increase to 1M samples. In that case, is mini-batching or sharding the dataset the recommended method?victorious-lawyer-58417
07/20/2024, 5:41 PM@batch
decorator on the fly too. There are a few days to do it:
• You can set --with batch
as a CLI option
• set METAFLOW_DECOSPECS=batch
environment variable
• or you could even make a simple custom decorator (see inspiration here) that attaches @batch
conditionally only to certain steps, depending on an environment variable / config file etc.victorious-lawyer-58417
07/20/2024, 5:42 PM@batch
, you can use this snippet:
is_running_on_batch = bool(os.environ.get('AWS_BATCH_JOB_ID'))
victorious-lawyer-58417
07/20/2024, 5:45 PMclean-monitor-54487
07/22/2024, 8:41 AM