stale-scientist-29745
07/28/2023, 10:16 PMhundred
of files) on s3
. The size of the folder is the like of hundred
of GBs
.
• We download the data with https://docs.metaflow.org/api/S3#S3.get_all
• We modify something from the data
• We upload the data again with https://docs.metaflow.org/api/S3#S3.put_files to a new s3 path.
We noticed on a machine (with only 2 CPUs), downloading the data takes 30
mins but upload
the data again takes only 5
mins!!!!
How is that even possible? If any one nows why pls let me know here. Many thanks!