stale-scientist-29745
07/28/2023, 10:16 PMhundred of files) on s3. The size of the folder is the like of hundred of GBs.
• We download the data with https://docs.metaflow.org/api/S3#S3.get_all
• We modify something from the data
• We upload the data again with https://docs.metaflow.org/api/S3#S3.put_files to a new s3 path.
We noticed on a machine (with only 2 CPUs), downloading the data takes 30 mins but upload the data again takes only 5 mins!!!!
How is that even possible? If any one nows why pls let me know here. Many thanks!