Hello, we have seen a weird but interesting thing:...
# ask-metaflow
s
Hello, we have seen a weird but interesting thing: we have a folder with a lot of files (e.g.
hundred
of files) on
s3
. The size of the folder is the like of
hundred
of
GBs
. • We download the data with https://docs.metaflow.org/api/S3#S3.get_all • We modify something from the data • We upload the data again with https://docs.metaflow.org/api/S3#S3.put_files to a new s3 path. We noticed on a machine (with only 2 CPUs), downloading the data takes
30
mins but
upload
the data again takes only
5
mins!!!! How is that even possible? If any one nows why pls let me know here. Many thanks!
1