Commit 326454d

docs: write out steps of the upload
1 parent 27d23bf commit 326454d

1 file changed: +14 -0 lines changed

examples/create_downampled.py

Lines changed: 14 additions & 0 deletions
@@ -971,6 +971,20 @@ def upload_any_remaining_chunks():
 
 return uploaded_count
 
+# Upload process
+# 1. Write chunks to a temporary local directory.
+# 2. When a processing step is done (or after every N steps), check the output
+#    files in the temp directory to see which ones are fully ready to be uploaded.
+# 3. Move these files to a separate folder for completed files.
+# 4. Upload the completed files to GCS using the gcloud command line tool.
+#    If there are no errors, write the names of all the files in that folder
+#    to a text file, always appending and never overwriting.
+# 5. Delete the files in the completed folder to save space.
+# Separately, during the chunk writing step, check whether any newly created
+# file is already listed in the uploaded-files text file. If it is, that is a
+# mistake and we should raise an error.
+# If there are errors during the upload step, crash the process and inspect
+# the error to decide how to proceed.
 
 # %% Move the data across with a single worker
 total_uploaded_files = 0
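
The upload process described in the added comments could be sketched roughly as follows. This is a minimal illustration, not the code in `examples/create_downampled.py`: the function names, the `upload` callback, and the log file layout are all hypothetical. It assumes the caller has already decided which files are fully ready (step 2), and it assumes `gcloud storage cp` is the gcloud subcommand used for the upload.

```python
import shutil
import subprocess
from pathlib import Path
from typing import Callable, Iterable, List, Set


def gcloud_upload(files: List[Path], destination: str) -> None:
    """Copy files to GCS with the gcloud CLI.

    Assumption: `gcloud storage cp SOURCE... DESTINATION` is the command used.
    check=True raises CalledProcessError on a non-zero exit code.
    """
    subprocess.run(
        ["gcloud", "storage", "cp", *map(str, files), destination],
        check=True,
    )


def read_uploaded_names(log_path: Path) -> Set[str]:
    """Return the file names recorded in the uploaded-files text file."""
    if not log_path.exists():
        return set()
    return {ln.strip() for ln in log_path.read_text().splitlines() if ln.strip()}


def check_not_already_uploaded(new_files: Iterable[Path], log_path: Path) -> None:
    """Raise if a freshly written chunk is already listed as uploaded."""
    already = read_uploaded_names(log_path)
    clashes = sorted(p.name for p in new_files if p.name in already)
    if clashes:
        raise RuntimeError(f"Chunks written again after upload: {clashes}")


def upload_ready_files(
    ready_files: Iterable[Path],
    completed_dir: Path,
    log_path: Path,
    destination: str,
    upload: Callable[[List[Path], str], None] = gcloud_upload,
) -> int:
    """Run steps 3 to 5 for files the caller considers fully written."""
    completed_dir.mkdir(parents=True, exist_ok=True)

    # Step 3: move ready files into the folder for completed files.
    moved = []
    for path in ready_files:
        target = completed_dir / path.name
        shutil.move(str(path), str(target))
        moved.append(target)
    if not moved:
        return 0

    # Step 4: upload. Any exception propagates, so the process stops here
    # and nothing is logged or deleted for a failed upload.
    upload(moved, destination)

    # Step 4 (continued): append, never overwrite, the uploaded names.
    with log_path.open("a") as log:
        for path in moved:
            log.write(path.name + "\n")

    # Step 5: delete the local copies to save space.
    for path in moved:
        path.unlink()
    return len(moved)
```

Passing the uploader in as a callback keeps the move, log, and delete logic testable without network access. Because the log is only appended to after a successful upload, a failed run leaves the files in the completed folder for inspection.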
