File tree Expand file tree Collapse file tree 1 file changed +14
-0
lines changed
Expand file tree Collapse file tree 1 file changed +14
-0
lines changed Original file line number Diff line number Diff line change @@ -971,6 +971,20 @@ def upload_any_remaining_chunks():
971971
972972 return uploaded_count
973973
974+ # Upload process
975+ # 1. Write chunks to a local directory that is temporary
976+ # 2. When a processing step is done (or after every N steps), check the output
977+ # files in the temp directory to see which ones are fully ready to be uploaded
978+ # 3. Move these files to a new folder that is for completed files
979+ # 4. Upload the completed files to GCS using gcloud command line tool
980+ # if no errors, then write the names of all the files in that folder to a text file
981+ # ensuring to always append to the text file
982+ # 5. Delete the files in the completed folder to save space
983+ # Separately now during the chunk writing step, we should check if any of the
984+ # files that were created are listed in the uploaded files text file
985+ # because if they are that was a mistake and we should throw an error about this
986+ # If there are errors during the upload step, we need to crash the process
987+ # and check what the error was to know how to proceed from there
974988
975989# %% Move the data across with a single worker
976990total_uploaded_files = 0
You can’t perform that action at this time.
0 commit comments