file_id is a unique identifier for each file processed by a pipeline. file_id = pipeline_id + cloudFile_id Necessary to be able to leverage delete, update and augment capabilities.