Add file_id by default to each vector

file_id is a unique identifier for each file processed by a pipeline. 

file_id = pipeline_id + cloudFile_id

Attaching file_id to every vector is necessary for the delete, update and augment capabilities, since each of them needs to find all vectors that came from a given file.
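
A minimal sketch of the idea, not the actual implementation. It assumes that "+" in the formula above means plain string concatenation and that each vector carries a metadata dict. The names `Vector`, `make_file_id`, `tag_vectors` and `delete_file_vectors` are hypothetical and used only for illustration.

```python
from __future__ import annotations

from dataclasses import dataclass, field


def make_file_id(pipeline_id: str, cloud_file_id: str) -> str:
    # Assumption: "+" means plain string concatenation of the two ids.
    return pipeline_id + cloud_file_id


@dataclass
class Vector:
    # Hypothetical vector record: embedding values plus free-form metadata.
    values: list[float]
    metadata: dict[str, str] = field(default_factory=dict)


def tag_vectors(vectors: list[Vector], pipeline_id: str, cloud_file_id: str) -> list[Vector]:
    # Attach the same file_id to every vector produced from one file.
    file_id = make_file_id(pipeline_id, cloud_file_id)
    for vector in vectors:
        vector.metadata["file_id"] = file_id
    return vectors


def delete_file_vectors(store: list[Vector], file_id: str) -> list[Vector]:
    # Delete by file: keep only the vectors that belong to other files.
    return [v for v in store if v.metadata.get("file_id") != file_id]
```

With file_id present on every vector, deleting a file reduces to filtering on a single metadata field. Update and augment could select their target vectors the same way. If the two ids can vary in length, plain concatenation may be ambiguous, so a real implementation might add a separator between them.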