Skip to content

Conversation

@ssh-meister
Copy link
Collaborator

A common processor that removes specified fields from each data entry in the manifest.

fields_to_drop (List[str]): A list of keys to remove from each manifest entry.
**kwargs: Additional arguments passed to the BaseProcessor (e.g., input/output manifest paths).
Returns:
Copy link
Collaborator

@lilithgrigoryan lilithgrigoryan Jul 21, 2025

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Optionally please add example section to the docstring

    Example:
        .. code-block:: yaml

            - _target_: sdp.processors.modify_manifest.common.DuplicateFields
               input_manifest_file: ${workspace_dir}/test1.json
               output_manifest_file: ${workspace_dir}/test2.json
               duplicate_fields: {"text":"answer"}

@ssh-meister ssh-meister merged commit e60baa5 into main Jul 21, 2025
10 checks passed
@ssh-meister ssh-meister deleted the DropSpecifiedFields branch July 22, 2025 11:30
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

3 participants