Skip to content

Add load parameter to capture skipped rows metadata #191

@cschloer

Description

@cschloer

Hey,

I understand the point of this feature being outside of the scope of tabulator (frictionlessdata/tabulator-py#331). I think it would be an important feature to implement in load. As proposed above:

It takes in a list of dicts, each dict containing a regular expression string with once captured group, and one string that contains a column name. The regular expression is then compared to each skipped row in the data. A new column is created with the column_name as its name and the value in the capture group as its value.

If you don't think this would be useful for the general DPP/dataflows community, let me know and I can implement it in our own custom load processor.

@roll @akariv

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions