Skip to content

[Feature]: Implement AI-based automatic identification of sensitive columns in databases #4489

@fenyf

Description

@fenyf

Is your feature request related to a problem?

Yes. Currently, ODC relies on manually configured rules to identify sensitive columns, which is time-consuming and prone to errors. There is a strong need for a more intelligent solution to improve both accuracy and efficiency.

Describe the solution you'd like

Introduce an AI-based sensitive column identification feature in ODC by integrating pre-trained language models. The system can understand the semantics of column names and predict the presence and risk level of sensitive information. It will support flexible rule configuration, allowing users to define rule-based identification while specifying the priority between rule matching and AI inference.

Additional context

No response

Metadata

Metadata

Assignees

Labels

type-featureThe functionality to be implemented

Type

No type

Projects

Status

New

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions