Is your feature request related to a problem?
Yes. Currently, ODC relies on manually configured rules to identify sensitive columns, which is time-consuming and prone to errors. There is a strong need for a more intelligent solution to improve both accuracy and efficiency.
Describe the solution you'd like
Introduce an AI-based sensitive column identification feature in ODC by integrating pre-trained language models. The system can understand the semantics of column names and predict the presence and risk level of sensitive information. It will support flexible rule configuration, allowing users to define rule-based identification while specifying the priority between rule matching and AI inference.
Additional context
No response