Commit 83a7367

Merge pull request #234929 from ankitscribbles/UpdateWorldCities
Fixed formatting and added keywords for City
2 parents 637d9bb + c54578e commit 83a7367

1 file changed: +22 -6 lines changed

articles/purview/supported-classifications.md

Lines changed: 22 additions & 6 deletions
@@ -6,7 +6,7 @@ ms.author: ankitgup
 ms.service: purview
 ms.subservice: purview-data-map
 ms.topic: reference
-ms.date: 03/07/2023
+ms.date: 04/10/2023
 #Customer intent: As a data steward or catalog administrator, I need to understand what's supported under classifications.
 ---

@@ -24,9 +24,21 @@ Microsoft Purview classifies data by using [RegEx](https://wikipedia.org/wiki/Re

 ## Bloom Filter based classifications

-### City, Country, and Place
+### World Cities, Country

-The City, Country, and Place filters have been prepared using best datasets available for preparing the data.
+The City and Country classifier identifies the data based on their full names as well as short codes.
+
+#### Keywords
+- burg
+- city
+- cities
+- city names
+- cosmopolis
+- metropolis
+- municipality
+- place
+- town
+-------------------------------------

 ## Machine Learning based classifications

@@ -35,10 +47,9 @@ The City, Country, and Place filters have been prepared using best datasets avai

 ### Person's Name

-Person Name machine learning model has been trained using global datasets of names in English language.
+Person Name machine learning model has been trained using global datasets of names in English language. Microsoft Purview classifies full names stored in the same column as well as first and last names in separate columns.

-> [!NOTE]
-> Microsoft Purview classifies full names stored in the same column as well as first/last names in separate columns.
+-------------------------------------

 ### Person's Address
 Person's address classification is used to detect full address stored in a single column containing the following elements: House number, Street Name, City, State, Country, Zip Code. Person's Address classifier uses machine learning model that is trained on the global addresses data set in English language.
@@ -52,6 +63,8 @@ Currently the address model supports the following formats in the same column:
 - street, city, pincode or zipcode
 - landmark, city

+-------------------------------------
+
 ### Person's Gender
 Person's Gender machine learning model has been trained using US Census data and other public data sources in English language. It supports classifying 50+ genders out of the box.

@@ -60,6 +73,7 @@ Person's Gender machine learning model has been trained using US Census data and
 - gender
 - orientation

+-------------------------------------

 ### Person's Age
 Person's Age machine learning model detects age of an individual specified in various different formats. The qualifiers for days, months, and years must be in English language.
@@ -110,6 +124,8 @@ Person's Age machine learning model detects age of an individual specified in va
 - {%y}.{%m}
 - {%y}.{%yd}

+-------------------------------------
+
 ## RegEx Classifications

 ### ABA routing number
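
The changed section files the World Cities and Country classifier under "Bloom Filter based classifications" but does not say how such a filter behaves. The following is a minimal, generic sketch of the idea, not Microsoft Purview's implementation. The class `BloomFilter`, the helper `column_match_ratio`, the bit-array size, the hash count, and the sample city names are all hypothetical and chosen only for illustration. A Bloom filter can report false positives but never false negatives, so a value that was added is always found.

```python
import hashlib


class BloomFilter:
    """Illustrative Bloom filter; sizes and hashing scheme are assumptions."""

    def __init__(self, size_bits=1024, num_hashes=3):
        self.size_bits = size_bits
        self.num_hashes = num_hashes
        self.bits = bytearray(size_bits // 8 + 1)

    def _positions(self, item):
        # Normalize so that "Paris", " paris " and "PARIS" hash identically.
        normalized = item.strip().lower()
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{normalized}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.size_bits

    def add(self, item):
        for pos in self._positions(item):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def __contains__(self, item):
        # True means "possibly present"; False means "definitely absent".
        return all(
            self.bits[pos // 8] & (1 << (pos % 8)) for pos in self._positions(item)
        )


def column_match_ratio(values, bloom):
    """Fraction of non-empty column values that the filter reports as present."""
    values = [v for v in values if v and v.strip()]
    if not values:
        return 0.0
    return sum(1 for v in values if v in bloom) / len(values)


# Hypothetical reference data standing in for a real city dataset.
cities = BloomFilter()
for name in ["Paris", "Tokyo", "Nairobi", "Lima"]:
    cities.add(name)
```

A caller could compare `column_match_ratio(column_values, cities)` against a threshold of its own choosing to decide whether a column looks like city names. How Purview combines filter hits with the keywords listed in the diff is not stated in the source.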
