Skip to content
This repository was archived by the owner on Jun 20, 2023. It is now read-only.

Commit cb4a194

Browse files
authored
Merge pull request #4 from Code-Plus-CUMI/CSFelix-patch-1
📦 Update
2 parents ccb611c + 174c95f commit cb4a194

File tree

1 file changed

+2
-14
lines changed

1 file changed

+2
-14
lines changed

4 - Python/4 - Data Science/2 - Features Engineering/0 - codes/4 - Principal Component Analysis - PCA.py

Lines changed: 2 additions & 14 deletions
Original file line numberDiff line numberDiff line change
@@ -15,20 +15,6 @@
1515
1616
{ image 4 }
1717
18-
Often, we can give names to these axes of variation. The longer
19-
axis we might call the "Size" component: small height and small
20-
diameter (lower left) contrasted with large height and large
21-
diameter (upper right). The shorter axis we might call the "Shape"
22-
component: small height and large diameter (flat shape)
23-
contrasted with large height and small diameter (round shape).
24-
25-
Notice that instead of describing abalones by their 'Height'
26-
and 'Diameter', we could just as well describe them by their
27-
'Size' and 'Shape'. This, in fact, is the whole idea of PCA:
28-
instead of describing the data with the original features, we
29-
describe it with its axes of variation. The axes of variation
30-
become the new features.
31-
3218
These new features are called the principal components of the
3319
data. The weights themselves are called loadings. There will be
3420
as many principal components as there are features in the
@@ -58,6 +44,8 @@
5844
/ when the features are multi-colinear (there's a significant
5945
number of linear correlations between them);
6046
/ when our goal is to apply denoising;
47+
/ when you want to check out whether clusters have similar
48+
properties and attributes
6149
"""
6250

6351
# 0 - Importing libraries, creating functions to plot PCA's

0 commit comments

Comments
 (0)