Emphasize in the sample code's comments that extra packages must not be installed

Bowen-Guo-Microsoft · Bowen-Guo-Microsoft · commit 1af4b8c0acb2 · 2020-04-20T10:58:44.000+08:00
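
The note changed in this commit asks authors to import only pre-installed modules. As a rough local aid, and not something this commit or the module provides, the sketch below uses the standard-library `ast` module to list a script's imports that fall outside an allowlist. The `PREINSTALLED` set and the `find_unlisted_imports` helper are hypothetical; the actual package list is the one documented in Execute Python Script.

```python
import ast

# Hypothetical allowlist for illustration only. The real list of pre-installed
# packages is documented in the Execute Python Script article.
PREINSTALLED = {"pandas", "sklearn"}


def find_unlisted_imports(script_source, allowed=PREINSTALLED):
    """Return the sorted top-level module names the script imports that are not in `allowed`."""
    tree = ast.parse(script_source)
    found = set()
    for node in ast.walk(tree):
        if isinstance(node, ast.Import):
            # "import a.b.c" -> record the top-level package "a".
            for alias in node.names:
                found.add(alias.name.split(".")[0])
        elif isinstance(node, ast.ImportFrom):
            # Skip relative imports (level > 0); they do not name an external package.
            if node.level == 0 and node.module:
                found.add(node.module.split(".")[0])
    return sorted(found - set(allowed))


script = """
import pandas as pd
from sklearn.naive_bayes import GaussianNB
import xgboost
"""
print(find_unlisted_imports(script))  # prints ['xgboost']
```

The check is purely static: it reports standard-library modules too unless they are added to the allowlist, and it does not detect packages installed at run time through calls such as `os.system`.
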
diff --git a/articles/machine-learning/algorithm-module-reference/create-python-model.md b/articles/machine-learning/algorithm-module-reference/create-python-model.md
@@ -28,11 +28,10 @@ After you create the model, you can use [Train Model](train-model.md) to train t
 Use of this module requires intermediate or expert knowledge of Python. The module supports use of any learner that's included in the Python packages already installed in Azure Machine Learning. See the preinstalled Python package list in [Execute Python Script](execute-python-script.md).
 
 > [!NOTE]
-> Please be very careful when writing your script and makes sure there is no syntax error, such as using a un-declared object or a un-imported module. Also pay extra attentions to the pre-installed modules list in [Execute Python Script](execute-python-script.md). To import modules which are not listed, please install the corresponding packages in your script such as
-> ```Python
-> import os
-> os.system(f"pip install scikit-misc")
-> ```
+> Be very careful when writing your script and make sure it contains no errors, such as using an undeclared object or a module that has not been imported.
+
+> [!NOTE]
+> Also pay close attention to the list of pre-installed modules in [Execute Python Script](execute-python-script.md). Import only pre-installed modules. Do not install extra packages in this script (for example, with "pip install xgboost"); otherwise, errors will be raised when downstream modules read the model.
   
 This article shows how to use **Create Python Model** with a simple pipeline. Here's a diagram of the pipeline:
 
@@ -54,7 +53,9 @@ This article shows how to use **Create Python Model** with a simple pipeline. He
        # predict: which generates prediction result, the input argument and the prediction result MUST be pandas DataFrame.
    # The signatures (method names and argument names) of all these methods MUST be exactly the same as the following example.
 
-
+   # Do not install extra packages in this script (for example, with "pip install xgboost");
+   # otherwise, errors will be raised when downstream modules read the model.
+   
    import pandas as pd
    from sklearn.naive_bayes import GaussianNB
 
@@ -65,10 +66,15 @@ This article shows how to use **Create Python Model** with a simple pipeline. He
            self.feature_column_names = list()
 
        def train(self, df_train, df_label):
+           # self.feature_column_names records the column names used for training.
+           # It is recommended to set this attribute before training so that the
+           # feature columns used in predict and train methods have the same names.
            self.feature_column_names = df_train.columns.tolist()
            self.model.fit(df_train, df_label)
 
        def predict(self, df):
+           # The feature columns used for prediction MUST have the same names as the ones used for training.
+           # The name of the score column ("Scored Labels" in this case) MUST be different from every other column name in the input data.
            return pd.DataFrame(
                {'Scored Labels': self.model.predict(df[self.feature_column_names]), 
                 'probabilities': self.model.predict_proba(df[self.feature_column_names])[:, 1]}