|
205 | 205 | "id": "xNxJ41MafiB-"
|
206 | 206 | },
|
207 | 207 | "source": [
|
208 |
| - "If your data has a uniform datatype, or `dtype`, it's possible use a pandas DataFrame anywhere you could use a NumPy array. This works because the `pandas.DataFrame` class supports the `__array__` protocol, and TensorFlow's `tf.convert_to_tensor` function accepts objects that support the protocol.\n", |
| 208 | + "If your data has a uniform datatype, or `dtype`, it's possible to use a pandas DataFrame anywhere you could use a NumPy array. This works because the `pandas.DataFrame` class supports the `__array__` protocol, and TensorFlow's `tf.convert_to_tensor` function accepts objects that support the protocol.\n", |
209 | 209 | "\n",
|
210 | 210 | "Take the numeric features from the dataset (skip the categorical features for now):"
|
211 | 211 | ]
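The `__array__` behavior described in this hunk can be sketched with NumPy alone (the column names are illustrative, not the tutorial's full dataset); `tf.convert_to_tensor` accepts the DataFrame through the same protocol:

```python
import numpy as np
import pandas as pd

# A DataFrame with a single uniform dtype (float64 for every column).
df = pd.DataFrame({"age": [63.0, 67.0], "thalach": [150.0, 108.0]})

# Because pandas.DataFrame implements the __array__ protocol, np.asarray
# consumes it directly; tf.convert_to_tensor accepts it the same way.
arr = np.asarray(df)
print(arr.shape, arr.dtype)  # (2, 2) float64
```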
|
|
428 | 428 | "id": "NQcp7kiPF8TP"
|
429 | 429 | },
|
430 | 430 | "source": [
|
431 |
| - "When you start dealing with heterogenous data, it is no longer possible to treat the DataFrame as if it were a single array. TensorFlow tensors require that all elements have the same `dtype`.\n", |
| 431 | + "When you start dealing with heterogeneous data, it is no longer possible to treat the DataFrame as if it were a single array. TensorFlow tensors require that all elements have the same `dtype`.\n", |
432 | 432 | "\n",
|
433 |
| - "So, in this case, you need to start treating it as a dictionary of columns, where each column has a uniform dtype. A DataFrame is a lot like a dictionary of arrays, so typically all you need to do is cast the DataFrame to a Python dict. Many important TensorFlow APIs support (nested-)dictionaries of arrays as inputs." |
| 433 | + "So, in this case, you need to start treating it as a dictionary of columns, where each column has a uniform `dtype`. A DataFrame is a lot like a dictionary of arrays, so typically all you need to do is cast the DataFrame to a Python dict. Many important TensorFlow APIs support (nested-)dictionaries of arrays as inputs." |
434 | 434 | ]
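The "dictionary of columns" idea above amounts to one `dict` call; a minimal sketch with illustrative columns:

```python
import pandas as pd

# A heterogeneous DataFrame: integer, float, and string columns.
df = pd.DataFrame({"age": [63, 67],
                   "oldpeak": [2.3, 1.5],
                   "thal": ["fixed", "normal"]})

# Casting to a plain Python dict yields one column per key, and each
# column has its own uniform dtype.
columns = dict(df)
print({name: str(col.dtype) for name, col in columns.items()})
```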
|
435 | 435 | },
|
436 | 436 | {
|
|
491 | 491 | "source": [
|
492 | 492 | "Typically, Keras models and layers expect a single input tensor, but these classes can accept and return nested structures of dictionaries, tuples and tensors. These structures are known as \"nests\" (refer to the `tf.nest` module for details).\n",
|
493 | 493 | "\n",
|
494 |
| - "There are two equivalent ways you can write a keras model that accepts a dictionary as input." |
| 494 | + "There are two equivalent ways you can write a Keras model that accepts a dictionary as input." |
495 | 495 | ]
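One of those two ways (model subclassing) can be sketched as follows; the feature names and the stack-then-dense body are placeholders, not the tutorial's exact model:

```python
import tensorflow as tf

# A minimal sketch of a subclassed Keras model whose call() accepts a
# dictionary of tensors. Feature names are illustrative.
class DictModel(tf.keras.Model):
    def __init__(self):
        super().__init__()
        self.dense = tf.keras.layers.Dense(1)

    def call(self, inputs):
        # Stack the dictionary's values into one [batch, features] tensor.
        stacked = tf.stack(
            [tf.cast(v, tf.float32) for v in inputs.values()], axis=-1)
        return self.dense(stacked)

model = DictModel()
out = model({"age": tf.constant([63.0, 67.0]),
             "thalach": tf.constant([150.0, 108.0])})
print(out.shape)  # (2, 1)
```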
|
496 | 496 | },
|
497 | 497 | {
|
|
545 | 545 | " ])\n",
|
546 | 546 | "\n",
|
547 | 547 | " def adapt(self, inputs):\n",
|
548 |
| - " # Stach the inputs and `adapt` the normalization layer.\n", |
| 548 | + " # Stack the inputs and `adapt` the normalization layer.\n", |
549 | 549 | " inputs = stack_dict(inputs)\n",
|
550 | 550 | " self.normalizer.adapt(inputs)\n",
|
551 | 551 | "\n",
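The `stack_dict` helper called in this hunk is defined elsewhere in the notebook; a plausible implementation, consistent with how it is used here (sorted keys for a deterministic column order, cast to a common float dtype, stack along the last axis), is:

```python
import tensorflow as tf

# Plausible sketch of the stack_dict helper: sort feature names so the
# column order is deterministic, cast each column to float32, and stack
# them into one [batch, num_features] tensor.
def stack_dict(inputs, fun=tf.stack):
    values = []
    for key in sorted(inputs.keys()):
        values.append(tf.cast(inputs[key], tf.float32))
    return fun(values, axis=-1)

features = {"b": tf.constant([1, 2]), "a": tf.constant([3.0, 4.0])}
print(stack_dict(features).shape)  # (2, 2); columns ordered a, b
```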
|
|
728 | 728 | "id": "zYQ5fDaRxRWQ"
|
729 | 729 | },
|
730 | 730 | "source": [
|
731 |
| - "It you're passing a heterogenous `DataFrame` to Keras, each column may need unique preprocessing. You could do this preprocessing directly in the DataFrame, but for a model to work correctly, inputs always need to be preprocessed the same way. So, the best approach is to build the preprocessing into the model. [Keras preprocessing layers](https://www.tensorflow.org/guide/keras/preprocessing_layers) cover many common tasks." |
| 731 | + "If you're passing a heterogeneous DataFrame to Keras, each column may need unique preprocessing. You could do this preprocessing directly in the DataFrame, but for a model to work correctly, inputs always need to be preprocessed the same way. So, the best approach is to build the preprocessing into the model. [Keras preprocessing layers](https://www.tensorflow.org/guide/keras/preprocessing_layers) cover many common tasks." |
732 | 732 | ]
|
733 | 733 | },
|
734 | 734 | {
|
|
746 | 746 | "id": "C6aVQN4Gw-Va"
|
747 | 747 | },
|
748 | 748 | "source": [
|
749 |
| - "In this dataset some of the \"integer\" features in the raw data are actually Categorical indices. These indices are not really ordered numeric values (refer to the <a href=\"https://archive.ics.uci.edu/ml/datasets/heart+Disease\" class=\"external\">the dataset description</a> for details). Because these are unordered they are inapropriate to feed directly to the model; the model would interpret them as being ordered. To use these inputs you'll need to encode them, either as one-hot vectors or embedding vectors. The same applies to string-categorical features.\n", |
| 749 | + "In this dataset, some of the \"integer\" features in the raw data are actually categorical indices. These indices are not really ordered numeric values (refer to the <a href=\"https://archive.ics.uci.edu/ml/datasets/heart+Disease\" class=\"external\">dataset description</a> for details). Because these are unordered, they are inappropriate to feed directly to the model; the model would interpret them as being ordered. To use these inputs you'll need to encode them, either as one-hot vectors or embedding vectors. The same applies to string-categorical features.\n", |
750 | 750 | "\n",
|
751 |
| - "Note: If you have many features that need identical preprocessing it's more efficient to concatenate them together befofre applying the preprocessing.\n", |
| 751 | + "Note: If you have many features that need identical preprocessing it's more efficient to concatenate them together before applying the preprocessing.\n", |
752 | 752 | "\n",
|
753 | 753 | "Binary features, on the other hand, do not generally need to be encoded or normalized.\n",
|
754 | 754 | "\n",
|
|
783 | 783 | "id": "HRcC8WkyamJb"
|
784 | 784 | },
|
785 | 785 | "source": [
|
786 |
| - "The next step is to build a preprocessing model that will apply apropriate preprocessing to each to each input and concatenate the results.\n", |
| 786 | + "The next step is to build a preprocessing model that will apply appropriate preprocessing to each input and concatenate the results.\n", |
787 | 787 | "\n",
|
788 | 788 | "This section uses the [Keras Functional API](https://www.tensorflow.org/guide/keras/functional) to implement the preprocessing. You start by creating one `tf.keras.Input` for each column of the DataFrame:"
|
789 | 789 | ]
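Building one `tf.keras.Input` per column can be sketched like this; the column names and dtypes are assumptions, standing in for the full dataset:

```python
import tensorflow as tf

# Illustrative: one symbolic Input per DataFrame column. Numeric columns
# get float32; string-categorical columns get tf.string.
inputs = {}
for name, dtype in [("age", tf.float32), ("thal", tf.string)]:
    inputs[name] = tf.keras.Input(shape=(1,), name=name, dtype=dtype)
print(sorted(inputs))  # ['age', 'thal']
```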
|
|
926 | 926 | "id": "Z3wcFs1oKVao"
|
927 | 927 | },
|
928 | 928 | "source": [
|
929 |
| - "To use categorical features you'll first need to encode them into either binary vectors or embeddings. Since these features only contain a small number of categories, convert the inputs directly to one-hot vectors using the `output_mode='one_hot'` option, supported byy both the `tf.keras.layers.StringLookup` and `tf.keras.layers.IntegerLookup` layers.\n", |
| 929 | + "To use categorical features you'll first need to encode them into either binary vectors or embeddings. Since these features only contain a small number of categories, convert the inputs directly to one-hot vectors using the `output_mode='one_hot'` option, supported by both the `tf.keras.layers.StringLookup` and `tf.keras.layers.IntegerLookup` layers.\n", |
930 | 930 | "\n",
|
931 | 931 | "Here is an example of how these layers work:"
|
932 | 932 | ]
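Outside the notebook, the one-hot behavior of a lookup layer can be sketched as follows (the vocabulary is illustrative):

```python
import tensorflow as tf

# StringLookup with output_mode='one_hot'. Index 0 is reserved for
# out-of-vocabulary tokens, so the output width is len(vocabulary) + 1.
lookup = tf.keras.layers.StringLookup(
    vocabulary=["fixed", "normal", "reversible"], output_mode="one_hot")
encoded = lookup(tf.constant(["normal", "fixed"]))
print(encoded.shape)  # (2, 4)
```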
|
|