|
120 | 120 | "id": "ny5TEgcmHjVx"
|
121 | 121 | },
|
122 | 122 | "source": [
|
123 |
| - "For any small CSV dataset the simplest way to train a TensorFlow model on it is to load it into memory as a pandas Dataframe or a NumPy array.\n" |
| 123 | + "For any small CSV dataset the simplest way to train a TensorFlow model on it is to load it into memory as a [pandas `DataFrame`](https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.html) or a NumPy array.\n" |
124 | 124 | ]
|
125 | 125 | },
|
126 | 126 | {
|
|
132 | 132 | "A relatively simple example is the [abalone dataset](https://archive.ics.uci.edu/ml/datasets/abalone).\n",
|
133 | 133 | "\n",
|
134 | 134 | "* The dataset is small.\n",
|
135 |
| - "* All the input features are all limited-range floating point values.\n", |
| 135 | + "* All the input features are limited-range floating point values.\n", |
136 | 136 | "\n",
|
137 |
| - "Here is how to download the data into a [pandas `DataFrame`](https://pandas.pydata.org/pandas-docs/stable/reference/api/pandas.DataFrame.html):" |
| 137 | + "Here is how to download the data into a `DataFrame`:" |
138 | 138 | ]
|
139 | 139 | },
|
140 | 140 | {
|
|
355 | 355 | "source": [
|
356 | 356 | "## Mixed data types\n",
|
357 | 357 | "\n",
|
| 358 | + "In the previous sections, you worked with a dataset where all the features were limited-range floating point values. But not all datasets are limited to a single data type.\n", |
| 359 | + "\n", |
358 | 360 | "The \"Titanic\" dataset contains information about the passengers on the Titanic. The nominal task on this dataset is to predict who survived.\n",
|
359 | 361 | "\n",
|
360 | 362 | "\n",
|
|
903 | 905 | "source": [
|
904 | 906 | "### From a single file\n",
|
905 | 907 | "\n",
|
906 |
| - "So far this tutorial has worked with in-memory data. `tf.data` is a highly scalable toolkit for building data pipelines, and provides a few functions for dealing loading CSV files. " |
| 908 | + "So far this tutorial has worked with in-memory data. `tf.data` is a highly scalable toolkit for building data pipelines, and provides a few functions for loading CSV files. " |
907 | 909 | ]
|
908 | 910 | },
|
909 | 911 | {
|
|
1373 | 1375 | "id": "3jiGZeUijJNd"
|
1374 | 1376 | },
|
1375 | 1377 | "source": [
|
1376 |
| - "So far this tutorial has focused on the highest-level utilities for reading csv data. There are other two APIs that may be helpful for advanced users if your use-case doesn't fit the basic patterns.\n", |
| 1378 | + "So far this tutorial has focused on the highest-level utilities for reading csv data. There are two other APIs that may be helpful for advanced users if your use-case doesn't fit the basic patterns.\n", |
1377 | 1379 | "\n",
|
1378 | 1380 | "* `tf.io.decode_csv`: a function for parsing lines of text into a list of CSV column tensors.\n",
|
1379 | 1381 | "* `tf.data.experimental.CsvDataset`: a lower-level CSV dataset constructor.\n",
|
|
0 commit comments