Commit c122ccf

PiperOrigin-RevId: 499589900
1 parent 881afa3 commit c122ccf

File tree

1 file changed (+6, -4 lines)


site/en/tutorials/quickstart/beginner.ipynb

Lines changed: 6 additions & 4 deletions
@@ -125,7 +125,7 @@
     "\n",
     "## Load a dataset\n",
     "\n",
-    "Load and prepare the [MNIST dataset](http://yann.lecun.com/exdb/mnist/). Convert the sample data from integers to floating-point numbers:"
+    "Load and prepare the [MNIST dataset](http://yann.lecun.com/exdb/mnist/). The pixel values of the images range from 0 through 255. Scale these values to a range of 0 to 1 by dividing the values by `255.0`. This also converts the sample data from integers to floating-point numbers:"
     ]
    },
    {
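The scaling described in the new wording can be sketched with NumPy (a minimal stand-in for the notebook's MNIST arrays; the `x_train` data here is synthetic, not the real dataset):

```python
import numpy as np

# Synthetic stand-in for MNIST images: uint8 pixels in [0, 255].
x_train = np.random.randint(0, 256, size=(4, 28, 28), dtype=np.uint8)

# Dividing by 255.0 rescales the values to [0.0, 1.0] and, because the
# divisor is a float, also promotes the integer array to floating point.
x_train = x_train / 255.0

print(x_train.dtype)  # float64
```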
@@ -150,7 +150,7 @@
     "source": [
     "## Build a machine learning model\n",
     "\n",
-    "Build a `tf.keras.Sequential` model by stacking layers."
+    "Build a `tf.keras.Sequential` model:"
     ]
    },
    {
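A `tf.keras.Sequential` model of the kind this cell describes can be sketched as follows. The specific layer sizes (128 units, dropout rate 0.2) are illustrative assumptions, not taken from the diff:

```python
import tensorflow as tf

# Stack of layers, each with one input tensor and one output tensor:
# Flatten 28x28 images to vectors, then Dense -> Dropout -> Dense(10 logits).
model = tf.keras.Sequential([
    tf.keras.layers.Flatten(input_shape=(28, 28)),
    tf.keras.layers.Dense(128, activation='relu'),
    tf.keras.layers.Dropout(0.2),
    tf.keras.layers.Dense(10)
])
```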
@@ -175,6 +175,8 @@
     "id": "l2hiez2eIUz8"
    },
    "source": [
+    "[`Sequential`](https://www.tensorflow.org/guide/keras/sequential_model) is useful for stacking layers where each layer has one input [tensor](https://www.tensorflow.org/guide/tensor) and one output tensor. Layers are functions with a known mathematical structure that can be reused and have trainable variables. Most TensorFlow models are composed of layers. This model uses the [`Flatten`](https://www.tensorflow.org/api_docs/python/tf/keras/layers/Flatten), [`Dense`](https://www.tensorflow.org/api_docs/python/tf/keras/layers/Dense), and [`Dropout`](https://www.tensorflow.org/api_docs/python/tf/keras/layers/Dropout) layers.\n",
+    "\n",
     "For each example, the model returns a vector of [logits](https://developers.google.com/machine-learning/glossary#logits) or [log-odds](https://developers.google.com/machine-learning/glossary#log-odds) scores, one for each class."
     ]
    },
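The relationship between the per-class logits mentioned in the added text and class probabilities can be sketched with NumPy (the logit values below are made up for illustration):

```python
import numpy as np

# Hypothetical logits for one example over 10 classes,
# as a 10-unit output layer would produce.
logits = np.array([0.2, -1.3, 0.5, 2.1, 0.0, -0.7, 1.1, 0.3, -0.2, 0.9])

# Softmax converts logits (log-odds scores) to class probabilities
# that are non-negative and sum to 1.
probs = np.exp(logits) / np.exp(logits).sum()

# The largest logit maps to the largest probability.
print(probs.argmax())  # 3
```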
@@ -225,7 +227,7 @@
     "id": "hQyugpgRIyrA"
    },
    "source": [
-    "Define a loss function for training using `losses.SparseCategoricalCrossentropy`, which takes a vector of logits and a `True` index and returns a scalar loss for each example."
+    "Define a loss function for training using `losses.SparseCategoricalCrossentropy`:"
     ]
    },
    {
@@ -245,7 +247,7 @@
     "id": "SfR4MsSDU880"
    },
    "source": [
-    "This loss is equal to the negative log probability of the true class: The loss is zero if the model is sure of the correct class.\n",
+    "The loss function takes a vector of ground truth values and a vector of logits and returns a scalar loss for each example. This loss is equal to the negative log probability of the true class: The loss is zero if the model is sure of the correct class.\n",
     "\n",
     "This untrained model gives probabilities close to random (1/10 for each class), so the initial loss should be close to `-tf.math.log(1/10) ~= 2.3`."
     ]
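The `-tf.math.log(1/10) ~= 2.3` claim can be checked with a small NumPy sketch of sparse categorical cross-entropy computed from logits (the helper function and the all-zero logits standing in for an untrained model are illustrative, not the Keras implementation):

```python
import numpy as np

def sparse_categorical_crossentropy(y_true, logits):
    """Per-example loss: -log(softmax(logits)[true class])."""
    probs = np.exp(logits) / np.exp(logits).sum(axis=-1, keepdims=True)
    return -np.log(probs[np.arange(len(y_true)), y_true])

# Identical logits for all 10 classes give ~1/10 probability per class,
# so the loss is -log(1/10) ~= 2.3 regardless of the true label.
logits = np.zeros((2, 10))
y_true = np.array([3, 7])

loss = sparse_categorical_crossentropy(y_true, logits)
print(loss)  # both entries ~= 2.3026
```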
