" <img src=\"https://i.ibb.co/xfJbPmL/github.png\" height=\"70px\" style=\"padding-bottom:5px;\" />View Source on GitHub</a></td>\n",
17
17
"</table>\n",
18
18
"\n",
@@ -27,8 +27,8 @@
 },
 "outputs": [],
 "source": [
-"# Copyright 2023 MIT Introduction to Deep Learning. All Rights Reserved.\n",
-"#\n",
+"# Copyright 2024 MIT Introduction to Deep Learning. All Rights Reserved.\n",
+"#\n",
 "# Licensed under the MIT License. You may not use this file except in compliance\n",
 "# with the License. Use and/or modification of this code outside of MIT Introduction\n",
 "# to Deep Learning must reference:\n",
@@ -53,7 +53,7 @@
 "\n",
 "## 0.1 Install TensorFlow\n",
 "\n",
-"TensorFlow is a software library extensively used in machine learning. Here we'll learn how computations are represented and how to define a simple neural network in TensorFlow. For all the labs in Introduction to Deep Learning 2023, we'll be using the latest version of TensorFlow, TensorFlow 2, which affords great flexibility and the ability to imperatively execute operations, just like in Python. You'll notice that TensorFlow 2 is quite similar to Python in its syntax and imperative execution. Let's install TensorFlow and a couple of dependencies.\n"
+"TensorFlow is a software library extensively used in machine learning. Here we'll learn how computations are represented and how to define a simple neural network in TensorFlow. For all the labs in Introduction to Deep Learning 2024, we'll be using the latest version of TensorFlow, TensorFlow 2, which affords great flexibility and the ability to imperatively execute operations, just like in Python. You'll notice that TensorFlow 2 is quite similar to Python in its syntax and imperative execution. Let's install TensorFlow and a couple of dependencies.\n"
 ]
 },
 {
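As a quick sanity check on the imperative-execution claim in this cell, here is a minimal sketch (assuming a standard TensorFlow 2 install; the lab's install cell may pin specific versions):

```python
# Minimal check that TensorFlow 2 is installed and running eagerly.
import tensorflow as tf

print(tf.__version__)          # should print a 2.x version
print(tf.executing_eagerly())  # True: operations execute imperatively

# Eager execution means ops return concrete values immediately, like plain Python:
a = tf.constant(2) + tf.constant(3)
print(a.numpy())               # 5
```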
@@ -77,25 +77,29 @@
 },
 {
 "cell_type": "markdown",
-"metadata": {},
+"metadata": {
+"id": "nrWxnP8Cn0En"
+},
 "source": [
-"## 0.2 Set Up Comet\n",
+"## 0.2 Set Up Comet ML\n",
 "\n",
-"When training models, it can be useful to visualize information about the model with plots. We can do this manually, but here, we'll show you how to do this using a tool called Comet, which generates loss and GPU usage curves for you.\n",
+"When training models, it can be useful to visualize information about the model with plots. We can do this manually, but here, we'll show you how to do this using a tool called [Comet ML](https://www.comet.com/docs/v2/), which generates loss and GPU usage curves for you. As you'll see, Comet also enables easy saving of your experiments to a central interface.\n",
 "\n",
 "First, sign up for a Comet account [at this link](https://www.comet.com/signup?utm_source=mit_dl&utm_medium=partner&utm_content=github\n",
-") (you can use your Google or GitHub account). Running this cell will prompt you to enter your API Key (which you can find by pressing the '?' in the top right corner and then 'Quickstart Guide' - it is on the right hand side of the page)."
+") (you can use your Google or GitHub account). Running this cell will prompt you to enter your API Key, which you can find either in the first 'Get Started with Comet' page or by pressing the '?' in the top right corner and then 'Quickstart Guide' -- it is on the right hand side of the page."
"# Use tf.zeros to initialize a 4-d Tensor of zeros with size 10 x 256 x 256 x 3.\n",
196
+
"# Use tf.zeros to initialize a 4-d Tensor of zeros with size 10 x 256 x 256 x 3.\n",
193
197
"# You can think of this as 10 images where each image is RGB 256 x 256.\n",
194
198
"images = tf.zeros([10, 256, 256, 3]) # TODO\n",
195
199
"# images = # TODO\n",
@@ -339,7 +343,7 @@
 "## 1.3 Neural networks in TensorFlow\n",
 "We can also define neural networks in TensorFlow. TensorFlow uses a high-level API called [Keras](https://www.tensorflow.org/guide/keras) that provides a powerful, intuitive framework for building and training deep learning models.\n",
 "\n",
-"Let's first consider the example of a simple perceptron defined by just one dense layer: $ y = \\sigma(Wx + b)$, where $W$ represents a matrix of weights, $b$ is a bias, $x$ is the input, $\\sigma$ is the sigmoid activation function, and $y$ is the output. We can also visualize this operation using a graph:\n",
+"Let's first consider the example of a simple perceptron defined by just one dense layer: $ y = \\sigma(Wx + b)$, where $W$ represents a matrix of weights, $b$ is a bias, $x$ is the input, $\\sigma$ is the sigmoid activation function, and $y$ is the output. We can also visualize this operation using a graph:\n",
"Conveniently, TensorFlow has defined a number of ```Layers``` that are commonly used in neural networks, for example a [```Dense```](https://www.tensorflow.org/api_docs/python/tf/keras/layers/Dense?version=stable). Now, instead of using a single ```Layer``` to define our simple neural network, we'll use the [`Sequential`](https://www.tensorflow.org/versions/r2.0/api_docs/python/tf/keras/Sequential) model from Keras and a single [`Dense` ](https://www.tensorflow.org/versions/r2.0/api_docs/python/tf/keras/layers/Dense) layer to define our network. With the `Sequential` API, you can readily create neural networks by stacking together layers like building blocks."
407
+
"Conveniently, TensorFlow has defined a number of ```Layers``` that are commonly used in neural networks, for example a [```Dense```](https://www.tensorflow.org/api_docs/python/tf/keras/layers/Dense?version=stable). Now, instead of using a single ```Layer``` to define our simple neural network, we'll use the [`Sequential`](https://www.tensorflow.org/versions/r2.0/api_docs/python/tf/keras/Sequential) model from Keras and a single [`Dense` ](https://www.tensorflow.org/versions/r2.0/api_docs/python/tf/keras/layers/Dense) layer to define our network. With the `Sequential` API, you can readily create neural networks by stacking together layers like building blocks."
404
408
]
405
409
},
406
410
{
@@ -420,12 +424,12 @@
 "# Define the number of outputs\n",
 "n_output_nodes = 3\n",
 "\n",
-"# First define the model\n",
+"# First define the model\n",
 "model = Sequential()\n",
 "\n",
 "'''TODO: Define a dense (fully connected) layer to compute z'''\n",
 "# Remember: dense layers are defined by the parameters W and b!\n",
-"# You can read more about the initialization of W and b in the TF documentation :)\n",
+"# You can read more about the initialization of W and b in the TF documentation :)\n",
"is one of the most important parts of TensorFlow and is the backbone of training with\n",
615
-
"[backpropagation](https://en.wikipedia.org/wiki/Backpropagation). We will use the TensorFlow GradientTape [`tf.GradientTape`](https://www.tensorflow.org/api_docs/python/tf/GradientTape?version=stable) to trace operations for computing gradients later.\n",
618
+
"is one of the most important parts of TensorFlow and is the backbone of training with\n",
619
+
"[backpropagation](https://en.wikipedia.org/wiki/Backpropagation). We will use the TensorFlow GradientTape [`tf.GradientTape`](https://www.tensorflow.org/api_docs/python/tf/GradientTape?version=stable) to trace operations for computing gradients later.\n",
616
620
"\n",
617
621
"When a forward pass is made through the network, all forward-pass operations get recorded to a \"tape\"; then, to compute the gradient, the tape is played backwards. By default, the tape is discarded after it is played backwards; this means that a particular `tf.GradientTape` can only\n",
618
-
"compute one gradient, and subsequent calls throw a runtime error. However, we can compute multiple gradients over the same computation by creating a ```persistent``` gradient tape.\n",
622
+
"compute one gradient, and subsequent calls throw a runtime error. However, we can compute multiple gradients over the same computation by creating a ```persistent``` gradient tape.\n",
619
623
"\n",
620
624
"First, we will look at how we can compute gradients using GradientTape and access them for computation. We define the simple function $ y = x^2$ and compute the gradient:"
621
625
]
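Both behaviors described here, the one-shot tape and the persistent tape, fit in a few runnable lines:

```python
import tensorflow as tf

x = tf.Variable(3.0)

# Default tape: usable for a single gradient computation.
with tf.GradientTape() as tape:
    y = x * x                    # y = x^2
print(tape.gradient(y, x))       # dy/dx = 2x = 6.0

# Persistent tape: can be replayed for multiple gradients.
with tf.GradientTape(persistent=True) as tape:
    y = x * x                    # y = x^2
    z = y * y                    # z = x^4
print(tape.gradient(y, x))       # 2x   = 6.0
print(tape.gradient(z, x))       # 4x^3 = 108.0
del tape                         # free the persistent tape's resources
```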
@@ -678,7 +682,7 @@
 "# Define the target value\n",
 "x_f = 4\n",
 "\n",
-"# We will run SGD for a number of iterations. At each iteration, we compute the loss,\n",
+"# We will run SGD for a number of iterations. At each iteration, we compute the loss,\n",
 "# compute the derivative of the loss with respect to x, and perform the SGD update.\n",
 "for i in range(500):\n",
 "  with tf.GradientTape() as tape:\n",
@@ -687,7 +691,7 @@
 "    # loss = # TODO\n",
 "\n",
 "    # Here's where we're going to use Comet! We'll log our loss values into the experiment like so:\n",
 "  grad = tape.gradient(loss, x) # compute the derivative of the loss with respect to x\n",
@@ -710,7 +714,9 @@
 "id": "pC7czCwk3ceH"
 },
 "source": [
-"`GradientTape` provides an extremely flexible framework for automatic differentiation. In order to back propagate errors through a neural network, we track forward passes on the Tape, use this information to determine the gradients, and then use these gradients for optimization using SGD."
+"`GradientTape` provides an extremely flexible framework for automatic differentiation. In order to back propagate errors through a neural network, we track forward passes on the Tape, use this information to determine the gradients, and then use these gradients for optimization using SGD.\n",
+"\n",
+"In the above code block, you also saw how you can directly log metrics, like loss values, to Comet. These are then retained within your profile's web interface, where you are able to visualize the results of your different training runs!"