
Commit efcbcc8 (1 parent: 7e86a35)

continuing latex debug

1 file changed: +2 -29 lines

lab2/Part2_Debiasing.ipynb

Lines changed: 2 additions & 29 deletions
@@ -502,29 +502,6 @@
     "where $c$ is a weighting coefficient used for regularization. Now we're ready to define our VAE loss function:"
    ]
   },
-  {
-   "cell_type": "markdown",
-   "metadata": {
-    "id": "qWxOCPgvv1lf"
-   },
-   "source": [
-    "The equations for both of these losses are provided below.\r\n",
-    "\r\n",
-    "\r\n",
-    "\r\n",
-    "$$L_{KL}(\\mu, \\sigma) = \\frac{1}{2}\\sum\\limits_{j=0}^{k-1}\\small{(\\sigma_j + \\mu_j^2 - 1 - \\log{\\sigma_j})}$$\r\n",
-    "\r\n",
-    "$$L_{x}{(x,\\hat{x})} = ||x-\\hat{x}||_1$$\r\n",
-    "\r\n",
-    "Thus for the VAE loss we have: \r\n",
-    "\r\n",
-    "$$L_{VAE} = c\\cdot L_{KL} + L_{x}{(x,\\hat{x})}$$\r\n",
-    "\r\n",
-    "where $c$ is a weighting coefficient used for regularization. \r\n",
-    "\r\n",
-    "Now we're ready to define our VAE loss function:"
-   ]
-  },
   {
    "cell_type": "code",
    "metadata": {
@@ -581,9 +558,7 @@
    "\n",
    "As you may recall from lecture, VAEs use a \"reparameterization trick\" for sampling learned latent variables. Instead of the VAE encoder generating a single vector of real numbers for each latent variable, it generates a vector of means and a vector of standard deviations that are constrained to roughly follow Gaussian distributions. We then sample from the standard deviations and add back the mean to output this as our sampled latent vector. Formalizing this for a latent variable $z$ where we sample $\\epsilon \\sim \\mathcal{N}(0,(I))$ we have: \n",
    "\n",
-   "\\begin{equation}\n",
-   "z = \\mathbb{\\mu} + e^{\\left(\\frac{1}{2} \\cdot \\log{\\Sigma}\\right)}\\circ \\epsilon\n",
-   "\\end{equation}\n",
+   "$$z = \\mu + e^{\\left(\\frac{1}{2} \\cdot \\log{\\Sigma}\\right)}\\circ \\epsilon$$\n",
    "\n",
    "where $\\mu$ is the mean and $\\Sigma$ is the covariance matrix. This is useful because it will let us neatly define the loss function for the VAE, generate randomly sampled latent variables, achieve improved network generalization, **and** make our complete VAE network differentiable so that it can be trained via backpropagation. Quite powerful!\n",
    "\n",
@@ -667,9 +642,7 @@
    "\n",
    "We can write a single expression for the loss by defining an indicator variable $\\mathcal{I}_f$which reflects which training data are images of faces ($\\mathcal{I}_f(y) = 1$ ) and which are images of non-faces ($\\mathcal{I}_f(y) = 0$). Using this, we obtain:\n",
    "\n",
-   "\\begin{equation}\n",
-   "L_{total} = L_y(y,\\hat{y}) + \\mathcal{I}_f(y)\\Big[L_{VAE}\\Big]\n",
-   "\\end{equation}\n",
+   "$$L_{total} = L_y(y,\\hat{y}) + \\mathcal{I}_f(y)\\Big[L_{VAE}\\Big]$$\n",
    "\n",
    "Let's write a function to define the DB-VAE loss function:\n",
    "\n"
