Destructure jvp output for clarity; pass grad argnums explicitly
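
For reviewers without the Tesseract image at hand, here is a minimal sketch of the call patterns the notebook now uses: the destructured `jax.jvp` result and `jax.grad` with explicit `argnums`. `toy_operation` is a hypothetical pure-JAX stand-in, not the notebook's `fancy_operation`, and the input values are assumptions chosen to mirror the notebook's pytree layout (`a` has a `v` entry, `b` has `s` and `v`).

```python
import jax
import jax.numpy as jnp


def toy_operation(a, b):
    # Hypothetical stand-in: norm of a["v"] + b["s"] * b["v"].
    # The real fancy_operation calls a served Tesseract instead.
    return jnp.linalg.norm(a["v"] + b["s"] * b["v"])


# Assumed inputs with the same dict structure as in the notebook.
a = {"v": jnp.array([1.0, 2.0, 3.0])}
b = {"s": jnp.array(2.0), "v": jnp.array([4.0, 5.0, 6.0])}

# jax.jvp returns (primal output, tangent output); name both explicitly.
primal, tangent_out = jax.jvp(toy_operation, (a, b), (a, b))

# Gradient w.r.t. the first argument only.
grad_a = jax.grad(toy_operation, argnums=0)(a, b)

# Gradients w.r.t. both arguments; the result is a tuple of pytrees.
grad_a_both, grad_b = jax.grad(toy_operation, argnums=(0, 1))(a, b)

# For a scalar output, a VJP with cotangent 1 yields the same gradients.
_, vjp_fn = jax.vjp(toy_operation, a, b)
vjp_a, vjp_b = vjp_fn(jnp.ones_like(primal))

print(f"{primal=}, {tangent_out=}")
print(grad_a, grad_b)
```

With a scalar output, the JVP along `(a, b)` should equal the inner product of the full gradient with `(a, b)`, which is a quick sanity check on the tangent direction.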

jacanchaplais · jacanchaplais · commit 279c6508f1af · 2025-04-24T12:16:39.000+01:00
diff --git a/examples/simple/demo.ipynb b/examples/simple/demo.ipynb
@@ -64,9 +64,9 @@
    "source": [
     "## Run the Tesseract\n",
     "\n",
-    "The main entrypoint to `tesseract_jax` is the function `apply_tesseract`.\n",
-    "Using the `vectoradd_jax` Tesseract image we built earlier, let's add two vectors together.\n",
+    "The main entrypoint to `tesseract_jax` is `apply_tesseract()`.\n",
     "\n",
+    "Using the `vectoradd_jax` Tesseract image we built earlier, let's add two vectors together.\n",
     "The result should be:\n",
     "\n",
     "$$\\begin{pmatrix} 1 \\\\ 2 \\\\ 3 \\end{pmatrix} + 2 \\cdot \\begin{pmatrix} 4 \\\\ 5 \\\\ 6 \\end{pmatrix} = \\begin{pmatrix} 9 \\\\ 12 \\\\ 15 \\end{pmatrix}$$"
@@ -83,7 +83,7 @@
   },
   {
    "cell_type": "code",
-   "execution_count": 1,
+   "execution_count": 2,
    "metadata": {},
    "outputs": [],
    "source": [
@@ -102,7 +102,7 @@
   },
   {
    "cell_type": "code",
-   "execution_count": 2,
+   "execution_count": 3,
    "metadata": {},
    "outputs": [
     {
@@ -118,7 +118,7 @@
        " 'abstract_eval']"
       ]
      },
-     "execution_count": 2,
+     "execution_count": 4,
      "metadata": {},
      "output_type": "execute_result"
     }
@@ -147,7 +147,7 @@
   },
   {
    "cell_type": "code",
-   "execution_count": 3,
+   "execution_count": 5,
    "metadata": {},
    "outputs": [
     {
@@ -201,7 +201,7 @@
   },
   {
    "cell_type": "code",
-   "execution_count": 4,
+   "execution_count": 6,
    "metadata": {},
    "outputs": [
     {
@@ -210,7 +210,7 @@
        "Array(16.135319, dtype=float32)"
       ]
      },
-     "execution_count": 4,
+     "execution_count": 7,
      "metadata": {},
      "output_type": "execute_result"
     }
@@ -241,7 +241,7 @@
   },
   {
    "cell_type": "code",
-   "execution_count": 5,
+   "execution_count": 8,
    "metadata": {},
    "outputs": [
     {
@@ -250,7 +250,7 @@
        "Array(16.135319, dtype=float32)"
       ]
      },
-     "execution_count": 5,
+     "execution_count": 9,
      "metadata": {},
      "output_type": "execute_result"
     }
@@ -280,29 +280,27 @@
   },
   {
    "cell_type": "code",
-   "execution_count": 6,
+   "execution_count": 10,
    "metadata": {},
    "outputs": [
     {
-     "data": {
-      "text/plain": [
-       "(Array(16.135319, dtype=float32), Array(25.004124, dtype=float32))"
-      ]
-     },
-     "execution_count": 6,
-     "metadata": {},
-     "output_type": "execute_result"
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "primal=Array(16.135319, dtype=float32), jvp=Array(25.004124, dtype=float32)\n"
+     ]
     }
    ],
    "source": [
-    "jax.jvp(fancy_operation, (a, b), (a, b))"
+    "primal, jvp = jax.jvp(fancy_operation, (a, b), (a, b))\n",
+    "print(f\"{primal=}, {jvp=}\")"
    ]
   },
   {
    "cell_type": "markdown",
    "metadata": {},
    "source": [
-    "(where the first argument is the primal value, and the second is the Jacobian of fancy_operation calculated in $(a,b)$ multiplied with the vector $(a \\, a)$)."
+    "Here, `jvp` is the Jacobian of `fancy_operation` evaluated at $(a,b)$ and multiplied by the tangent vector $(a, b)$."
    ]
   },
   {
@@ -314,25 +312,22 @@
   },
   {
    "cell_type": "code",
-   "execution_count": 7,
+   "execution_count": 11,
    "metadata": {},
    "outputs": [
     {
-     "data": {
-      "text/plain": [
-       "({'v': Array([-0.20733577,  0.56435245, -0.329298  ], dtype=float32)},\n",
-       " {'s': Array(80.709854, dtype=float32),\n",
-       "  'v': Array([-0.8293431, 50.663364 , -1.317192 ], dtype=float32)})"
-      ]
-     },
-     "execution_count": 7,
-     "metadata": {},
-     "output_type": "execute_result"
+     "name": "stdout",
+     "output_type": "stream",
+     "text": [
+      "({'v': Array([-0.20733577,  0.56435245, -0.329298  ], dtype=float32)},\n",
+      " {'s': Array(80.709854, dtype=float32),\n",
+      "  'v': Array([-0.8293431, 50.663364 , -1.317192 ], dtype=float32)})\n"
+     ]
     }
    ],
    "source": [
     "primal, vjp = jax.vjp(fancy_operation, a, b)\n",
-    "vjp(primal)"
+    "pprint(vjp(primal))"
    ]
   },
   {
@@ -348,12 +343,12 @@
    "source": [
     "#### Computing the gradient\n",
     "\n",
-    "Let's calculate the gradient of `fancy_operation` w.r.t. the `a` argument at the point $(a,b)$:"
+    "Let's calculate the gradient of `fancy_operation` w.r.t. the `a` argument at the point $(a,b)$. Since `a` is the first argument, we pass `argnums=0` to `jax.grad()`."
    ]
   },
   {
    "cell_type": "code",
-   "execution_count": 8,
+   "execution_count": 12,
    "metadata": {},
    "outputs": [
     {
@@ -362,13 +357,42 @@
        "{'v': Array([-0.01284981,  0.03497622, -0.02040852], dtype=float32)}"
       ]
      },
-     "execution_count": 8,
+     "execution_count": 13,
      "metadata": {},
      "output_type": "execute_result"
     }
    ],
    "source": [
-    "jax.grad(fancy_operation)(a, b)"
+    "jax.grad(fancy_operation, argnums=0)(a, b)"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "Alternatively, as in our VJP calculation, we can compute the gradients with respect to both `a` and `b` at once by passing `argnums=[0, 1]`."
+   ]
+  },
+  {
+   "cell_type": "code",
+   "execution_count": 14,
+   "metadata": {},
+   "outputs": [
+    {
+     "data": {
+      "text/plain": [
+       "({'v': Array([-0.01284981,  0.03497622, -0.02040852], dtype=float32)},\n",
+       " {'s': Array(5.002062, dtype=float32),\n",
+       "  'v': Array([-0.05139923,  3.139905  , -0.08163408], dtype=float32)})"
+      ]
+     },
+     "execution_count": 15,
+     "metadata": {},
+     "output_type": "execute_result"
+    }
+   ],
+   "source": [
+    "jax.grad(fancy_operation, argnums=[0, 1])(a, b)"
    ]
   },
   {
@@ -382,7 +406,7 @@
   },
   {
    "cell_type": "code",
-   "execution_count": 9,
+   "execution_count": 16,
    "metadata": {},
    "outputs": [
     {
@@ -391,7 +415,7 @@
        "{'v': Array([-0.01284981,  0.03497622, -0.02040852], dtype=float32)}"
       ]
      },
-     "execution_count": 9,
+     "execution_count": 17,
      "metadata": {},
      "output_type": "execute_result"
     }
@@ -409,14 +433,36 @@
     "jax.jit(jax.grad(jitted_op))(a, b)"
    ]
   },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "## Teardown and conclusions"
+   ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "Since we kept the Tesseract alive using `.serve()`, we now need to stop it using `.teardown()`."
+   ]
+  },
   {
    "cell_type": "code",
-   "execution_count": 10,
+   "execution_count": 18,
    "metadata": {},
    "outputs": [],
    "source": [
     "vectoradd.teardown()"
    ]
+  },
+  {
+   "cell_type": "markdown",
+   "metadata": {},
+   "source": [
+    "And that's it!\n",
+    "You've built differentiable pipelines with Tesseracts that blend seamlessly with JAX's API, thanks to Tesseract-JAX."
+   ]
   }
  ],
  "metadata": {