|
81 | 81 | },
|
82 | 82 | "outputs": [],
|
83 | 83 | "source": [
|
| 84 | + "!pip install --pre -U tensorflow\n", |
84 | 85 | "import math\n",
|
85 | 86 | "import tensorflow as tf"
|
86 | 87 | ]
|
|
1459 | 1460 | "print(\"Indexed value:\", rt[1].numpy())"
|
1460 | 1461 | ]
|
1461 | 1462 | },
|
| 1463 | + { |
| 1464 | + "cell_type": "markdown", |
| 1465 | + "metadata": { |
| 1466 | + "id": "J87jMZa0M_YW" |
| 1467 | + }, |
| 1468 | + "source": [ |
| 1469 | + "## Ragged Shapes\n", |
| 1470 | + "\n", |
| 1471 | + "The shape of a tensor specifies the size of each axis. For example, the shape of `[[1, 2], [3, 4], [5, 6]]` is `[3, 2]`, since there are 3 rows and 2 columns. TensorFlow has two separate but related ways to describe shapes:\n", |
| 1472 | + "\n", |
| 1473 | + "* ***static shape***: Information about axis sizes that is known statically (e.g., while tracing a `tf.function`). May be partially specified.\n", |
| 1474 | + "\n", |
| 1475 | + "* ***dynamic shape***: Runtime information about the axis sizes." |
| 1476 | + ] |
| 1477 | + }, |
| 1478 | + { |
| 1479 | + "cell_type": "markdown", |
| 1480 | + "metadata": { |
| 1481 | + "id": "IOETE_OLPLZo" |
| 1482 | + }, |
| 1483 | + "source": [ |
| 1484 | + "### Static shape\n", |
| 1485 | + "\n", |
| 1486 | + "A Tensor's static shape contains information about its axis sizes that is known at graph-construction time. For both `tf.Tensor` and `tf.RaggedTensor`, it is available using the `.shape` property, and is encoded using `tf.TensorShape`:" |
| 1487 | + ] |
| 1488 | + }, |
| 1489 | + { |
| 1490 | + "cell_type": "code", |
| 1491 | + "execution_count": null, |
| 1492 | + "metadata": { |
| 1493 | + "id": "btGDjT4uNgQy" |
| 1494 | + }, |
| 1495 | + "outputs": [], |
| 1496 | + "source": [ |
| 1497 | + "x = tf.constant([[1, 2], [3, 4], [5, 6]])\n", |
| 1498 | + "x.shape # shape of a tf.tensor" |
| 1499 | + ] |
| 1500 | + }, |
| 1501 | + { |
| 1502 | + "cell_type": "code", |
| 1503 | + "execution_count": null, |
| 1504 | + "metadata": { |
| 1505 | + "id": "__OgvmrGPEjq" |
| 1506 | + }, |
| 1507 | + "outputs": [], |
| 1508 | + "source": [ |
| 1509 | + "rt = tf.ragged.constant([[1], [2, 3], [], [4]])\n", |
| 1510 | + "rt.shape # shape of a tf.RaggedTensor" |
| 1511 | + ] |
| 1512 | + }, |
| 1513 | + { |
| 1514 | + "cell_type": "markdown", |
| 1515 | + "metadata": { |
| 1516 | + "id": "9EWnQd3qPWaw" |
| 1517 | + }, |
| 1518 | + "source": [ |
| 1519 | + "The static shape of a ragged dimension is always `None` (i.e., unspecified). However, the inverse is not true -- if a `TensorShape` dimension is `None`, then that could indicate that the dimension is ragged, *or* it could indicate that the dimension is uniform but that its size is not statically known." |
| 1520 | + ] |
| 1521 | + }, |
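| | + { |
| | + "cell_type": "markdown", |
| | + "metadata": {}, |
| | + "source": [ |
| | + "A uniform axis whose size is not statically known also shows up as `None`. As a minimal sketch (the `show_static_shape` function and its `input_signature` below are just illustrative), tracing a `tf.function` with a partially specified signature makes this visible:" |
| | + ] |
| | + }, |
| | + { |
| | + "cell_type": "code", |
| | + "execution_count": null, |
| | + "metadata": {}, |
| | + "outputs": [], |
| | + "source": [ |
| | + "# Illustrative example: the first axis is uniform, but its size is not\n", |
| | + "# statically known while tracing, so it appears as None in the static shape.\n", |
| | + "@tf.function(input_signature=[tf.TensorSpec(shape=[None, 2], dtype=tf.int32)])\n", |
| | + "def show_static_shape(t):\n", |
| | + "  print(\"Static shape while tracing:\", t.shape)\n", |
| | + "  return t\n", |
| | + "\n", |
| | + "show_static_shape(tf.constant([[1, 2], [3, 4]]))  # prints (None, 2)" |
| | + ] |
| | + }, |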
| 1522 | + { |
| 1523 | + "cell_type": "markdown", |
| 1524 | + "metadata": { |
| 1525 | + "id": "75E9YXYMNfne" |
| 1526 | + }, |
| 1527 | + "source": [ |
| 1528 | + "### Dynamic shape\n", |
| 1529 | + "\n", |
| 1530 | + "A tensor's dynamic shape contains information about its axis sizes that is known when the graph is run. It is constructed using the `tf.shape` operation. For `tf.Tensor`, `tf.shape` returns the shape as a 1D integer `Tensor`, where `tf.shape(x)[i]` is the size of axis `i`." |
| 1531 | + ] |
| 1532 | + }, |
| 1533 | + { |
| 1534 | + "cell_type": "code", |
| 1535 | + "execution_count": null, |
| 1536 | + "metadata": { |
| 1537 | + "id": "kWJ7Cn1EQTD_" |
| 1538 | + }, |
| 1539 | + "outputs": [], |
| 1540 | + "source": [ |
| 1541 | + "x = tf.constant([['a', 'b'], ['c', 'd'], ['e', 'f']])\n", |
| 1542 | + "tf.shape(x)" |
| 1543 | + ] |
| 1544 | + }, |
| 1545 | + { |
| 1546 | + "cell_type": "markdown", |
| 1547 | + "metadata": { |
| 1548 | + "id": "BeZEfxwmRcSv" |
| 1549 | + }, |
| 1550 | + "source": [ |
| 1551 | + "However, a 1D `Tensor` is not expressive enough to describe the shape of a `tf.RaggedTensor`. Instead, the dynamic shape for ragged tensors is encoded using a dedicated type, `tf.experimental.DynamicRaggedShape`. In the following example, the `DynamicRaggedShape` returned by `tf.shape(rt)` indicates that the ragged tensor has 4 rows, with lengths 1, 3, 0, and 2:" |
| 1552 | + ] |
| 1553 | + }, |
| 1554 | + { |
| 1555 | + "cell_type": "code", |
| 1556 | + "execution_count": null, |
| 1557 | + "metadata": { |
| 1558 | + "id": "nZc2wqgQQUFU" |
| 1559 | + }, |
| 1560 | + "outputs": [], |
| 1561 | + "source": [ |
| 1562 | + "rt = tf.ragged.constant([[1], [2, 3, 4], [], [5, 6]])\n", |
| 1563 | + "rt_shape = tf.shape(rt)\n", |
| 1564 | + "print(rt_shape)" |
| 1565 | + ] |
| 1566 | + }, |
| 1567 | + { |
| 1568 | + "cell_type": "markdown", |
| 1569 | + "metadata": { |
| 1570 | + "id": "EphU60YvTf98" |
| 1571 | + }, |
| 1572 | + "source": [ |
| 1573 | + "#### Dynamic shape: operations\n", |
| 1574 | + "\n", |
| 1575 | + "`DynamicRaggedShape`s can be used with most TensorFlow ops that expect shapes, including `tf.reshape`, `tf.zeros`, `tf.ones`. `tf.fill`, `tf.broadcast_dynamic_shape`, and `tf.broadcast_to`." |
| 1576 | + ] |
| 1577 | + }, |
| 1578 | + { |
| 1579 | + "cell_type": "code", |
| 1580 | + "execution_count": null, |
| 1581 | + "metadata": { |
| 1582 | + "id": "pclAODLXT6Gr" |
| 1583 | + }, |
| 1584 | + "outputs": [], |
| 1585 | + "source": [ |
| 1586 | + "print(f\"tf.reshape(x, rt_shape) = {tf.reshape(x, rt_shape)}\")\n", |
| 1587 | + "print(f\"tf.zeros(rt_shape) = {tf.zeros(rt_shape)}\")\n", |
| 1588 | + "print(f\"tf.ones(rt_shape) = {tf.ones(rt_shape)}\")\n", |
| 1589 | + "print(f\"tf.fill(rt_shape, 9) = {tf.fill(rt_shape, 'x')}\")" |
| 1590 | + ] |
| 1591 | + }, |
| 1592 | + { |
| 1593 | + "cell_type": "markdown", |
| 1594 | + "metadata": { |
| 1595 | + "id": "rNP_3_btRAHj" |
| 1596 | + }, |
| 1597 | + "source": [ |
| 1598 | + "#### Dynamic shape: indexing and slicing\n", |
| 1599 | + "\n", |
| 1600 | + "`DynamicRaggedShape` can be also be indexed to get the sizes of uniform dimensions. For example, we can find the number of rows in a raggedtensor using `tf.shape(rt)[0]` (just as we would for a non-ragged tensor):" |
| 1601 | + ] |
| 1602 | + }, |
| 1603 | + { |
| 1604 | + "cell_type": "code", |
| 1605 | + "execution_count": null, |
| 1606 | + "metadata": { |
| 1607 | + "id": "MzQvPhsxS6HN" |
| 1608 | + }, |
| 1609 | + "outputs": [], |
| 1610 | + "source": [ |
| 1611 | + "rt_shape[0]" |
| 1612 | + ] |
| 1613 | + }, |
| 1614 | + { |
| 1615 | + "cell_type": "markdown", |
| 1616 | + "metadata": { |
| 1617 | + "id": "wvr2iT6zS_e8" |
| 1618 | + }, |
| 1619 | + "source": [ |
| 1620 | + "However, it is an error to use indexing to try to retrieve the size of a ragged dimension, since it doesn't have a single size. (Since `RaggedTensor` keeps track of which axes are ragged, this error is only thrown during eager execution or when tracing a `tf.function`; it will never be thrown when executing a concrete function.)" |
| 1621 | + ] |
| 1622 | + }, |
| 1623 | + { |
| 1624 | + "cell_type": "code", |
| 1625 | + "execution_count": null, |
| 1626 | + "metadata": { |
| 1627 | + "id": "HgGMk0LeTGik" |
| 1628 | + }, |
| 1629 | + "outputs": [], |
| 1630 | + "source": [ |
| 1631 | + "try:\n", |
| 1632 | + " rt_shape[1]\n", |
| 1633 | + "except ValueError as e:\n", |
| 1634 | + " print(\"Got expected ValueError:\", e)" |
| 1635 | + ] |
| 1636 | + }, |
| 1637 | + { |
| 1638 | + "cell_type": "markdown", |
| 1639 | + "metadata": { |
| 1640 | + "id": "5QUsdawGU0SM" |
| 1641 | + }, |
| 1642 | + "source": [ |
| 1643 | + "`DynamicRaggedShape`s can also be sliced, as long as the slice either begins with axis `0`, or contains only dense dimensions." |
| 1644 | + ] |
| 1645 | + }, |
| 1646 | + { |
| 1647 | + "cell_type": "code", |
| 1648 | + "execution_count": null, |
| 1649 | + "metadata": { |
| 1650 | + "id": "APT72EaBU70t" |
| 1651 | + }, |
| 1652 | + "outputs": [], |
| 1653 | + "source": [ |
| 1654 | + "rt_shape[:1]" |
| 1655 | + ] |
| 1656 | + }, |
| 1657 | + { |
| 1658 | + "cell_type": "markdown", |
| 1659 | + "metadata": { |
| 1660 | + "id": "a-Wl9IrQXcdY" |
| 1661 | + }, |
| 1662 | + "source": [ |
| 1663 | + "#### Dynamic shape: encoding\n", |
| 1664 | + "\n", |
| 1665 | + "`DynamicRaggedShape` is encoded using two fields:\n", |
| 1666 | + "\n", |
| 1667 | + "* `inner_shape`: An integer vector giving the shape of a dense `tf.Tensor`.\n", |
| 1668 | + "* `row_partitions`: A list of `tf.experimental.RowPartition` objects, describing how the outermost dimension of that inner shape should be partitioned to add ragged axes.\n", |
| 1669 | + "\n", |
| 1670 | + "For more information about row partitions, see the \"RaggedTensor encoding\" section below, and the API docs for `tf.experimental.RowPartition`." |
| 1671 | + ] |
| 1672 | + }, |
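| | + { |
| | + "cell_type": "markdown", |
| | + "metadata": {}, |
| | + "source": [ |
| | + "As a quick illustration (assuming the `row_partitions` and `inner_shape` properties expose those two fields), the encoding of the shape computed above can be inspected directly:" |
| | + ] |
| | + }, |
| | + { |
| | + "cell_type": "code", |
| | + "execution_count": null, |
| | + "metadata": {}, |
| | + "outputs": [], |
| | + "source": [ |
| | + "# rt_shape was computed above as tf.shape(rt), where rt has row lengths\n", |
| | + "# 1, 3, 0, and 2.\n", |
| | + "print(\"row_partitions:\", rt_shape.row_partitions)\n", |
| | + "print(\"inner_shape:\", rt_shape.inner_shape)" |
| | + ] |
| | + }, |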
| 1673 | + { |
| 1674 | + "cell_type": "markdown", |
| 1675 | + "metadata": { |
| 1676 | + "id": "jfeY9tTcV_zL" |
| 1677 | + }, |
| 1678 | + "source": [ |
| 1679 | + "#### Dynamic shape: construction\n", |
| 1680 | + "\n", |
| 1681 | + "`DynamicRaggedShape` is most often constructed by applying `tf.shape` to a `RaggedTensor`, but it can also be constructed directly:" |
| 1682 | + ] |
| 1683 | + }, |
| 1684 | + { |
| 1685 | + "cell_type": "code", |
| 1686 | + "execution_count": null, |
| 1687 | + "metadata": { |
| 1688 | + "id": "NSRgD667WwIZ" |
| 1689 | + }, |
| 1690 | + "outputs": [], |
| 1691 | + "source": [ |
| 1692 | + "tf.experimental.DynamicRaggedShape(\n", |
| 1693 | + " row_partitions=[tf.experimental.RowPartition.from_row_lengths([5, 3, 2])],\n", |
| 1694 | + " inner_shape=[10, 8])" |
| 1695 | + ] |
| 1696 | + }, |
| 1697 | + { |
| 1698 | + "cell_type": "markdown", |
| 1699 | + "metadata": { |
| 1700 | + "id": "EjzVjs9MXIIA" |
| 1701 | + }, |
| 1702 | + "source": [ |
| 1703 | + "If the lengths of all rows are known statically, `DynamicRaggedShape.from_lengths` can also be used to construct a dynamic ragged shape. (This is mostly useful for testing and demonstration code, since it's rare for the lengths of ragged dimensions to be known statically).\n" |
| 1704 | + ] |
| 1705 | + }, |
| 1706 | + { |
| 1707 | + "cell_type": "code", |
| 1708 | + "execution_count": null, |
| 1709 | + "metadata": { |
| 1710 | + "id": "gMxCzADUYIjY" |
| 1711 | + }, |
| 1712 | + "outputs": [], |
| 1713 | + "source": [ |
| 1714 | + "tf.experimental.DynamicRaggedShape.from_lengths([4, (2, 1, 0, 8), 12])" |
| 1715 | + ] |
| 1716 | + }, |
1462 | 1717 | {
|
1463 | 1718 | "cell_type": "markdown",
|
1464 | 1719 | "metadata": {
|
1465 | 1720 | "id": "EdljbNPq-PWS"
|
1466 | 1721 | },
|
1467 | 1722 | "source": [
|
1468 | | - "## Broadcasting\n", |
| 1723 | + "### Broadcasting\n", |
1469 | 1724 | "\n",
|
1470 | 1725 | "Broadcasting is the process of making tensors with different shapes have compatible shapes for elementwise operations. For more background on broadcasting, refer to:\n",
|
1471 | 1726 | "\n",
|
|
1491 | 1746 | "id": "-S2hOUWx-PWU"
|
1492 | 1747 | },
|
1493 | 1748 | "source": [
|
1494 | | - "### Broadcasting examples" |
| 1749 | + "#### Broadcasting examples" |
1495 | 1750 | ]
|
1496 | 1751 | },
|
1497 | 1752 | {
|
|