|
9 | 9 | "This tutorial shows how to convert the original Tensorflow Bert model to ONNX. \n",
|
10 | 10 | "In this example we fine tune Bert for squad-1.1 on top of [BERT-Base, Uncased](https://storage.googleapis.com/bert_models/2018_10_18/uncased_L-12_H-768_A-12.zip).\n",
|
11 | 11 | "\n",
|
12 |
| - "Since this tutorial cares mostly about the conversion process we reuse tokenizer and utilities defined in the Bert source tree as much as possible.\n", |
| 12 | + "Since this tutorial cares mostly about the conversion process, we reuse tokenizer and utilities defined in the Bert source tree as much as possible.\n", |
13 | 13 | "\n",
|
14 | 14 | "This should work with all versions supported by the [tensorflow-onnx converter](https://github.com/onnx/tensorflow-onnx), we used the following versions while writing the tutorial:\n",
|
15 | 15 | "```\n",
|
|
27 | 27 | "metadata": {},
|
28 | 28 | "source": [
|
29 | 29 | "## Step 1 - define some environment variables\n",
|
30 |
| - "Before we start, lets setup some variables where to find things." |
| 30 | + "Before we start, let's set up some variables for where to find things." |
31 | 31 | ]
|
32 | 32 | },
|
33 | 33 | {
|
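The hidden cell for this step boils down to a handful of path variables. A minimal sketch of such a setup, assuming the directory layout created by the download cells below; the names `BERT_BASE_DIR` and `SQUAD_DIR` are illustrative, only `$OUT` is referenced later in the notebook:

```python
import os

# assumption: paths match the layout created by the download cells below
os.environ["BERT_BASE_DIR"] = os.path.abspath("uncased_L-12_H-768_A-12")
os.environ["SQUAD_DIR"] = os.path.abspath("squad-1.1")
os.environ["OUT"] = os.path.abspath("out")
```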
|
96 | 96 | "outputs": [],
|
97 | 97 | "source": [
|
98 | 98 | "!wget -q https://storage.googleapis.com/bert_models/2018_10_18/uncased_L-12_H-768_A-12.zip\n",
|
99 |
| - "!unzip /uncased_L-12_H-768_A-12.zip\n", |
| 99 | + "!unzip uncased_L-12_H-768_A-12.zip\n", |
100 | 100 | "\n",
|
101 | 101 | "!mkdir squad-1.1 out\n",
|
102 | 102 | "\n",
|
103 | 103 | "!wget -O squad-1.1/train-v1.1.json https://rajpurkar.github.io/SQuAD-explorer/dataset/train-v1.1.json \n",
|
104 |
| - "!wget -O squad-1.1/dev-v1.1.json https://rajpurkar.github.io/SQuAD-explorer/dataset/dev-v1.1.json \n", |
105 |
| - "!wget -O squad-1.1/evaluate-v1.1.json https://rajpurkar.github.io/SQuAD-explorer/dataset/evaluate-v1.1.json " |
| 104 | + "!wget -O squad-1.1/dev-v1.1.json https://rajpurkar.github.io/SQuAD-explorer/dataset/dev-v1.1.json \n" |
106 | 105 | ]
|
107 | 106 | },
|
108 | 107 | {
|
109 | 108 | "cell_type": "markdown",
|
110 | 109 | "metadata": {},
|
111 | 110 | "source": [
|
112 | 111 | "## Step 4 - fine tune the Bert model for squad-1.1\n",
|
113 |
| - "This is the same as described in the [Bert repository](https://github.com/google-research/bert). You need to do this only once.\n" |
| 112 | + "This is the same as described in the [Bert repository](https://github.com/google-research/bert). This only needs to be done once.\n" |
114 | 113 | ]
|
115 | 114 | },
|
116 | 115 | {
|
|
121 | 120 | "source": [
|
122 | 121 | "#\n",
|
123 | 122 | "# finetune bert for squad-1.1\n",
|
124 |
| - "# this may take a bit\n", |
| 123 | + "# this will take around 3 hours to complete, and even longer if your device does not have a GPU \n", |
125 | 124 | "#\n",
|
126 | 125 | "\n",
|
127 | 126 | "!cd bert && \\\n",
|
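The truncated cell above follows the run_squad.py recipe from the Bert README. A sketch of what the full invocation looks like; the flag values are the README defaults for squad-1.1 training, not necessarily the exact ones used here:

```python
!cd bert && \
    python run_squad.py \
      --vocab_file=$BERT_BASE_DIR/vocab.txt \
      --bert_config_file=$BERT_BASE_DIR/bert_config.json \
      --init_checkpoint=$BERT_BASE_DIR/bert_model.ckpt \
      --do_train=True \
      --train_file=$SQUAD_DIR/train-v1.1.json \
      --do_predict=True \
      --predict_file=$SQUAD_DIR/dev-v1.1.json \
      --train_batch_size=12 \
      --learning_rate=3e-5 \
      --num_train_epochs=2.0 \
      --max_seq_length=384 \
      --output_dir=$OUT
```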
|
146 | 145 | "metadata": {},
|
147 | 146 | "source": [
|
148 | 147 | "## Step 5 - create the inference graph and save it\n",
|
149 |
| - "With a fined tuned model in hands we want to create the inference graph for it and save it as saved_model format.\n", |
| 148 | + "With a fine-tuned model in hands we want to create the inference graph for it and save it as saved_model format.\n", |
150 | 149 | "\n",
|
151 |
| - "***We assune that after 2 epochs the checkpoint is model.ckpt-21899 - if the following code does not find it, check the $OUT directory for the higest checkpoint***." |
| 150 | + "***We assume that after 2 epochs the checkpoint is model.ckpt-21899 - if the following code does not find it, check the $OUT directory for the higest checkpoint***." |
152 | 151 | ]
|
153 | 152 | },
|
154 | 153 | {
|
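If the expected checkpoint name does not match, `tf.train.latest_checkpoint` can locate the newest one instead of browsing $OUT by hand. A minimal sketch, assuming $OUT was set in Step 1:

```python
import os
import tensorflow as tf

# prints e.g. 'out/model.ckpt-21899', or None if no checkpoint was written
print(tf.train.latest_checkpoint(os.environ["OUT"]))
```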
|
202 | 201 | "cell_type": "markdown",
|
203 | 202 | "metadata": {},
|
204 | 203 | "source": [
|
205 |
| - "Create the model and run predictions on all data and save the results so we can compare them later to the onnxruntime version." |
| 204 | + "Create the model, run predictions on all data, and save the results to later compare them to the onnxruntime version." |
206 | 205 | ]
|
207 | 206 | },
|
208 | 207 | {
|
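Pickling the raw prediction tuples is enough for the later comparison. A minimal sketch; `all_results` is a hypothetical name for the list the predict loop fills:

```python
import pickle

all_results = []  # assumption: filled by the predict loop with (unique_id, start_logits, end_logits)

# persist the Tensorflow predictions so the onnxruntime output
# can be checked against them after the conversion
with open("out/tf_results.pkl", "wb") as f:
    pickle.dump(all_results, f)
```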
|
309 | 308 | "scrolled": true
|
310 | 309 | },
|
311 | 310 | "source": [
|
312 |
| - "Now lets create the inference graph and save it." |
| 311 | + "Now let's create the inference graph and save it." |
313 | 312 | ]
|
314 | 313 | },
|
315 | 314 | {
|
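The export boils down to describing the model's feed tensors and calling the estimator's export. A minimal sketch under stated assumptions - `estimator` is the TPUEstimator built by run_squad.py and the sequence length matches the fine-tuning run; this is not the notebook's exact code:

```python
import tensorflow as tf

MAX_SEQ_LENGTH = 384  # assumption: the value used during fine-tuning

def serving_input_receiver_fn():
    # placeholders mirroring the features run_squad.py feeds the model
    features = {
        "unique_ids": tf.placeholder(tf.int32, [None], name="unique_ids"),
        "input_ids": tf.placeholder(tf.int32, [None, MAX_SEQ_LENGTH], name="input_ids"),
        "input_mask": tf.placeholder(tf.int32, [None, MAX_SEQ_LENGTH], name="input_mask"),
        "segment_ids": tf.placeholder(tf.int32, [None, MAX_SEQ_LENGTH], name="segment_ids"),
    }
    return tf.estimator.export.ServingInputReceiver(features, features)

# estimator: the TPUEstimator from the fine-tuning step (hypothetical here)
estimator.export_saved_model("out/saved_model", serving_input_receiver_fn)
```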
|
340 | 339 | "source": [
|
341 | 340 | "## Step 6 - convert to ONNX\n",
|
342 | 341 | "\n",
|
343 |
| - "Convert the model from tensorflow to onnx using https://github.com/onnx/tensorflow-onnx." |
| 342 | + "Convert the model from Tensorflow to ONNX using https://github.com/onnx/tensorflow-onnx." |
344 | 343 | ]
|
345 | 344 | },
|
346 | 345 | {
|
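A sketch of the conversion command, assuming the Step 5 export landed in $OUT/saved_model; estimator exports add a timestamped subdirectory, so adjust the path accordingly:

```python
!python -m tf2onnx.convert --saved-model $OUT/saved_model --output $OUT/bert.onnx --opset 11
```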
|
393 | 392 | "cell_type": "markdown",
|
394 | 393 | "metadata": {},
|
395 | 394 | "source": [
|
396 |
| - "Lets look at the inputs to the ONNX model. The input 'unique_ids' is special and creates some issue in ONNX: the input is passed directly to the output and in Tensorflow both have the same name. In ONNX that is not supported and the converter creates a new name for the input. We need to use that created name so we remember it." |
| 395 | + "Let's look at the inputs to the ONNX model. The input 'unique_ids' is special and creates an issue in ONNX: the input passed directly to the output and in Tensorflow both have the same name. Because that is not supported in ONNX, the converter creates a new name for the input. We need to use that created name as to remember it." |
397 | 396 | ]
|
398 | 397 | },
|
399 | 398 | {
|
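Listing the graph inputs with the onnx package shows the generated name. A minimal sketch, assuming the converted model was written to out/bert.onnx:

```python
import onnx

model = onnx.load("out/bert.onnx")
# the renamed unique_ids input shows up here alongside
# input_ids, input_mask and segment_ids
print([i.name for i in model.graph.input])
```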
|
552 | 551 | "source": [
|
553 | 552 | "## Summary\n",
|
554 | 553 | "\n",
|
555 |
| - "That was all it takes to convert a relativly complex model from Tensorflow to ONNX. \n", |
| 554 | + "That was all it takes to convert a relatively complex model from Tensorflow to ONNX. \n", |
556 | 555 | "\n",
|
557 |
| - "You find more documentation about tensorflow-onnx [here](https://github.com/onnx/tensorflow-onnx)." |
| 556 | + "You can find more documentation about tensorflow-onnx [here](https://github.com/onnx/tensorflow-onnx)." |
558 | 557 | ]
|
559 | 558 | },
|
560 | 559 | {
|
|