Commit ddaf0ba

Minor updates (#157)
1 parent 6f45d03 commit ddaf0ba

2 files changed: +47 -31 lines


AI_Postdoc_Workshop/module1/2-langchain.ipynb

Lines changed: 43 additions & 27 deletions
@@ -49,7 +49,7 @@
  },
  {
  "cell_type": "code",
- "execution_count": 1,
+ "execution_count": null,
  "id": "8dcf656b",
  "metadata": {},
  "outputs": [],
@@ -64,7 +64,7 @@
  "source": [
  "### String PromptTemplates\n",
  "\n",
- "The [String PromptTemplates](https://python.langchain.com/v0.2/docs/concepts/#string-prompttemplates) is used to format a string input. By default, the template takes Python `f-string` format. There are currently 2 choices of `template_format` available: `f-string` and `jinja2`. Later we will see the use of `jinja2` format. In the example below, we will use the `f-string` format."
+ "The [String PromptTemplates](https://python.langchain.com/v0.2/docs/concepts/#string-prompttemplates) are used to format a string input. By default, the templates take Python's `f-string` format. There are currently 2 choices of `template_format` available: `f-string` and `jinja2`. Later we will see the use of `jinja2` format. In the example below, we will use the `f-string` format."
  ]
  },
  {
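
A minimal sketch of the `f-string` template behavior the cell above describes, assuming `langchain-core` is installed (the "topic" variable name is illustrative, not from the notebook):

from langchain_core.prompts import PromptTemplate

# template_format defaults to "f-string"; {topic} becomes an input variable
template = PromptTemplate.from_template("Tell me a fact about {topic}.")
print(template.format(topic="galaxies"))  # -> "Tell me a fact about galaxies."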
@@ -91,7 +91,7 @@
  },
  {
  "cell_type": "code",
- "execution_count": 3,
+ "execution_count": null,
  "id": "13cf3124",
  "metadata": {},
  "outputs": [],
@@ -124,7 +124,7 @@
  },
  {
  "cell_type": "code",
- "execution_count": 5,
+ "execution_count": null,
  "id": "9",
  "metadata": {},
  "outputs": [],
@@ -163,7 +163,7 @@
  },
  {
  "cell_type": "code",
- "execution_count": 8,
+ "execution_count": null,
  "id": "12",
  "metadata": {},
  "outputs": [],
@@ -192,14 +192,14 @@
  "source": [
  "#### Your turn 😎\n",
  "\n",
- "Create a `StringPromptTemplate` that outputs some text generation prompt, for example, \"Sun is part of galaxy ...\".\n",
+ "Create a `String PromptTemplate` that outputs some text generation prompt, for example, \"Sun is part of galaxy ...\".\n",
  "\n",
  "Feel free to experiment with the built in [Python `f-string` ](https://docs.python.org/3.11/tutorial/inputoutput.html#formatted-string-literals) for the `prompt` input argument to the model."
  ]
  },
  {
  "cell_type": "code",
- "execution_count": 10,
+ "execution_count": null,
  "id": "b0cdc634",
  "metadata": {},
  "outputs": [],
@@ -220,12 +220,12 @@
  "id": "8e67e77a",
  "metadata": {},
  "source": [
- "LangChain have implemented a [`Runnable`](https://api.python.langchain.com/en/stable/runnables/langchain_core.runnables.base.Runnable.html#langchain_core.runnables.base.Runnable) protocol that allows us to create custom \"chains\".\n",
+ "LangChain has implemented a [`Runnable`](https://api.python.langchain.com/en/stable/runnables/langchain_core.runnables.base.Runnable.html#langchain_core.runnables.base.Runnable) protocol that allows us to create custom \"chains\".\n",
  "This protocol has a standard interface for defining and invoking various LLMs, PromptTemplates, and other components, enabling reusability.\n",
  "For more details, go to LangChain's [Runnable documentation](https://python.langchain.com/v0.2/docs/concepts/#runnable-interface).\n",
  "\n",
  "```{note}\n",
- "In this tutorial, you will see the use of `.invoke` method on various LangChain's object.\n",
+ "In this tutorial, you will see the use of the `.invoke` method on various LangChain objects.\n",
  "This is essentially using that standard interface for the `Runnable` protocol.\n",
  "```"
  ]
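
A short sketch of what the `Runnable` protocol provides, assuming `langchain-core` is installed; the pipe composition shown in the comments is LangChain's standard chain-building syntax, though the notebook's own components may differ:

from langchain_core.prompts import PromptTemplate
from langchain_core.output_parsers import StrOutputParser

prompt = PromptTemplate.from_template("What are {objects}?")

# Every Runnable exposes the same .invoke interface:
print(prompt.invoke({"objects": "stars"}))

# Runnables also compose with | into a new Runnable (llm stands in for
# whichever model object the notebook defines):
# chain = prompt | llm | StrOutputParser()
# chain.invoke({"objects": "stars"})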
@@ -240,7 +240,7 @@
  },
  {
  "cell_type": "code",
- "execution_count": 11,
+ "execution_count": null,
  "id": "16",
  "metadata": {},
  "outputs": [],
@@ -250,7 +250,7 @@
  },
  {
  "cell_type": "code",
- "execution_count": 12,
+ "execution_count": null,
  "id": "17",
  "metadata": {},
  "outputs": [],
@@ -290,7 +290,7 @@
  },
  {
  "cell_type": "code",
- "execution_count": 14,
+ "execution_count": null,
  "id": "864d5266",
  "metadata": {},
  "outputs": [],
@@ -313,7 +313,7 @@
  "id": "8b730d9e",
  "metadata": {},
  "source": [
- "If you'd like to access the base object `Llama` object from the `llama-cpp-python` package, you can access it via the `.client` attribute of the `LlamaCpp` object."
+ "If you'd like to access the base `Llama` object from the `llama-cpp-python` package, you can access it via the `.client` attribute of the `LlamaCpp` object."
  ]
  },
  {
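
A hedged sketch of that `.client` access, assuming `langchain-community` and `llama-cpp-python` are installed; the model path is a placeholder, not the tutorial's actual file:

from langchain_community.llms import LlamaCpp

llm = LlamaCpp(model_path="path/to/model.gguf")  # placeholder path

# .client holds the underlying llama_cpp.Llama instance
base_llama = llm.client
print(type(base_llama))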
@@ -338,7 +338,7 @@
  },
  {
  "cell_type": "code",
- "execution_count": 17,
+ "execution_count": null,
  "id": "18",
  "metadata": {},
  "outputs": [],
@@ -400,19 +400,19 @@
  "metadata": {},
  "source": [
  "As we can see above, the template reads as follows:\n",
- "- `eos_token` is a string that is added at the top of the resulting string after prompt is formatted.\n",
+ "- `eos_token` is a string that is added at the top of the resulting string after the prompt is formatted.\n",
  "You can also see that `eos_token` is used to append `content` string values from an `assistant` `role`.\n",
  "You can find this value by going to the Model's [`tokenizer_config.json`](https://huggingface.co/allenai/OLMo-7B-Instruct-hf/blob/main/tokenizer_config.json#L233) file and looking for the `eos_token` key. *Unfornately, this is currently the only way to get this information, you can go to https://github.com/ggerganov/llama.cpp/issues/5040 for more details.* In our case, the `eos_token` is `<|endoftext|>`.\n",
- "- `messages` is a list of dictionary that is iterated over. As you can see that this dictionary should contain a `role` and `content` key.\n",
- "- `add_generation_prompt` is a boolean that is used to determine whether to add a generation prompt or not. In this case, when it's the last message and `add_generation_prompt` is `True`, it will add `<|assistant|>` string to the end of the prompt."
+ "- `messages` is a list of dictionary that is iterated over. As you can see these dictionaries should contain `role` and `content` keys.\n",
+ "- `add_generation_prompt` is a boolean that is used to determine whether to add a generation prompt or not. In this case, when it's the last message and `add_generation_prompt` is `True`, it will add the `<|assistant|>` string to the end of the prompt."
  ]
  },
  {
  "cell_type": "markdown",
  "id": "f1ad5f5c",
  "metadata": {},
  "source": [
- "Now that we know what the template expects we can create the final prompt string by passing in the expected input variables, this time, instead of using the `.format` method, let's see what happens if we use the `.invoke` method on the `PromptTemplate` object."
+ "Now that we know what the template expects we can create the final prompt string by passing in the expected input variables. This time, instead of using the `.format` method, let's see what happens if we use the `.invoke` method on the `PromptTemplate` object."
  ]
  },
  {
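
To make that walk-through concrete, an illustrative `messages` value of the shape described above; with `add_generation_prompt=True` the formatted string would end in `<|assistant|>`, and each assistant turn would be closed with the `<|endoftext|>` `eos_token`:

# Illustrative input for the chat template; the content strings are made up
messages = [
    {"role": "user", "content": "What are stars?"},
    {"role": "assistant", "content": "Stars are luminous spheres of plasma."},
    {"role": "user", "content": "What is a moon?"},
]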
@@ -464,7 +464,7 @@
  },
  {
  "cell_type": "code",
- "execution_count": 21,
+ "execution_count": null,
  "id": "4caf24cf",
  "metadata": {},
  "outputs": [],
@@ -488,7 +488,7 @@
  "id": "6d33e0d4",
  "metadata": {},
  "source": [
- "You can see below that we get [`StringPromptValue`](https://api.python.langchain.com/en/latest/prompt_values/langchain_core.prompt_values.StringPromptValue.html) object this time as the output rather than pure string. But we can still get the string value by calling the `.to_string` method on the `StringPromptValue` object."
+ "You can see below that we get a [`StringPromptValue`](https://api.python.langchain.com/en/latest/prompt_values/langchain_core.prompt_values.StringPromptValue.html) object this time as the output rather than a pure string. But we can still get the string value by calling the `.to_string` method on the `StringPromptValue` object."
  ]
  },
  {
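
A minimal sketch of the `.invoke` vs `.format` difference this cell describes (the template text is illustrative):

from langchain_core.prompts import PromptTemplate

prompt = PromptTemplate.from_template("What are {objects}?")

prompt_value = prompt.invoke({"objects": "stars and moon"})
print(type(prompt_value).__name__)  # StringPromptValue
print(prompt_value.to_string())     # What are stars and moon?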
@@ -560,7 +560,7 @@
  "STEP 2: Prompt Template reads the variables to form the prompt text as output - \"What are stars and moon?\" \n",
  "STEP 3: The prompt is given as input to the LLM model. \n",
  "STEP 4: LLM Model produces output. \n",
- "STEP 5: The output goes through StrOutputParser that parses it into string and gives the result. "
+ "STEP 5: The output goes through StrOutputParser that parses it into a string and gives the result. "
  ]
  },
  {
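
Putting STEPs 1 through 5 together, a hedged end-to-end sketch; the `LlamaCpp` model path is a placeholder, and the notebook's actual chain may name its components differently:

from langchain_core.prompts import PromptTemplate
from langchain_core.output_parsers import StrOutputParser
from langchain_community.llms import LlamaCpp

prompt = PromptTemplate.from_template("What are {objects}?")  # STEPs 1-2
llm = LlamaCpp(model_path="path/to/model.gguf")               # STEPs 3-4, placeholder path
chain = prompt | llm | StrOutputParser()                      # STEP 5 parses to a string

print(chain.invoke({"objects": "stars and moon"}))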
@@ -573,7 +573,7 @@
  },
  {
  "cell_type": "code",
- "execution_count": 23,
+ "execution_count": null,
  "id": "25",
  "metadata": {},
  "outputs": [],
@@ -667,7 +667,7 @@
  },
  {
  "cell_type": "code",
- "execution_count": 28,
+ "execution_count": null,
  "id": "28",
  "metadata": {},
  "outputs": [],
@@ -682,7 +682,7 @@
  },
  {
  "cell_type": "code",
- "execution_count": 29,
+ "execution_count": null,
  "id": "29",
  "metadata": {},
  "outputs": [],
@@ -705,7 +705,7 @@
  },
  {
  "cell_type": "code",
- "execution_count": 30,
+ "execution_count": null,
  "id": "31",
  "metadata": {},
  "outputs": [],
@@ -748,7 +748,7 @@
  "source": [
  "#### Your turn 😎\n",
  "\n",
- "Try different messages value(s) and see how the output changes. But remember to follow the template structure.\n",
+ "Try different message values and see how the output changes. But remember to follow the template structure.\n",
  "The dictionary keys must contain `role` and `content` and the allowed `role` values are only `user` and `assistant`."
  ]
  },
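
One hypothetical alternative `messages` value for this exercise, keeping to the allowed `user`/`assistant` roles (the `llm_chain` name assumes the chain built earlier in the notebook):

messages = [
    {"role": "user", "content": "Name a constellation."},
    {"role": "assistant", "content": "Orion is a well-known constellation."},
    {"role": "user", "content": "Which stars form its belt?"},
]
# llm_chain.invoke({"messages": messages})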
@@ -761,6 +761,22 @@
  "source": [
  "# Write your llm_chain.invoke code here, feel free to also, create your own template and try partial_variables"
  ]
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "daade3a0",
+ "metadata": {},
+ "outputs": [],
+ "source": []
+ },
+ {
+ "cell_type": "code",
+ "execution_count": null,
+ "id": "e20c605d-ecd3-400a-9ee7-cd1c9fa9d486",
+ "metadata": {},
+ "outputs": [],
+ "source": []
  }
  ],
  "metadata": {
@@ -779,7 +795,7 @@
  "name": "python",
  "nbconvert_exporter": "python",
  "pygments_lexer": "ipython3",
- "version": "3.11.9"
+ "version": "3.11.11"
  }
  },
  "nbformat": 4,

AI_Postdoc_Workshop/module1/setup.md

Lines changed: 4 additions & 4 deletions
@@ -7,7 +7,7 @@ During the tutorial, to follow along, we recommend using
  to worry about setting up the compute environment. However, if you would like to
  set up the tutorial on your local machine, you can use [**Conda**](#conda).
  
- [**GitHub Codespaces**](#github-codespaces), which is a cloud-based development
+ [**GitHub Codespaces**](#github-codespaces), is a cloud-based development
  environment that's hosted in the cloud. This option is available indefinitely,
  but you will be limited in the free resources you can use with GitHub
  Codespaces.
@@ -29,7 +29,7 @@ GitHub Codespaces and you need to have a GitHub account to use GitHub
  Codespaces.
  
  A codespace is a development environment that's hosted in the cloud. You are
- able to chose from various Dev container configuration, for this specific
+ able to chose from various Dev container configurations. For this specific
  workshop, please ensure that `Scipy2024` is selected. GitHub currently gives
  every user
  [120 vCPU hours per month for free](https://docs.github.com/en/billing/managing-billing-for-github-codespaces/about-billing-for-github-codespaces#monthly-included-storage-and-core-hours-for-personal-accounts),
@@ -51,14 +51,14 @@ You can set up the tutorial locally using a Conda environment. Here's how:
  
  0. Downloading and Installing Conda
  
- If you don't have Conda installed, we recommend following the instruction to
+ If you don't have Conda installed, we recommend following the instructions to
  download and install the
  [Miniforge distribution](https://github.com/conda-forge/miniforge) >=
  `Miniforge3-22.3.1-0` of Conda. This distribution is a minimal installer for
  conda specifically optimized for [conda-forge](https://conda-forge.org/)
  (Community-led recipes, infrastructure and distributions for conda.).
  
- 1. Create a new Conda environment called `ssec-scipy2024` with
+ 1. Create a new Conda environment called `ssec-scipy2024` with the
  [`conda-lock`](https://github.com/conda/conda-lock) package installed. This
  package is used to install the exact versions of the packages in the
  [`conda-lock.yml`](https://raw.githubusercontent.com/uw-ssec/docker-images/main/tutorial-scipy-2024/conda-lock.yml)
