Skip to content

Commit 6f45d03

Browse files
authored
Merge pull request #154 from uw-ssec/147-fix-anshul-changes-for-workshop-37
fix: Reflecting feedback for 3_7 Workshop
2 parents 67aa40b + b11915a commit 6f45d03

File tree

3 files changed

+4
-43
lines changed

3 files changed

+4
-43
lines changed

AI_Postdoc_Workshop/module1/setup.md

Lines changed: 2 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -15,7 +15,8 @@ Codespaces.
1515
## 🐱‍💻 GitHub Codespaces
1616

1717
```{note}
18-
Indefinite Access, but Limited Resources
18+
Indefinite Access, but Limited Resources.
19+
If your Codespace gives you a low disk space error, please refresh the tab.
1920
```
2021

2122
This tutorial is available to run within

AI_Postdoc_Workshop/module2/1-retrieval-augmented-text-generation.ipynb

Lines changed: 1 addition & 21 deletions
Original file line numberDiff line numberDiff line change
@@ -18,26 +18,6 @@
1818
"Throughout this notebook we will be exploring RAG, what it is, how it works, and why it's so exciting."
1919
]
2020
},
21-
{
22-
"cell_type": "markdown",
23-
"id": "eea1041c",
24-
"metadata": {},
25-
"source": [
26-
"## Why RAG?\n",
27-
"\n",
28-
"Although trained on large datasets, stale data can severely limit LLMs. It faces several challenges:\n",
29-
"\n",
30-
"1. The models are trained on internet content, so they might not generate relevant output when prompted for information that is not publicly available on the internet.\n",
31-
"\n",
32-
"2. The models are trained up to a certain date, they might not generate relevant output when prompted for content and information that has happened after the training completion date of the model.\n",
33-
"\n",
34-
"3. The models are trained to be more generalized. This means that they can only produce generic outputs and might not perform as expected when prompted for specific deep-dive concepts related to a particular topic.\n",
35-
"\n",
36-
"One way to dynamically integrate relevant external information is retrieval-augmented generation (RAG), which can help improve the reliability of LLM outputs.\n",
37-
"\n",
38-
"Going back to our original question of how this can be utilized in our own work or organization on [module 1, section 3](../module1/3-domain-specific-question-answering.ipynb) of this workshop. RAG Framework can really be useful in the scenario where there may be a set of documents, GitHub repositories, research papers, and domain-specific knowledge bases that you might want to search through quickly."
39-
]
40-
},
4121
{
4222
"cell_type": "markdown",
4323
"id": "7",
@@ -490,7 +470,7 @@
490470
" documents.extend(loader.load())\n",
491471
"\n",
492472
"for each in documents:\n",
493-
" # # print(each.page_content) # Uncomment this line to see the individual page_content\n",
473+
" # print(each.page_content) # Uncomment this line to see the individual page_content\n",
494474
" print(each.metadata)"
495475
]
496476
},

AI_Postdoc_Workshop/module2/2-olmo-chat-rag.ipynb

Lines changed: 1 addition & 21 deletions
Original file line numberDiff line numberDiff line change
@@ -22,30 +22,10 @@
2222
"id": "53cb355b",
2323
"metadata": {},
2424
"source": [
25-
"For this demo we will use Panel, a powerful open source Python library to create interactive dashboards in Jupyter notebooks or standalone apps.\n",
25+
"For this demo we will use [Panel](https://panel.holoviz.org/), a powerful open source Python library to create interactive dashboards in Jupyter notebooks or standalone apps.\n",
2626
" "
2727
]
2828
},
29-
{
30-
"cell_type": "markdown",
31-
"id": "9cf1c043",
32-
"metadata": {},
33-
"source": [
34-
"## What is Panel?\n",
35-
"\n",
36-
"[Panel](https://panel.holoviz.org/) is a part of the [HoloViz](https://holoviz.org/) ecosystem, which is a set of tools that make it easier to work with large datasets and create interactive visualizations. This ecosystem consists of the following tools beyond Panel:\n",
37-
"\n",
38-
"- [hvPlot](https://hvplot.holoviz.org/) to quickly generate interactive plots from your data\n",
39-
"- [HoloViews](https://holoviews.org/) to help you make all of your data instantly visualizable\n",
40-
"- [GeoViews](https://geoviews.org/) to extend HoloViews for geographic data\n",
41-
"- [Datashader](https://datashader.org/) for rendering even the largest datasets\n",
42-
"- [Lumen](https://lumen.holoviz.org/) to build data-driven dashboards from a simple YAML specification\n",
43-
"- [Param](https://param.holoviz.org/) to create declarative user-configurable objects\n",
44-
"- [Colorcet](https://colorcet.holoviz.org/) for perceptually uniform colormaps\n",
45-
"\n",
46-
"The nice thing about this ecosystem is that all of these tools are designed to work together as part of the [Pandata scalable open-source analysis stack](https://github.com/panstacks/pandata?tab=readme-ov-file), so you can easily integrate them as needed."
47-
]
48-
},
4929
{
5030
"cell_type": "markdown",
5131
"id": "9f31fff8",

0 commit comments

Comments
 (0)