
Commit 2bbe2b0

Merge pull request #96 from STRIDES/update_tutorials_rs
Update tutorials rs
2 parents 213d892 + a41d24a commit 2bbe2b0

23 files changed (+92 -107 lines)

docs/KubeFlow_Azure.md

Lines changed: 38 additions & 30 deletions
@@ -13,52 +13,60 @@
# Azure Setup

To log into Azure from the command line interface, run the following commands:
```
az login
az account set --subscription <NAME OR ID OF SUBSCRIPTION>
```
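For illustration, a hypothetical filled-in version might look like this (the subscription name is a placeholder; list yours with `az account list --output table`):
```
az login
az account set --subscription "Pay-As-You-Go"
```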

Create a resource group (if necessary):
```
az group create -n <RESOURCE_GROUP_NAME> -l <LOCATION>
```

Create a specifically defined cluster:
```
az aks create -g <RESOURCE_GROUP_NAME> -n <NAME> -s <AGENT_SIZE> -c <AGENT_COUNT> -l <LOCATION> --generate-ssh-keys
```
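As a sketch with hypothetical values (resource group, cluster name, VM size, node count, and region are all assumptions; pick values that fit your subscription and quota):
```
az group create -n kubeflow-rg -l eastus
az aks create -g kubeflow-rg -n kubeflow-aks -s Standard_D4s_v3 -c 3 -l eastus --generate-ssh-keys
```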

# KubeFlow installation

Create user credentials. You only need to run this command once.
```
az aks get-credentials -n <NAME> -g <RESOURCE_GROUP_NAME>
```

Download the kfctl v1.2.0 release from the [Kubeflow releases page](https://github.com/kubeflow/kfctl/releases/tag/v1.2.0).

Unpack the tarball.
```
tar -xvf kfctl_v1.2.0_<platform>.tar.gz
```
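For example, assuming the Linux build; note that the actual asset names on the releases page include a short git hash, so adjust the filename to match what you downloaded:
```
# Asset name is illustrative; copy the real link from the releases page
wget https://github.com/kubeflow/kfctl/releases/download/v1.2.0/kfctl_v1.2.0-0-gbc038f9_linux.tar.gz
tar -xvf kfctl_v1.2.0-0-gbc038f9_linux.tar.gz
```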

Run the following commands to set up and deploy Kubeflow, in order. The code below includes an optional command to add the kfctl binary to your path. If you don't add the binary to your path, you must use the full path to the kfctl binary each time you run it.

```
export PATH=$PATH:<path-to-kfctl>
export KF_NAME=<your choice of name for the Kubeflow deployment>
export BASE_DIR=<path to a base directory>
export KF_DIR=${BASE_DIR}/${KF_NAME}
export CONFIG_URI="https://raw.githubusercontent.com/kubeflow/manifests/v1.2-branch/kfdef/kfctl_k8s_istio.v1.2.0.yaml"
mkdir -p ${KF_DIR}
cd ${KF_DIR}
kfctl apply -V -f ${CONFIG_URI}
```

Run this command to check that the resources have been deployed correctly in namespace kubeflow:
```
kubectl get all -n kubeflow
```
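To watch just the pods come up instead of listing every resource, a narrower variant also works:
```
kubectl get pods -n kubeflow -w
```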

Open the KubeFlow Dashboard. The default installation does not create an external endpoint, but you can use port-forwarding to visit your cluster. Run the following command:
```
kubectl port-forward svc/istio-ingressgateway -n istio-system 8080:80
```
Next, open http://localhost:8080 in your browser.
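If local port 8080 is already in use, any free local port works on the left side of the mapping; for example (9090 is an arbitrary choice), forward as below and browse to http://localhost:9090 instead:
```
kubectl port-forward svc/istio-ingressgateway -n istio-system 9090:80
```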

docs/create_index_from_csv.md

Lines changed: 8 additions & 15 deletions
@@ -1,16 +1,12 @@
### Create an Azure search index from a csv file
:sparkles: Here we outline how to create an Azure search index from a CSV file summarizing funded award data exported from Reporter.nih.gov

### 1) Download input CSV
:ear: If you already have your csv ready, skip to section (2)

Download this public [csv file](https://www.kaggle.com/datasets/henryshan/2023-data-scientists-salary?resource=download) from Kaggle to use as our input.

![Kaggle-csv](/docs/images/kaggle-input.jpeg)

### 2) Import data into Azure blob storage
:ear: If you already added your data to blob storage, skip to section (3)
@@ -35,13 +31,13 @@ Navigate to AI Search and [create a new search](https://learn.microsoft.com/en-u
![Create new search](/docs/images/5_create_new_db.png)

Click `Import data`.

![Import Data](/docs/images/6_import_data.png)

Now fill out all the necessary parameters.
+ Data Source: Select `Azure Blob Storage`. New options will drop down.
+ Data source name: This can be anything, but go with something like `ds-salaries-data`.
+ Data to extract: Select `Content and metadata`.
+ Parsing mode: Select `Delimited text`. Check the `First Line Contains Header` box and leave `Delimiter Character` as `,`.
+ Connection string: Click `Choose an existing connection` and navigate to your storage account and container.
@@ -51,24 +47,21 @@ Now fill out all the necessary parameters.
+ Description: *Optional*.
+ If you get errors when trying to go to the next screen, make sure you don't have trailing commas in your csv and there are no spaces in the header names. If that happens, fix those errors (a quick cleanup sketch follows below), re-upload to blob storage, and then try again!
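A minimal cleanup sketch, assuming GNU sed and a file named `ds_salaries.csv` (both are assumptions; adapt the filename to your download and keep a backup):
```
cp ds_salaries.csv ds_salaries.csv.bak        # keep a backup copy
sed -i 's/,[[:space:]]*$//' ds_salaries.csv   # strip trailing commas at line ends
sed -i '1s/ /_/g' ds_salaries.csv             # replace spaces in header names with underscores
head -n 1 ds_salaries.csv                     # inspect the cleaned header row
```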

![Connect to blob](/docs/images/import-data.jpeg)

Skip ahead to `Customize target index`.
+ Give your index a name.
+ Make `Project_Number` your key.
+ Make sure the expected column names are present under fields. For the columns you expect to use, select `Retrievable` and `Searchable`. If you select all the columns, you will just pay for indexing you are not using.

![Customize index](/docs/images/index-csv.jpeg)

Advance to `Create an indexer`, name your indexer, then click `Submit`.

![Create indexer](/docs/images/create-indexer.jpeg)

Navigate to `Indexes` on the left panel and wait until your index shows as many documents as you have lines in your file. It will read 0 documents until it is finished indexing. The example 500-line csv takes about one minute.

And that is it! Now return to [the tutorial notebook to run queries against this csv using GPT-4](/notebooks/GenAI/notebooks/AzureAIStudio_index_structured_with_console.ipynb).
Binary image files changed in this commit:

docs/images/RM-hello.jpeg (188 KB)
docs/images/RM-parameters.jpeg (148 KB)
docs/images/RM_chat-button.jpeg (246 KB)
docs/images/RM_gpt-4o-deploy.jpeg (331 KB)
