You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
> 7. In the new window, use the following text `value.split('|').uniques().join('|')` as "Expression" and click on `OK`.
233
+
> 6. In the new window, use the following text `value.split('|').uniques().join('|')` as "Expression" and click on `OK`.
234
234
>
235
235
{: .hands_on}
236
236
@@ -319,7 +319,7 @@ You can now see, from which category the museum has the most objects, one of our
319
319
320
320
## Clustering
321
321
322
-
The clustering allows you to solve issues regarding case inconsistencies, incoherent use of either the singular or plural form, and simple spelling mistakes.
322
+
The clustering allows you to solve issues regarding case inconsistencies, incoherent use of either the singular or plural form, and simple spelling mistakes. We apply those to the object categories for the next step of cleaning.
323
323
324
324
> <hands-on-title>Clustering of similar categories</hands-on-title>
325
325
>
@@ -332,19 +332,24 @@ The clustering allows you to solve issues regarding case inconsistencies, incohe
332
332
>
333
333
> 
334
334
>
335
-
> 4. Here, you can see different suggestions from OpenRefine to cluster different categories and merge them into one. In our tutorial, we merge all of the suggestions by clicking on `select > all` and then clicking on `Merge selected and re-cluster`.
336
-
>
337
-
> 
335
+
> 4. Here, you can see different suggestions from OpenRefine to cluster different categories and merge them into one. In our tutorial, we merge all the suggestions by clicking on `Select all` and then clicking on `Merge selected and re-cluster`.
338
336
>
339
337
> 5. Now, you can close the clustering window by clicking on `close`.
340
338
>
341
-
> Be careful! Some methods are too aggressive, so you might end up clustering values that do not belong together. Now that the values have been clustered individually, we can put them back together in a single cell.
339
+
> Be careful with clustering! Some settings are very aggressive, so you might end up clustering values that do not belong together!
340
+
>
341
+
> Now that the different categories have been clustered individually, we can reassemble them in the respective object single cell.
342
+
>
342
343
> 6. Click the Categories triangle and hover over the `Edit cells` and click on `Join multi-valued cells`.
343
-
> 7. Choose the pipe character (`\|`) as a separator and click on `OK`.
344
+
> 7. Choose the pipe character (`|`) as a separator and click on `OK`.
345
+
>
346
+
> 
347
+
>
344
348
> The rows now look like before, with a multi-valued Categories field.
345
349
>
346
350
{: .hands_on}
347
351
352
+
You have now successfully split, cleaned and re-joined the various categories of objects in the museum's metadata! Congratulations.
348
353
When you’re happy with your analysis results, choose whether to export the dataset into your Galaxy history or download it directly onto your computer.
0 commit comments