@@ -205,11 +205,15 @@ The dataset no longer contains duplicates based on the Record ID. However, we ne
 
 ## Use GREL
 
+There are many ways to manipulate your dataset in OpenRefine. One of them is the Google Refine Expression Language (GREL). With the help of GREL, you can, for example, create custom facets or add columns by fetching URLs. We will use it to find and replace errors. For more information, refer to the [GREL documentation](https://openrefine.org/docs/manual/expressions).
+
+Take a look at the `Categories` column of your dataset. Most objects were attributed to various categories, separated by "\|". However, several fields contain "\|\|" instead of "\|". We want to unify those.
+
 > <hands-on-title>Find and replace typos using GREL</hands-on-title>
 >
-> To remove the occurance of double pipe \|\| from the file we can do the following:
+> To remove the occurance of double pipe "\|\|" from the file we can do the following:
 > 1. Click on the triangle on the left of `Categories` and select `Text filter`.
-> 2. On the left, using the `Facet/Filter` section, search for the occurrence of \| and \|\|. There are 71061 rows with \| and 9 rows with \|\|. We want to remove these 9 lines as they are there by mistake.
+> 2. On the left, using the `Facet/Filter` section, search for the occurrence of "\|" and "\|\|". There are 71061 rows with "\|" and 9 rows with "\|\|". We want to remove these nine lines as they were added by mistake.
 > 3. Click on the triangle on the left of `Categories`, hover over `edit cells`, and click on `Transform...`.
 > 4. In the new window, use the following text `value.replace('||', '|')` as "Expression" and click on `OK`.
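
To check the step 4 expression outside OpenRefine, the same replacement can be sketched in Python. This is only an illustration: the function name `fix_double_pipes` and the sample cell values are hypothetical, and Python's `str.replace` stands in for GREL's `replace`, on the assumption that both do a literal (non-regex) substitution when given plain strings.

```python
def fix_double_pipes(value: str) -> str:
    """Mirror the GREL expression value.replace('||', '|') for one cell."""
    # Literal, non-overlapping, left-to-right replacement of "||" with "|".
    return value.replace("||", "|")


# Hypothetical Categories cells; the real dataset values are not shown here.
cells = ["Glass|Ceramics", "Numismatics||Coins", "Textiles"]
cleaned = [fix_double_pipes(cell) for cell in cells]
print(cleaned)
```

Cells that contain only single pipes, or no pipe at all, pass through unchanged, so the transform should be safe to apply to the whole column rather than only to the filtered rows.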