Skip to content

Commit 536de9f

Browse files
authored
Merge pull request #277 from NGO-Algorithm-Audit/feature/structural_edits
Translation edits SDG NL EN
2 parents 35b7f6f + aa2c66f commit 536de9f

File tree

2 files changed

+11
-12
lines changed
  • content
    • english/technical-tools
    • nederlands/technical-tools

2 files changed

+11
-12
lines changed

content/english/technical-tools/SDG.md

Lines changed: 5 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -99,17 +99,17 @@ Try the tool below ⬇️
9999

100100

101101

102-
<!-- Technical details -->
102+
<!-- Technical introduction -->
103103

104-
{{< container_open isAccordion="true" title="Technical details – Synthetic data generation" id="technical-introduction" >}}
104+
{{< container_open isAccordion="true" title="Technical introduction – Synthetic data generation" id="technical-introduction" >}}
105105

106106
<br>
107107

108108
The synthetic data generation tool performs a series of steps:
109109

110110
#### Required preparations by the user:
111111
The user shoulds prepare the following aspects to synthesize data:
112-
- <span style="color:#005AA7">Dataset:</span> Should consists of categorical, numerical and/or time data.
112+
- <span style="color:#005AA7">Dataset:</span> Only categorical, numerical, or time data can be processed. Datasets may contain a maximum of 8 columns, must have a header with column names and do not require an index column.
113113
- <span style="color:#005AA7">Method:</span> By default, the CART method is used to generate synthetic data. CART generally produces higher quality synthetic data, but might not work well on datasets with categorical variables with 20+ categories. Use Gaussian Copula in those cases.
114114
- <span style="color:#005AA7">Number of synthetic data points:</span> Number of synthetic data points to be generated by the tool. Due to computational contstraints of browser-based synthetic data generation, the maximum is set to 5.000.
115115

@@ -184,8 +184,8 @@ Computing the *disclosure protection metric* for synthetic data. This metric mea
184184
##### Step 5. Download:
185185
The generated synthetic data can de downloaded as csv and as json file. Evaluation of the synthetic data according to the above metrics can be downloaded as a evaluation report in pdf.
186186

187-
#### Documentatie
188-
Meer documentatie over de tool en onderliggende SDG methoden kunnen worden gevonden op <a href="https://github.com/NGO-Algorithm-Audit/python-synhtpop" target="_blank">Github</a>.
187+
#### Documentation
188+
More documentation about the tool and underlying SDG methods can be found on <a href="https://github.com/NGO-Algorithm-Audit/python-synhtpop" target="_blank">Github</a>.
189189

190190
{{< container_close >}}
191191

content/nederlands/technical-tools/SDG.md

Lines changed: 6 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -99,17 +99,17 @@ Probeer de tool hieronder uit ⬇️
9999

100100

101101

102-
<!-- Technische details -->
102+
<!-- Technische introductie -->
103103

104-
{{< container_open isAccordion="true" title="Technische details – Synthetische data generatie" id="technical-introduction" >}}
104+
{{< container_open isAccordion="true" title="Technische introductie – Synthetische data generatie" id="technical-introduction" >}}
105105

106106
<br>
107107

108108
De tool voor synthetische data generatie doorloopt de volgende stappen:
109109

110110
#### Benodigdheden van de gebruiker:
111111
De gebruiker dient de volgende aspecten voor te bereiden:
112-
- <span style="color:#005AA7">Dataset:</span> Moet bestaan uit categorische, numerieke of tijdsdata.
112+
- <span style="color:#005AA7">Dataset:</span> Alleen categorische, numerieke of tijdsdata kunnen worden verwerkt. Datasets mogen maximaal 8 kolommen bevatten, dienen een header te hebben met kolomnamen en hoeven geen index-kolom te hebben.
113113
- <span style="color:#005AA7">Methode:</span> Standaard wordt de CART-methode gebruikt om synthetische data te genereren. CART levert doorgaans synthetische data van hoge kwaliteit, maar werkt mogelijk minder goed bij datasets met categorische variabelen met meer dan 20 categorieën. Gebruik in dat geval Gaussian Copula.
114114
- <span style="color:#005AA7">Aantal synthetische datapunten:</span> Aantal synthetische datapunten die door de tool worden gegenereerd. Vanwege de rekencapaciteit van browser-gebaseerde datageneratie is het maximum ingesteld op 5.000.
115115

@@ -182,17 +182,16 @@ De *onthullings beschermings metriek* meet het aandeel synthetische datapunten d
182182
##### Step 5. Download:
183183
De gegenereerde synthetische data kan worden gedownload als csv- en json-bestand. De evaluatie volgens bovenstaande metrics kan als evaluatierapport in pdf worden gedownload.
184184

185-
186-
#### Documentation
187-
More documentation about the tool and underlying SDG methods can be found on <a href="https://github.com/NGO-Algorithm-Audit/python-synhtpop" target="_blank">Github</a>.
185+
#### Documentatie
186+
Meer documentatie over de tool en onderliggende SDG methoden kunnen worden gevonden op <a href="https://github.com/NGO-Algorithm-Audit/python-synhtpop" target="_blank">Github</a>.
188187

189188
{{< container_close >}}
190189

191190

192191

193192
<!-- Web app -->
194193

195-
{{< iframe src="https://local-first-bias-detection.s3.eu-central-1.amazonaws.com/synthetic-data.html?lang=nl" title="Synthetic data generation tool" icon="fas fa-search" id="web-app" height="800px" >}}
194+
{{< iframe src="https://local-first-bias-detection.s3.eu-central-1.amazonaws.com/synthetic-data.html?lang=nl" title="Synthetische data generatie tool" icon="fas fa-search" id="web-app" height="800px" >}}
196195

197196

198197

0 commit comments

Comments
 (0)