Commit d845239

committed
column edits
1 parent a663958 commit d845239

1 file changed

+21
-20
lines changed

articles/machine-learning/concept-sourcing-human-data.md

Lines changed: 21 additions & 20 deletions
Original file line number | Diff line number | Diff line change
@@ -28,21 +28,22 @@ These are emerging practices, and we are continually learning. The best practice
2828
We suggest the following best practices for manually collecting human data directly from people.
2929

3030
:::row:::
31-
:::column span="4":::
31+
:::column span="":::
3232
**Best Practice**
3333
:::column-end:::
34-
:::column span="4":::
34+
:::column span="":::
3535
**Why?**
3636
:::column-end:::
3737
:::row-end:::
38+
3839
-----
3940

4041
:::row:::
41-
:::column span="4":::
42+
:::column span="":::
4243
**Obtain voluntary informed consent.**
4344
:::column-end:::
4445

45-
:::column span="4":::
46+
:::column span="":::
4647
- Participants should understand and consent to data collection and how their data will be used.
4748
- Data should only be stored, processed, and used for purposes that are part of the original documented informed consent.
4849
- Consent documentation should be properly stored and associated with the collected data.
@@ -52,11 +53,11 @@ We suggest the following best practices for manually collecting human data direc
5253
-----
5354

5455
:::row:::
55-
:::column span="4":::
56+
:::column span="":::
5657
**Compensate data contributors appropriately.**
5758
:::column-end:::
5859

59-
:::column span="4":::
60+
:::column span="":::
6061
- Data contributors should not be pressured or coerced into data collections and should be fairly compensated for their time and data.
6162
- Inappropriate compensation can be exploitative or coercive.
6263
:::column-end:::
@@ -66,11 +67,11 @@ We suggest the following best practices for manually collecting human data direc
6667
-----
6768

6869
:::row:::
69-
:::column span="4":::
70+
:::column span="":::
7071
**Let contributors self-identify demographic information.**
7172
:::column-end:::
7273

73-
:::column span="4":::
74+
:::column span="":::
7475
- Demographic information that is not self-reported by data contributors but assigned by data collectors may 1) result in inaccurate metadata and 2) be disrespectful to data contributors.
7576
:::column-end:::
7677

@@ -79,11 +80,11 @@ We suggest the following best practices for manually collecting human data direc
7980
-----
8081

8182
:::row:::
82-
:::column span="4":::
83+
:::column span="":::
8384
**Anticipate harms when recruiting vulnerable groups.**
8485
:::column-end:::
8586

86-
:::column span="4":::
87+
:::column span="":::
8788
- Collecting data from vulnerable population groups introduces risk to data contributors and your organization.
8889
:::column-end:::
8990

@@ -92,11 +93,11 @@ We suggest the following best practices for manually collecting human data direc
9293
-----
9394

9495
:::row:::
95-
:::column span="4":::
96+
:::column span="":::
9697
**Treat data contributors with respect.**
9798
:::column-end:::
9899

99-
:::column span="4":::
100+
:::column span="":::
100101
- Improper interactions with data contributors at any phase of the data collection can negatively impact data quality, as well as the overall data collection experience for data contributors and data collectors.
101102
:::column-end:::
102103

@@ -105,11 +106,11 @@ We suggest the following best practices for manually collecting human data direc
105106
-----
106107

107108
:::row:::
108-
:::column span="4":::
109+
:::column span="":::
109110
**Qualify external suppliers carefully.**
110111
:::column-end:::
111112

112-
:::column span="4":::
113+
:::column span="":::
113114
- Data collections with unqualified suppliers may result in low-quality data, poor data management, unprofessional practices, and potentially harmful outcomes for data contributors and data collectors (including violations of human rights).
114115
- Annotation or labeling work (e.g., audio transcription, image tagging) with unqualified suppliers may result in low-quality or biased datasets, insecure data management, unprofessional practices, and potentially harmful outcomes for data contributors (including violations of human rights).
115116
:::column-end:::
@@ -119,11 +120,11 @@ We suggest the following best practices for manually collecting human data direc
119120
-----
120121

121122
:::row:::
122-
:::column span="4":::
123+
:::column span="":::
123124
**Communicate expectations clearly in the Statement of Work (SOW) with suppliers.**
124125
:::column-end:::
125126

126-
:::column span="4":::
127+
:::column span="":::
127128
- An SOW that lacks requirements for responsible data collection work may result in low-quality or poorly collected data.
128129
:::column-end:::
129130

@@ -132,11 +133,11 @@ We suggest the following best practices for manually collecting human data direc
132133
-----
133134

134135
:::row:::
135-
:::column span="4":::
136+
:::column span="":::
136137
**Qualify geographies carefully.**
137138
:::column-end:::
138139

139-
:::column span="4":::
140+
:::column span="":::
140141
- When applicable, collecting data in restricted and/or unfamiliar geographies may result in unusable or low-quality data and may impact the safety of involved parties.
141142
:::column-end:::
142143

@@ -145,11 +146,11 @@ We suggest the following best practices for manually collecting human data direc
145146
-----
146147

147148
:::row:::
148-
:::column span="4":::
149+
:::column span="":::
149150
**Be a good steward of your datasets.**
150151
:::column-end:::
151152

152-
:::column span="4":::
153+
:::column span="":::
153154
- Improper data management and poor documentation can result in data misuse.
154155
:::column-end:::
155156
