Skip to content

Commit a663958

Browse files
committed
column edits
1 parent 27f051a commit a663958

File tree

1 file changed

+20
-22
lines changed

1 file changed

+20
-22
lines changed

articles/machine-learning/concept-sourcing-human-data.md

Lines changed: 20 additions & 22 deletions
Original file line numberDiff line numberDiff line change
@@ -27,24 +27,22 @@ These are emerging practices, and we are continually learning. The best practice
2727

2828
We suggest the following best practices for manually collecting human data directly from people.
2929

30-
:::column span="4":::
31-
3230
:::row:::
33-
:::column:::
31+
:::column span="4":::
3432
**Best Practice**
3533
:::column-end:::
36-
:::column:::
34+
:::column span="4":::
3735
**Why?**
3836
:::column-end:::
3937
:::row-end:::
4038
-----
4139

4240
:::row:::
43-
:::column:::
41+
:::column span="4":::
4442
**Obtain voluntary informed consent.**
4543
:::column-end:::
4644

47-
:::column:::
45+
:::column span="4":::
4846
- Participants should understand and consent to data collection and how their data will be used.
4947
- Data should only be stored, processed, and used for purposes that are part of the original documented informed consent.
5048
- Consent documentation should be properly stored and associated with the collected data.
@@ -54,11 +52,11 @@ We suggest the following best practices for manually collecting human data direc
5452
-----
5553

5654
:::row:::
57-
:::column:::
55+
:::column span="4":::
5856
**Compensate data contributors appropriately.**
5957
:::column-end:::
6058

61-
:::column:::
59+
:::column span="4":::
6260
- Data contributors should not be pressured or coerced into data collections and should be fairly compensated for their time and data.
6361
- Inappropriate compensation can be exploitative or coercive.
6462
:::column-end:::
@@ -68,11 +66,11 @@ We suggest the following best practices for manually collecting human data direc
6866
-----
6967

7068
:::row:::
71-
:::column:::
69+
:::column span="4":::
7270
**Let contributors self-identify demographic information.**
7371
:::column-end:::
7472

75-
:::column:::
73+
:::column span="4":::
7674
- Demographic information that is not self-reported by data contributors but assigned by data collectors may 1) result in inaccurate metadata and 2) be disrespectful to data contributors.
7775
:::column-end:::
7876

@@ -81,11 +79,11 @@ We suggest the following best practices for manually collecting human data direc
8179
-----
8280

8381
:::row:::
84-
:::column:::
82+
:::column span="4":::
8583
**Anticipate harms when recruiting vulnerable groups.**
8684
:::column-end:::
8785

88-
:::column:::
86+
:::column span="4":::
8987
- Collecting data from vulnerable population groups introduces risk to data contributors and your organization.
9088
:::column-end:::
9189

@@ -94,11 +92,11 @@ We suggest the following best practices for manually collecting human data direc
9492
-----
9593

9694
:::row:::
97-
:::column:::
95+
:::column span="4":::
9896
**Treat data contributors with respect.**
9997
:::column-end:::
10098

101-
:::column:::
99+
:::column span="4":::
102100
- Improper interactions with data contributors at any phase of the data collection can negatively impact data quality, as well as the overall data collection experience for data contributors and data collectors.
103101
:::column-end:::
104102

@@ -107,11 +105,11 @@ We suggest the following best practices for manually collecting human data direc
107105
-----
108106

109107
:::row:::
110-
:::column:::
108+
:::column span="4":::
111109
**Qualify external suppliers carefully.**
112110
:::column-end:::
113111

114-
:::column:::
112+
:::column span="4":::
115113
- Data collections with unqualified suppliers may result in low quality data, poor data management, unprofessional practices, and potentially harmful outcomes for data contributors and data collectors (including violations of human rights).
116114
- Annotation or labeling work (e.g., audio transcription, image tagging) with unqualified suppliers may result in low quality or biased datasets, insecure data management, unprofessional practices, and potentially harmful outcomes for data contributors (including violations of human rights).
117115
:::column-end:::
@@ -121,11 +119,11 @@ We suggest the following best practices for manually collecting human data direc
121119
-----
122120

123121
:::row:::
124-
:::column:::
122+
:::column span="4":::
125123
**Communicate expectations clearly in the Statement of Work (SOW) with suppliers.**
126124
:::column-end:::
127125

128-
:::column:::
126+
:::column span="4":::
129127
- An SOW which lacks requirements for responsible data collection work may result in low-quality or poorly collected data.
130128
:::column-end:::
131129

@@ -134,11 +132,11 @@ We suggest the following best practices for manually collecting human data direc
134132
-----
135133

136134
:::row:::
137-
:::column:::
135+
:::column span="4":::
138136
**Qualify geographies carefully.**
139137
:::column-end:::
140138

141-
:::column:::
139+
:::column span="4":::
142140
- When applicable, collecting data in restricted and/or unfamiliar geographies may result in unusable or low-quality data and may impact the safety of involved parties.
143141
:::column-end:::
144142

@@ -147,11 +145,11 @@ We suggest the following best practices for manually collecting human data direc
147145
-----
148146

149147
:::row:::
150-
:::column:::
148+
:::column span="4":::
151149
**Be a good steward of your datasets.**
152150
:::column-end:::
153151

154-
:::column:::
152+
:::column span="4":::
155153
- Improper data management and poor documentation can result in data misuse.
156154
:::column-end:::
157155

0 commit comments

Comments
 (0)