You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: src/data/table1.ts
+4-4Lines changed: 4 additions & 4 deletions
Original file line number
Diff line number
Diff line change
@@ -2,7 +2,7 @@ import {renderUrlCell} from "../utils/renderUrlCell";
2
2
3
3
exportconstTABLE_1=[
4
4
{
5
-
"id": "Mr1",
5
+
"id": "MR1",
6
6
"Phase Category": "Requirement definition",
7
7
"Subcategory": "Metric selection",
8
8
"Description": "The goal of the model that is going to be trained should be considered when selecting the most appropriate metrics for assessing the model performance.",
@@ -13,7 +13,7 @@ export const TABLE_1 = [
13
13
"External URL(S) in post(s)-link": "Imbalanced learn-Sklearn"
14
14
},
15
15
{
16
-
"id": "Mr2",
16
+
"id": "MR2",
17
17
"Phase Category": "Requirement definition",
18
18
"Subcategory": "Retraining model",
19
19
"Description": "It is important to identify the needs/requirements of model retraining.",
@@ -23,7 +23,7 @@ export const TABLE_1 = [
23
23
"STE Post(s)-link": "STO/56859324",
24
24
},
25
25
{
26
-
"id": "Mr3",
26
+
"id": "MR3",
27
27
"Phase Category": "Requirement definition",
28
28
"Subcategory": "External services",
29
29
"Description": "If an ML model is published as a cloud service, specifically, when a classification/prediction task uses ML cloud-based services, it is important to define the use case and the model requirements in order to identify how frequently the service should be invoked.",
@@ -33,7 +33,7 @@ export const TABLE_1 = [
33
33
"STE Post(s)-link": "STO/56859324",
34
34
},
35
35
{
36
-
"id": "Mr4",
36
+
"id": "MR4",
37
37
"Phase Category": "Requirement definition",
38
38
"Subcategory": "Probabilistic model",
39
39
"Description": "When using probabilistic forecasting in a decision system, it is necessary to decouple the probabilistic model optimization from the probability threshold selection.",
Copy file name to clipboardExpand all lines: src/data/table10.ts
+8-8Lines changed: 8 additions & 8 deletions
Original file line number
Diff line number
Diff line change
@@ -2,7 +2,7 @@ import {renderUrlCell} from "../utils/renderUrlCell";
2
2
3
3
exportconstTABLE_10=[
4
4
{
5
-
"id": "Me1",
5
+
"id": "ME1",
6
6
"Phase Category": "Validation",
7
7
"Subcategory": "Aspects to consider when validating a model",
8
8
"Description": "Use adversarial inputs for testing a model in order to ensure robustness.",
@@ -14,7 +14,7 @@ export const TABLE_10 = [
14
14
"References to practice terminology-link": "",
15
15
},
16
16
{
17
-
"id": "Me2",
17
+
"id": "ME2",
18
18
"Phase Category": "Validation",
19
19
"Subcategory": "Hyper parameter tuning",
20
20
"Description": "After the best hyper-parameters are found, the model with those \“optimum\" hyper-parameters should be tested in the set built for testing.",
@@ -26,7 +26,7 @@ export const TABLE_10 = [
26
26
"References to practice terminology-link": "",
27
27
},
28
28
{
29
-
"id": "Me3",
29
+
"id": "ME3",
30
30
"Phase Category": "Validation",
31
31
"Subcategory": "Aspects to consider when validating a model",
32
32
"Description": "If a superset vocabulary was built for a natural language processing {NLP} task, then the model bias should be checked.",
@@ -38,7 +38,7 @@ export const TABLE_10 = [
38
38
"References to practice terminology-link": "",
39
39
},
40
40
{
41
-
"id": "Me4",
41
+
"id": "ME4",
42
42
"Phase Category": "Validation",
43
43
"Subcategory": "Aspects to consider when validating a model",
44
44
"Description": "The comparison between models to identify the best model should be made in the test set.",
@@ -50,7 +50,7 @@ export const TABLE_10 = [
50
50
"References to practice terminology-link": "",
51
51
},
52
52
{
53
-
"id": "Me5",
53
+
"id": "ME5",
54
54
"Phase Category": "Validation",
55
55
"Subcategory": "Unit testing",
56
56
"Description": "Previously annotated data should be used when testing machine learning models with unit tests.",
@@ -62,7 +62,7 @@ export const TABLE_10 = [
62
62
"References to practice terminology-link": "",
63
63
},
64
64
{
65
-
"id": "Me6",
65
+
"id": "ME6",
66
66
"Phase Category": "Validation",
67
67
"Subcategory": "Aspects to consider when validating a model",
68
68
"Description": "The performance and the time required to train a model should be taken into account when comparing models",
@@ -75,7 +75,7 @@ export const TABLE_10 = [
75
75
"References to practice terminology-link": "",
76
76
},
77
77
{
78
-
"id": "Me7",
78
+
"id": "ME7",
79
79
"Phase Category": "Validation",
80
80
"Subcategory": "Aspects to consider when validating a model",
81
81
"Description": "The membership of the testing data to the training data should be tested.",
@@ -87,7 +87,7 @@ export const TABLE_10 = [
87
87
"References to practice terminology-link": "",
88
88
},
89
89
{
90
-
"id": "Me8",
90
+
"id": "ME8",
91
91
"Phase Category": "Validation",
92
92
"Subcategory": "Aspects to consider when validating a model",
93
93
"Description": "When cross-validation is used for testing, then the hold-out set should not be used.",
Copy file name to clipboardExpand all lines: src/data/table12.ts
+2-2Lines changed: 2 additions & 2 deletions
Original file line number
Diff line number
Diff line change
@@ -2,7 +2,7 @@ import {renderUrlCell} from "../utils/renderUrlCell";
2
2
3
3
exportconstTABLE_12=[
4
4
{
5
-
"id": "Mm1",
5
+
"id": "MM1",
6
6
"Phase Category": "Monitoring",
7
7
"Subcategory": "Be aware of model performance and new data",
8
8
"Description": "After deployment, the new data that will serve as input for the model should be constantly monitored to detect any deviation from the original data.",
@@ -14,7 +14,7 @@ export const TABLE_12 = [
14
14
"References to practice terminology-link": "Model degradation (Mauri & Damiani), Model degradation (Adam et al.)",
15
15
},
16
16
{
17
-
"id": "Mm2",
17
+
"id": "MM2",
18
18
"Phase Category": "Monitoring",
19
19
"Subcategory": "Be aware of model performance and new data",
20
20
"Description": "When monitoring the data distribution, if it deviates from the original one used for training and tuning models, then the model should be retrained in order to avoid degradation.",
Copy file name to clipboardExpand all lines: src/data/table13.ts
+11-11Lines changed: 11 additions & 11 deletions
Original file line number
Diff line number
Diff line change
@@ -2,7 +2,7 @@ import {renderUrlCell} from "../utils/renderUrlCell";
2
2
3
3
exportconstTABLE_13=[
4
4
{
5
-
"id": "I1",
5
+
"id": "CC1",
6
6
"Phase Category": "Implementation",
7
7
"Subcategory": "Reproducibility/replicability",
8
8
"Description": "For the data cleaning procedure stage, routines should be used to avoid implementing everything from scratch.",
@@ -14,7 +14,7 @@ export const TABLE_13 = [
14
14
"References to practice terminology-link": "",
15
15
},
16
16
{
17
-
"id": "I2",
17
+
"id": "CC2",
18
18
"Phase Category": "Implementation",
19
19
"Subcategory": "Reproducibility/replicability",
20
20
"Description": "Across the ML stages, pipelines should be used to automate processes and save time in complex tasks.",
@@ -27,7 +27,7 @@ export const TABLE_13 = [
27
27
"References to practice terminology-link": "",
28
28
},
29
29
{
30
-
"id": "I3",
30
+
"id": "CC3",
31
31
"Phase Category": "Implementation",
32
32
"Subcategory": "Reproducibility/replicability",
33
33
"Description": "In order to enable reproducibility in data pre-processing, pipelines should be used and exported.",
@@ -39,7 +39,7 @@ export const TABLE_13 = [
39
39
"References to practice terminology-link": "",
40
40
},
41
41
{
42
-
"id": "I4",
42
+
"id": "CC4",
43
43
"Phase Category": "Implementation",
44
44
"Subcategory": "Documentation/traceability",
45
45
"Description": "For each trained model, a separate file should be used in order to keep track of all the possible experiments.",
@@ -51,7 +51,7 @@ export const TABLE_13 = [
51
51
"References to practice terminology-link": "",
52
52
},
53
53
{
54
-
"id": "I5",
54
+
"id": "CC5",
55
55
"Phase Category": "Implementation",
56
56
"Subcategory": "Documentation/traceability",
57
57
"Description": "The (hyper-)parameters used in the training process should be documented.",
@@ -63,7 +63,7 @@ export const TABLE_13 = [
63
63
"References to practice terminology-link": "",
64
64
},
65
65
{
66
-
"id": "I6",
66
+
"id": "CC6",
67
67
"Phase Category": "Implementation",
68
68
"Subcategory": "Consistency/Integrity",
69
69
"Description": "Once a model is deployed and a petition of data deletion is made, the data should be deleted across the entire pipeline.",
@@ -76,7 +76,7 @@ export const TABLE_13 = [
76
76
"References to practice terminology-link": "",
77
77
},
78
78
{
79
-
"id": "I7",
79
+
"id": "CC7",
80
80
"Phase Category": "Implementation",
81
81
"Subcategory": "Resources usage",
82
82
"Description": "When dealing with large datasets or large files, aspects such as parallel executions, GPU usage, and input/output efficiency should be taken into account.",
@@ -89,7 +89,7 @@ export const TABLE_13 = [
89
89
"References to practice terminology-link": "",
90
90
},
91
91
{
92
-
"id": "I8",
92
+
"id": "CC8",
93
93
"Phase Category": "Implementation",
94
94
"Subcategory": "Resources usage",
95
95
"Description": "In order to optimize, parallel execution should be used if it supported by the algorithms.",
@@ -102,7 +102,7 @@ export const TABLE_13 = [
102
102
"References to practice terminology-link": "",
103
103
},
104
104
{
105
-
"id": "I9",
105
+
"id": "CC9",
106
106
"Phase Category": "Implementation",
107
107
"Subcategory": "Resources usage",
108
108
"Description": "It should be verified if both, model and dataset, fit in memory.",
@@ -115,7 +115,7 @@ export const TABLE_13 = [
115
115
"References to practice terminology-link": "",
116
116
},
117
117
{
118
-
"id": "I10",
118
+
"id": "CC10",
119
119
"Phase Category": "Implementation",
120
120
"Subcategory": "Resources usage",
121
121
"Description": "When dealing with large corpus, in NLP-related tasks, sparse structures should be used to improve the implementation performance.",
@@ -127,7 +127,7 @@ export const TABLE_13 = [
127
127
"References to practice terminology-link": "",
128
128
},
129
129
{
130
-
"id": "I11",
130
+
"id": "CC11",
131
131
"Phase Category": "Implementation",
132
132
"Subcategory": "Resources usage",
133
133
"Description": "When dealing with large data, resource-aware implementations should be used.",
Copy file name to clipboardExpand all lines: src/data/table2.ts
+7-7Lines changed: 7 additions & 7 deletions
Original file line number
Diff line number
Diff line change
@@ -2,7 +2,7 @@ import {renderUrlCell} from "../utils/renderUrlCell";
2
2
3
3
exportconstTABLE_2=[
4
4
{
5
-
"id": "Dc1",
5
+
"id": "DC1",
6
6
"Phase Category": "Data",
7
7
"Subcategory": "Dataset construction",
8
8
"Description": "The distribution of the training data should reflect the real distribution.",
@@ -16,7 +16,7 @@ export const TABLE_2 = [
16
16
17
17
},
18
18
{
19
-
"id": "Dc2",
19
+
"id": "DC2",
20
20
"Phase Category": "Data",
21
21
"Subcategory": "Dataset construction",
22
22
"Description": "If a model is expected to detect \"something\" in an image, then representative examples of that \"something\" should be present in the training and testing data.",
@@ -29,7 +29,7 @@ export const TABLE_2 = [
29
29
30
30
},
31
31
{
32
-
"id": "Dc3",
32
+
"id": "DC3",
33
33
"Phase Category": "Data",
34
34
"Subcategory": "Ensure minimum size and how to measure the size",
35
35
"Description": "When measuring a dataset size it should not be done only by referring to storage space but also in terms of rows and columns.",
@@ -42,7 +42,7 @@ export const TABLE_2 = [
42
42
"References to practice terminology-link": "Compute models complexity- Surana, Time complexity analysis- Lee & Chen, Complexity NN- Bianchini & Scarselli",
43
43
},
44
44
{
45
-
"id": "Dc4",
45
+
"id": "DC4",
46
46
"Phase Category": "Data",
47
47
"Subcategory": "Dataset construction",
48
48
"Description": "If it is required to augment the number of instances in the negative class, preexisting datasets could be used for including more instances in the dataset.",
@@ -55,7 +55,7 @@ export const TABLE_2 = [
55
55
"References to practice terminology-link": "For example: RICO dataset",
56
56
},
57
57
{
58
-
"id": "Dc5",
58
+
"id": "DC5",
59
59
"Phase Category": "Data",
60
60
"Subcategory": "Dataset construction",
61
61
"Description": "The images that are going to be used as instances of the negative class should have some common characteristics with the positive ones.",
@@ -68,7 +68,7 @@ export const TABLE_2 = [
68
68
69
69
},
70
70
{
71
-
"id": "Dc6",
71
+
"id": "DC6",
72
72
"Phase Category": "Data",
73
73
"Subcategory": "Dataset construction",
74
74
"Description": "The minimum size of the object that is going to be detected should be present in the data that is going to be used for training the model.",
@@ -80,7 +80,7 @@ export const TABLE_2 = [
80
80
81
81
},
82
82
{
83
-
"id": "Dc7",
83
+
"id": "DC7",
84
84
"Phase Category": "Data",
85
85
"Subcategory": "Dataset construction",
86
86
"Description": "The object region of interest (ROI) should have a similar aspect ratio in all the positive images.",
0 commit comments