You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/04_process_discovery.md
+11-10Lines changed: 11 additions & 10 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -8,6 +8,8 @@ order of events/activities that are executed during a process execution.
8
8
In the following, we made up an overview to visualize the advantages and disadvantages of
9
9
the
10
10
mining algorithms.
11
+
12
+
11
13
|Alpha|Alpha+|Heuristic|Inductive|
12
14
|---|---|---|---|
13
15
|Cannot handle loops of length one and length two|Can handle loops of length one and length two|Takes frequency into account|Can handle invisible tasks|
@@ -165,6 +167,8 @@ if __name__ == "__main__":
165
167
```
166
168
167
169
170
+
171
+
168
172
|Parameter name|Meaning|
169
173
|---|---|
170
174
|dependency_threshold|dependency threshold of the Heuristics Miner (default: 0.5)|
@@ -400,17 +404,12 @@ if __name__ == "__main__":
400
404
Visualizing the DFGs, we can say that the correlation miner was able to discover a visualization
401
405
where the main path is clear.
402
406
Different variants of the correlation miner are available:
403
-
|Variants.CLASSIC|Calculates the P/S matrix and the duration matrix in the classic way (the entire list of
404
-
events is used)|
407
+
408
+
409
+
|Variants.CLASSIC|Calculates the P/S matrix and the duration matrix in the classic way (the entire list of events is used)|
405
410
|---|---|
406
-
|Variants.TRACE_BASED|Calculates the P/S matrix and the duration matrix on a classic event log,
407
-
trace-by-trace, and merges the results. The resolution of the linear problem permits to
408
-
obtain a model that is more understandable than the classic DFG calculated on top of the
409
-
log.|
410
-
|Variants.CLASSIC_SPLIT|Calculates the P/S matrix and the duration matrix on the entire list of events, as in
411
-
the classic version, but splits that in chunks to fasten the computation. Hence, the
412
-
generated model is less accurate (in comparison to the CLASSIC version) but the
413
-
calculation is faster. The default chunk size is 100000 events.|
411
+
|Variants.TRACE_BASED|Calculates the P/S matrix and the duration matrix on a classic event log, trace-by-trace, and merges the results. The resolution of the linear problem permits to obtain a model that is more understandable than the classic DFG calculated on top of the log.|
412
+
|Variants.CLASSIC_SPLIT|Calculates the P/S matrix and the duration matrix on the entire list of events, as in the classic version, but splits that in chunks to fasten the computation. Hence, the generated model is less accurate (in comparison to the CLASSIC version) but the calculation is faster. The default chunk size is 100000 events.|
414
413
415
414
416
415
@@ -443,6 +442,8 @@ if __name__ == "__main__":
443
442
Some parameters can be used in order to customize the execution of the temporal profile:
444
443
See Parameters
445
444
445
+
446
+
446
447
|Parameter Key|Type|Default|Description|
447
448
|---|---|---|---|
448
449
|Parameters.ACTIVITY_KEY|string|concept:name|The attribute to use as activity.|
Copy file name to clipboardExpand all lines: docs/06_conformance_checking.md
+6-14Lines changed: 6 additions & 14 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1035,6 +1035,8 @@ if __name__ == "__main__":
1035
1035
Some parameters can be used in order to customize the conformance checking of the temporal profile:
1036
1036
See Parameters
1037
1037
1038
+
1039
+
1038
1040
|Parameter Key|Type|Default|Description|
1039
1041
|---|---|---|---|
1040
1042
|Parameters.ACTIVITY_KEY|string|concept:name|The attribute to use as activity.|
@@ -1064,22 +1066,12 @@ execution is repeated (that means rework) from different people.
1064
1066
The verification of LTL rules requires the insertion of the required parameters
1065
1067
(of the specific rule). Hence, this form of conformance checking is not automatic.
1066
1068
The LTL rules that are implemented in pm4py are found in the following table:
1069
+
1070
+
1067
1071
|LTL rule|Description|
1068
1072
|---|---|
1069
-
|ltl.ltl_checker.four_eyes_principle(log, A, B)|Applies the four eyes principle on the activities A and B.
1070
-
Parameters:
1071
-
log: event log
1072
-
A: the activity A of the rule (an activity of the log)
1073
-
B: the activity B of the rule (an activity of the log)
1074
-
Returns:
1075
-
Filtered log object (containing the cases which have A and B done by the same person)|
1076
-
|ltl.ltl_checker.attr_value_different_persons(log, A)|Finds the process executions in which the activity A is repeated by
1077
-
different people.
1078
-
Parameters:
1079
-
log: event log
1080
-
A: the activity A of the rule (an activity of the log)
1081
-
Returns:
1082
-
Filtered log object (containing the cases which have A repeated by different people)|
1073
+
|ltl.ltl_checker.four_eyes_principle(log, A, B)|Applies the four eyes principle on the activities A and B. Parameters: log: event log A: the activity A of the rule (an activity of the log) B: the activity B of the rule (an activity of the log) Returns: Filtered log object (containing the cases which have A and B done by the same person)|
1074
+
|ltl.ltl_checker.attr_value_different_persons(log, A)|Finds the process executions in which the activity A is repeated by different people. Parameters: log: event log A: the activity A of the rule (an activity of the log) Returns: Filtered log object (containing the cases which have A repeated by different people)|
Copy file name to clipboardExpand all lines: docs/07_process_trees.md
+5-9Lines changed: 5 additions & 9 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -56,6 +56,8 @@ if __name__ == "__main__":
56
56
57
57
Suppose the following start activity and their respective occurrences.
58
58
59
+
60
+
59
61
|Parameter|Meaning|
60
62
|---|---|
61
63
|MODE|most frequent number of visible activities (default 20)|
@@ -66,19 +68,13 @@ Suppose the following start activity and their respective occurrences.
66
68
|PARALLEL|probability to add a parallel operator to tree (default 0.25)|
67
69
|LOOP|probability to add a loop operator to tree (default 0.25)|
68
70
|OR|probability to add an or operator to tree (default 0)|
69
-
|SILENT|probability to add silent activity to a choice or loop operator
70
-
(default 0.25)|
71
+
|SILENT|probability to add silent activity to a choice or loop operator (default 0.25)|
71
72
|DUPLICATE|probability to duplicate an activity label (default 0)|
72
73
|LT_DEPENDENCY|probability to add a random dependency to the tree (default 0)|
73
74
|INFREQUENT|probability to make a choice have infrequent paths (default 0.25)|
74
75
|NO_MODELS|number of trees to generate from model population (default 10)|
75
-
|UNFOLD|whether or not to unfold loops in order to include choices
76
-
underneath in dependencies: 0=False, 1=True
77
-
if lt_dependency <= 0: this should always be 0 (False)
78
-
if lt_dependency > 0: this can be 1 or 0 (True or False) (default
79
-
10)|
80
-
|MAX_REPEAT|maximum number of repetitions of a loop (only used when unfolding is
81
-
True) (default 10)|
76
+
|UNFOLD|whether or not to unfold loops in order to include choices underneath in dependencies: 0=False, 1=True if lt_dependency <= 0: this should always be 0 (False) if lt_dependency > 0: this can be 1 or 0 (True or False) (default 10)|
77
+
|MAX_REPEAT|maximum number of repetitions of a loop (only used when unfolding is True) (default 10)|
Copy file name to clipboardExpand all lines: docs/08_feature_selection.md
+6-10Lines changed: 6 additions & 10 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -114,18 +114,14 @@ log_to_features.apply
114
114
.
115
115
The types of features that can be considered by a manual feature selection are:
116
116
117
-
|str_ev_attr|String attributes at the event level: these are hot-encoded into features that may
118
-
assume value 0 or value 1.|
117
+
118
+
119
+
|str_ev_attr|String attributes at the event level: these are hot-encoded into features that may assume value 0 or value 1.|
119
120
|---|---|
120
-
|str_tr_attr|String attributes at the trace level: these are hot-encoded into features that may
121
-
assume value 0 or value 1.|
122
-
|num_ev_attr|Numeric attributes at the event level: these are encoded by including the last value of
123
-
the attribute among the events of the trace.|
121
+
|str_tr_attr|String attributes at the trace level: these are hot-encoded into features that may assume value 0 or value 1.|
122
+
|num_ev_attr|Numeric attributes at the event level: these are encoded by including the last value of the attribute among the events of the trace.|
124
123
|num_tr_attr|Numeric attributes at trace level: these are encoded by including the numerical value.|
125
-
|str_evsucc_attr|Successions related to the string attributes values at the event level: for example, if
126
-
we have a trace [A,B,C], it might be important to include not only the presence of the
127
-
single values A, B and C as features; but also the presence of the directly-follows
128
-
couples (A,B) and (B,C).|
124
+
|str_evsucc_attr|Successions related to the string attributes values at the event level: for example, if we have a trace [A,B,C], it might be important to include not only the presence of the single values A, B and C as features; but also the presence of the directly-follows couples (A,B) and (B,C).|
Copy file name to clipboardExpand all lines: docs/11_simulation.md
+25-34Lines changed: 25 additions & 34 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -14,19 +14,20 @@ been provided by the user.
14
14
A playout of a Petri net takes as input a Petri net along with an initial marking,
15
15
and returns a list of process executions that are allowed from the process model.
16
16
We offer different types of playouts:
17
-
|Variants.BASIC_PLAYOUT|A basic playout that accepts a Petri net along with an initial marking, and returns a
18
-
specified number of process executions (repetitions may be possible).|
17
+
18
+
19
+
|Variants.BASIC_PLAYOUT|A basic playout that accepts a Petri net along with an initial marking, and returns a specified number of process executions (repetitions may be possible).|
19
20
|---|---|
20
-
|Variants.EXTENSIVE|A playout that accepts a Petri net along with an initial marking, and returns all the
21
-
executions that are possible according to the model, up to a provided
22
-
length of trace (may be computationally expensive).|
21
+
|Variants.EXTENSIVE|A playout that accepts a Petri net along with an initial marking, and returns all the executions that are possible according to the model, up to a provided length of trace (may be computationally expensive).|
23
22
24
23
25
24
26
25
The list of parameters for such variants are:
27
26
28
27
Inspect parameters
29
28
29
+
30
+
30
31
|Variants.BASIC_PLAYOUT|Parameters.ACTIVITY_KEY|The name of the attribute to use as activity in the playout log.|
31
32
|---|---|---|
32
33
||Parameters.TIMESTAMP_KEY|The name of the attribute to use as timestamp in the playout log.|
@@ -145,6 +146,8 @@ if __name__ == "__main__":
145
146
146
147
During the replay operation, some debug messages are written to the screen. The main outputs of
147
148
the simulation process are:
149
+
150
+
148
151
|simulated_log|The traces that have been simulated during the simulation.|
149
152
|---|---|
150
153
|res|The result of the simulation (Python dictionary).|
@@ -157,18 +160,15 @@ res
157
160
158
161
Inspect outputs
159
162
160
-
|places_interval_trees|an interval tree for each place, that hosts an interval for each time when it was
161
-
“full” according to the specified maximum amount of tokens per place.|
163
+
164
+
165
+
|places_interval_trees|an interval tree for each place, that hosts an interval for each time when it was “full” according to the specified maximum amount of tokens per place.|
162
166
|---|---|
163
-
|transitions_interval_trees|an interval tree for each transition, that contains all the time intervals in which
164
-
the transition was enabled but not yet fired (so, the time between a transition was
165
-
fully enabled and the consumption of the tokens from the input places)|
167
+
|transitions_interval_trees|an interval tree for each transition, that contains all the time intervals in which the transition was enabled but not yet fired (so, the time between a transition was fully enabled and the consumption of the tokens from the input places)|
166
168
|cases_ex_time|a list containing the throughput times for all the cases of the log|
167
169
|median_cases_ex_time|the median throughput time of the cases in the simulated log|
168
-
|input_case_arrival_ratio|the case arrival ratio that was provided by the user, or automatically calculated
169
-
from the event log.|
170
-
|total_cases_time|the difference between the last timestamp of the log, and the first timestamp of the
171
-
simulated log.|
170
+
|input_case_arrival_ratio|the case arrival ratio that was provided by the user, or automatically calculated from the event log.|
171
+
|total_cases_time|the difference between the last timestamp of the log, and the first timestamp of the simulated log.|
172
172
173
173
174
174
@@ -253,32 +253,21 @@ petri_semaph_fifo
253
253
254
254
Inspect parameters
255
255
256
-
|Variants.PETRI_SEMAPH_FIFO|Parameters.PARAM_NUM_SIMULATIONS|Number of simulations that are performed (the goal is to have such number of traces
257
-
in the model)|
256
+
257
+
258
+
|Variants.PETRI_SEMAPH_FIFO|Parameters.PARAM_NUM_SIMULATIONS|Number of simulations that are performed (the goal is to have such number of traces in the model)|
258
259
|---|---|---|
259
260
||Parameters.PARAM_CASE_ARRIVAL_RATIO|The case arrival ratio that is specified by the user.|
260
261
||Parameters.PARAM_MAP_RESOURCES_PER_PLACE|A map containing for each place of the Petri net the maximum amount of tokens|
261
-
||Parameters.PARAM_DEFAULT_NUM_RESOURCES_PER_PLACE|If the map of resources per place is not specified, then use the specified maximum
262
-
number of resources per place.|
262
+
||Parameters.PARAM_DEFAULT_NUM_RESOURCES_PER_PLACE|If the map of resources per place is not specified, then use the specified maximum number of resources per place.|
263
263
||Parameters.PARAM_MAX_THREAD_EXECUTION_TIME|Specifies the maximum execution time of the simulation (for example, 60 seconds).|
264
-
||Parameters.PARAM_SMALL_SCALE_FACTOR|Specifies the ratio between the “real” time scale and the simulation time scale. A
265
-
higher ratio means that the simulation goes faster but is in general less accurate.
266
-
A lower ratio means that the simulation goes slower and is in general more accurate
267
-
(in providing detailed diagnostics). The default choice is 864000 seconds (10 days).
268
-
So that means that a second in the simulation is corresponding to 10 days of real
269
-
log.|
270
-
||Parameters.PARAM_ENABLE_DIAGNOSTICS|Enables the printing of the simulation diagnostics through the usage of the
271
-
“logging” class of Python|
264
+
||Parameters.PARAM_SMALL_SCALE_FACTOR|Specifies the ratio between the “real” time scale and the simulation time scale. A higher ratio means that the simulation goes faster but is in general less accurate. A lower ratio means that the simulation goes slower and is in general more accurate (in providing detailed diagnostics). The default choice is 864000 seconds (10 days). So that means that a second in the simulation is corresponding to 10 days of real log.|
265
+
||Parameters.PARAM_ENABLE_DIAGNOSTICS|Enables the printing of the simulation diagnostics through the usage of the “logging” class of Python|
272
266
||Parameters.ACTIVITY_KEY|The attribute of the log that should be used as activity|
273
267
||Parameters.TIMESTAMP_KEY|The attribute of the log that should be used as timestamp|
274
-
||Parameters.TOKEN_REPLAY_VARIANT|The variant of the token-based replay to use: token_replay,
275
-
the classic variant, that cannot handle duplicate transitions;
276
-
backwards, the backwards token-based replay, that is slower but can handle
277
-
invisible transitions.|
278
-
||Parameters.PARAM_FORCE_DISTRIBUTION|If specified, the distribution that is forced for the transitions (normal,
279
-
exponential)|
280
-
||Parameters.PARAM_DIAGN_INTERVAL|The time interval in which diagnostics should be printed (for example, diagnostics
281
-
should be printed every 10 seconds).|
268
+
||Parameters.TOKEN_REPLAY_VARIANT|The variant of the token-based replay to use: token_replay, the classic variant, that cannot handle duplicate transitions; backwards, the backwards token-based replay, that is slower but can handle invisible transitions.|
269
+
||Parameters.PARAM_FORCE_DISTRIBUTION|If specified, the distribution that is forced for the transitions (normal, exponential)|
270
+
||Parameters.PARAM_DIAGN_INTERVAL|The time interval in which diagnostics should be printed (for example, diagnostics should be printed every 10 seconds).|
282
271
283
272
284
273
@@ -320,6 +309,8 @@ The list of parameters are:
320
309
321
310
Inspect parameters
322
311
312
+
313
+
323
314
|MAX_LIMIT_NUM_TRACES|Maximum number of traces that are returned by the algorithm.|
324
315
|---|---|
325
316
|MAX_TRACE_LENGTH|Maximum length of a trace that is output of the algorithm.|
0 commit comments