Skip to content

Commit 9e3b934

Browse files
Merge remote-tracking branch 'upstream/hotfixes' into release
2 parents e06689b + c8d3569 commit 9e3b934

9 files changed

+66
-83
lines changed

docs/01_handling_event_data.md

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -108,6 +108,8 @@ as a
108108

109109
-file:
110110

111+
112+
111113
|CaseID|Activity|Timestamp|clientID|
112114
|---|---|---|---|
113115
|1|register request|20200422T0455|1337|

docs/04_process_discovery.md

Lines changed: 11 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -8,6 +8,8 @@ order of events/activities that are executed during a process execution.
88
In the following, we made up an overview to visualize the advantages and disadvantages of
99
the
1010
mining algorithms.
11+
12+
1113
|Alpha|Alpha+|Heuristic|Inductive|
1214
|---|---|---|---|
1315
|Cannot handle loops of length one and length two|Can handle loops of length one and length two|Takes frequency into account|Can handle invisible tasks|
@@ -165,6 +167,8 @@ if __name__ == "__main__":
165167
```
166168

167169

170+
171+
168172
|Parameter name|Meaning|
169173
|---|---|
170174
|dependency_threshold|dependency threshold of the Heuristics Miner (default: 0.5)|
@@ -400,17 +404,12 @@ if __name__ == "__main__":
400404
Visualizing the DFGs, we can say that the correlation miner was able to discover a visualization
401405
where the main path is clear.
402406
Different variants of the correlation miner are available:
403-
|Variants.CLASSIC|Calculates the P/S matrix and the duration matrix in the classic way (the entire list of
404-
events is used)|
407+
408+
409+
|Variants.CLASSIC|Calculates the P/S matrix and the duration matrix in the classic way (the entire list of events is used)|
405410
|---|---|
406-
|Variants.TRACE_BASED|Calculates the P/S matrix and the duration matrix on a classic event log,
407-
trace-by-trace, and merges the results. The resolution of the linear problem permits to
408-
obtain a model that is more understandable than the classic DFG calculated on top of the
409-
log.|
410-
|Variants.CLASSIC_SPLIT|Calculates the P/S matrix and the duration matrix on the entire list of events, as in
411-
the classic version, but splits that in chunks to fasten the computation. Hence, the
412-
generated model is less accurate (in comparison to the CLASSIC version) but the
413-
calculation is faster. The default chunk size is 100000 events.|
411+
|Variants.TRACE_BASED|Calculates the P/S matrix and the duration matrix on a classic event log, trace-by-trace, and merges the results. The resolution of the linear problem permits to obtain a model that is more understandable than the classic DFG calculated on top of the log.|
412+
|Variants.CLASSIC_SPLIT|Calculates the P/S matrix and the duration matrix on the entire list of events, as in the classic version, but splits that in chunks to fasten the computation. Hence, the generated model is less accurate (in comparison to the CLASSIC version) but the calculation is faster. The default chunk size is 100000 events.|
414413

415414

416415

@@ -443,6 +442,8 @@ if __name__ == "__main__":
443442
Some parameters can be used in order to customize the execution of the temporal profile:
444443
See Parameters
445444

445+
446+
446447
|Parameter Key|Type|Default|Description|
447448
|---|---|---|---|
448449
|Parameters.ACTIVITY_KEY|string|concept:name|The attribute to use as activity.|

docs/06_conformance_checking.md

Lines changed: 6 additions & 14 deletions
Original file line numberDiff line numberDiff line change
@@ -1035,6 +1035,8 @@ if __name__ == "__main__":
10351035
Some parameters can be used in order to customize the conformance checking of the temporal profile:
10361036
See Parameters
10371037

1038+
1039+
10381040
|Parameter Key|Type|Default|Description|
10391041
|---|---|---|---|
10401042
|Parameters.ACTIVITY_KEY|string|concept:name|The attribute to use as activity.|
@@ -1064,22 +1066,12 @@ execution is repeated (that means rework) from different people.
10641066
The verification of LTL rules requires the insertion of the required parameters
10651067
(of the specific rule). Hence, this form of conformance checking is not automatic.
10661068
The LTL rules that are implemented in pm4py are found in the following table:
1069+
1070+
10671071
|LTL rule|Description|
10681072
|---|---|
1069-
|ltl.ltl_checker.four_eyes_principle(log, A, B)|Applies the four eyes principle on the activities A and B.
1070-
Parameters:
1071-
log: event log
1072-
A: the activity A of the rule (an activity of the log)
1073-
B: the activity B of the rule (an activity of the log)
1074-
Returns:
1075-
Filtered log object (containing the cases which have A and B done by the same person)|
1076-
|ltl.ltl_checker.attr_value_different_persons(log, A)|Finds the process executions in which the activity A is repeated by
1077-
different people.
1078-
Parameters:
1079-
log: event log
1080-
A: the activity A of the rule (an activity of the log)
1081-
Returns:
1082-
Filtered log object (containing the cases which have A repeated by different people)|
1073+
|ltl.ltl_checker.four_eyes_principle(log, A, B)|Applies the four eyes principle on the activities A and B. Parameters: log: event log A: the activity A of the rule (an activity of the log) B: the activity B of the rule (an activity of the log) Returns: Filtered log object (containing the cases which have A and B done by the same person)|
1074+
|ltl.ltl_checker.attr_value_different_persons(log, A)|Finds the process executions in which the activity A is repeated by different people. Parameters: log: event log A: the activity A of the rule (an activity of the log) Returns: Filtered log object (containing the cases which have A repeated by different people)|
10831075

10841076

10851077

docs/07_process_trees.md

Lines changed: 5 additions & 9 deletions
Original file line numberDiff line numberDiff line change
@@ -56,6 +56,8 @@ if __name__ == "__main__":
5656

5757
Suppose the following start activity and their respective occurrences.
5858

59+
60+
5961
|Parameter|Meaning|
6062
|---|---|
6163
|MODE|most frequent number of visible activities (default 20)|
@@ -66,19 +68,13 @@ Suppose the following start activity and their respective occurrences.
6668
|PARALLEL|probability to add a parallel operator to tree (default 0.25)|
6769
|LOOP|probability to add a loop operator to tree (default 0.25)|
6870
|OR|probability to add an or operator to tree (default 0)|
69-
|SILENT|probability to add silent activity to a choice or loop operator
70-
(default 0.25)|
71+
|SILENT|probability to add silent activity to a choice or loop operator (default 0.25)|
7172
|DUPLICATE|probability to duplicate an activity label (default 0)|
7273
|LT_DEPENDENCY|probability to add a random dependency to the tree (default 0)|
7374
|INFREQUENT|probability to make a choice have infrequent paths (default 0.25)|
7475
|NO_MODELS|number of trees to generate from model population (default 10)|
75-
|UNFOLD|whether or not to unfold loops in order to include choices
76-
underneath in dependencies: 0=False, 1=True
77-
if lt_dependency <= 0: this should always be 0 (False)
78-
if lt_dependency > 0: this can be 1 or 0 (True or False) (default
79-
10)|
80-
|MAX_REPEAT|maximum number of repetitions of a loop (only used when unfolding is
81-
True) (default 10)|
76+
|UNFOLD|whether or not to unfold loops in order to include choices underneath in dependencies: 0=False, 1=True if lt_dependency <= 0: this should always be 0 (False) if lt_dependency > 0: this can be 1 or 0 (True or False) (default 10)|
77+
|MAX_REPEAT|maximum number of repetitions of a loop (only used when unfolding is True) (default 10)|
8278

8379

8480

docs/08_feature_selection.md

Lines changed: 6 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -114,18 +114,14 @@ log_to_features.apply
114114
.
115115
The types of features that can be considered by a manual feature selection are:
116116

117-
|str_ev_attr|String attributes at the event level: these are hot-encoded into features that may
118-
assume value 0 or value 1.|
117+
118+
119+
|str_ev_attr|String attributes at the event level: these are hot-encoded into features that may assume value 0 or value 1.|
119120
|---|---|
120-
|str_tr_attr|String attributes at the trace level: these are hot-encoded into features that may
121-
assume value 0 or value 1.|
122-
|num_ev_attr|Numeric attributes at the event level: these are encoded by including the last value of
123-
the attribute among the events of the trace.|
121+
|str_tr_attr|String attributes at the trace level: these are hot-encoded into features that may assume value 0 or value 1.|
122+
|num_ev_attr|Numeric attributes at the event level: these are encoded by including the last value of the attribute among the events of the trace.|
124123
|num_tr_attr|Numeric attributes at trace level: these are encoded by including the numerical value.|
125-
|str_evsucc_attr|Successions related to the string attributes values at the event level: for example, if
126-
we have a trace [A,B,C], it might be important to include not only the presence of the
127-
single values A, B and C as features; but also the presence of the directly-follows
128-
couples (A,B) and (B,C).|
124+
|str_evsucc_attr|Successions related to the string attributes values at the event level: for example, if we have a trace [A,B,C], it might be important to include not only the presence of the single values A, B and C as features; but also the presence of the directly-follows couples (A,B) and (B,C).|
129125

130126

131127

docs/09_statistics.md

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -124,8 +124,9 @@ In the following, we aim to insert the following attributes to events inside a l
124124

125125
Attributes
126126

127-
|@@approx_bh_partial_cycle_time|Incremental cycle time associated to the event (the cycle time of the last event is
128-
the cycle time of the instance)|
127+
128+
129+
|@@approx_bh_partial_cycle_time|Incremental cycle time associated to the event (the cycle time of the last event is the cycle time of the instance)|
129130
|---|---|
130131
|@@approx_bh_partial_lead_time|Incremental lead time associated to the event|
131132
|@@approx_bh_overall_wasted_time|Difference between the partial lead time and the partial cycle time values|

docs/10_log-model_evaluation.md

Lines changed: 6 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -430,12 +430,12 @@ The list of parameters are:
430430

431431
Inspect parameters
432432

433-
|PRINT_DIAGNOSTICS|Enables the printing of the diagnostics on the Petri net, when WOFLAN is
434-
executed.|
433+
434+
435+
|PRINT_DIAGNOSTICS|Enables the printing of the diagnostics on the Petri net, when WOFLAN is executed.|
435436
|---|---|
436437
|RETURN_DIAGNOSTICS|Returns a dictionary containing the diagnostics.|
437-
|RETURN_ASAP_WHEN_NOT_SOUND|Stops the execution of WOFLAN when a condition determining that the Petri net
438-
is not a sound workflow net is found.|
438+
|RETURN_ASAP_WHEN_NOT_SOUND|Stops the execution of WOFLAN when a condition determining that the Petri net is not a sound workflow net is found.|
439439

440440

441441

@@ -503,6 +503,8 @@ the corresponding step):
503503

504504
Inspect outputs
505505

506+
507+
506508
|S_C_NET||
507509
|---|---|
508510
|PLACE_INVARIANTS||

docs/11_simulation.md

Lines changed: 25 additions & 34 deletions
Original file line numberDiff line numberDiff line change
@@ -14,19 +14,20 @@ been provided by the user.
1414
A playout of a Petri net takes as input a Petri net along with an initial marking,
1515
and returns a list of process executions that are allowed from the process model.
1616
We offer different types of playouts:
17-
|Variants.BASIC_PLAYOUT|A basic playout that accepts a Petri net along with an initial marking, and returns a
18-
specified number of process executions (repetitions may be possible).|
17+
18+
19+
|Variants.BASIC_PLAYOUT|A basic playout that accepts a Petri net along with an initial marking, and returns a specified number of process executions (repetitions may be possible).|
1920
|---|---|
20-
|Variants.EXTENSIVE|A playout that accepts a Petri net along with an initial marking, and returns all the
21-
executions that are possible according to the model, up to a provided
22-
length of trace (may be computationally expensive).|
21+
|Variants.EXTENSIVE|A playout that accepts a Petri net along with an initial marking, and returns all the executions that are possible according to the model, up to a provided length of trace (may be computationally expensive).|
2322

2423

2524

2625
The list of parameters for such variants are:
2726

2827
Inspect parameters
2928

29+
30+
3031
|Variants.BASIC_PLAYOUT|Parameters.ACTIVITY_KEY|The name of the attribute to use as activity in the playout log.|
3132
|---|---|---|
3233
||Parameters.TIMESTAMP_KEY|The name of the attribute to use as timestamp in the playout log.|
@@ -145,6 +146,8 @@ if __name__ == "__main__":
145146

146147
During the replay operation, some debug messages are written to the screen. The main outputs of
147148
the simulation process are:
149+
150+
148151
|simulated_log|The traces that have been simulated during the simulation.|
149152
|---|---|
150153
|res|The result of the simulation (Python dictionary).|
@@ -157,18 +160,15 @@ res
157160

158161
Inspect outputs
159162

160-
|places_interval_trees|an interval tree for each place, that hosts an interval for each time when it was
161-
“full” according to the specified maximum amount of tokens per place.|
163+
164+
165+
|places_interval_trees|an interval tree for each place, that hosts an interval for each time when it was “full” according to the specified maximum amount of tokens per place.|
162166
|---|---|
163-
|transitions_interval_trees|an interval tree for each transition, that contains all the time intervals in which
164-
the transition was enabled but not yet fired (so, the time between a transition was
165-
fully enabled and the consumption of the tokens from the input places)|
167+
|transitions_interval_trees|an interval tree for each transition, that contains all the time intervals in which the transition was enabled but not yet fired (so, the time between a transition was fully enabled and the consumption of the tokens from the input places)|
166168
|cases_ex_time|a list containing the throughput times for all the cases of the log|
167169
|median_cases_ex_time|the median throughput time of the cases in the simulated log|
168-
|input_case_arrival_ratio|the case arrival ratio that was provided by the user, or automatically calculated
169-
from the event log.|
170-
|total_cases_time|the difference between the last timestamp of the log, and the first timestamp of the
171-
simulated log.|
170+
|input_case_arrival_ratio|the case arrival ratio that was provided by the user, or automatically calculated from the event log.|
171+
|total_cases_time|the difference between the last timestamp of the log, and the first timestamp of the simulated log.|
172172

173173

174174

@@ -253,32 +253,21 @@ petri_semaph_fifo
253253

254254
Inspect parameters
255255

256-
|Variants.PETRI_SEMAPH_FIFO|Parameters.PARAM_NUM_SIMULATIONS|Number of simulations that are performed (the goal is to have such number of traces
257-
in the model)|
256+
257+
258+
|Variants.PETRI_SEMAPH_FIFO|Parameters.PARAM_NUM_SIMULATIONS|Number of simulations that are performed (the goal is to have such number of traces in the model)|
258259
|---|---|---|
259260
||Parameters.PARAM_CASE_ARRIVAL_RATIO|The case arrival ratio that is specified by the user.|
260261
||Parameters.PARAM_MAP_RESOURCES_PER_PLACE|A map containing for each place of the Petri net the maximum amount of tokens|
261-
||Parameters.PARAM_DEFAULT_NUM_RESOURCES_PER_PLACE|If the map of resources per place is not specified, then use the specified maximum
262-
number of resources per place.|
262+
||Parameters.PARAM_DEFAULT_NUM_RESOURCES_PER_PLACE|If the map of resources per place is not specified, then use the specified maximum number of resources per place.|
263263
||Parameters.PARAM_MAX_THREAD_EXECUTION_TIME|Specifies the maximum execution time of the simulation (for example, 60 seconds).|
264-
||Parameters.PARAM_SMALL_SCALE_FACTOR|Specifies the ratio between the “real” time scale and the simulation time scale. A
265-
higher ratio means that the simulation goes faster but is in general less accurate.
266-
A lower ratio means that the simulation goes slower and is in general more accurate
267-
(in providing detailed diagnostics). The default choice is 864000 seconds (10 days).
268-
So that means that a second in the simulation is corresponding to 10 days of real
269-
log.|
270-
||Parameters.PARAM_ENABLE_DIAGNOSTICS|Enables the printing of the simulation diagnostics through the usage of the
271-
“logging” class of Python|
264+
||Parameters.PARAM_SMALL_SCALE_FACTOR|Specifies the ratio between the “real” time scale and the simulation time scale. A higher ratio means that the simulation goes faster but is in general less accurate. A lower ratio means that the simulation goes slower and is in general more accurate (in providing detailed diagnostics). The default choice is 864000 seconds (10 days). So that means that a second in the simulation is corresponding to 10 days of real log.|
265+
||Parameters.PARAM_ENABLE_DIAGNOSTICS|Enables the printing of the simulation diagnostics through the usage of the “logging” class of Python|
272266
||Parameters.ACTIVITY_KEY|The attribute of the log that should be used as activity|
273267
||Parameters.TIMESTAMP_KEY|The attribute of the log that should be used as timestamp|
274-
||Parameters.TOKEN_REPLAY_VARIANT|The variant of the token-based replay to use: token_replay,
275-
the classic variant, that cannot handle duplicate transitions;
276-
backwards, the backwards token-based replay, that is slower but can handle
277-
invisible transitions.|
278-
||Parameters.PARAM_FORCE_DISTRIBUTION|If specified, the distribution that is forced for the transitions (normal,
279-
exponential)|
280-
||Parameters.PARAM_DIAGN_INTERVAL|The time interval in which diagnostics should be printed (for example, diagnostics
281-
should be printed every 10 seconds).|
268+
||Parameters.TOKEN_REPLAY_VARIANT|The variant of the token-based replay to use: token_replay, the classic variant, that cannot handle duplicate transitions; backwards, the backwards token-based replay, that is slower but can handle invisible transitions.|
269+
||Parameters.PARAM_FORCE_DISTRIBUTION|If specified, the distribution that is forced for the transitions (normal, exponential)|
270+
||Parameters.PARAM_DIAGN_INTERVAL|The time interval in which diagnostics should be printed (for example, diagnostics should be printed every 10 seconds).|
282271

283272

284273

@@ -320,6 +309,8 @@ The list of parameters are:
320309

321310
Inspect parameters
322311

312+
313+
323314
|MAX_LIMIT_NUM_TRACES|Maximum number of traces that are returned by the algorithm.|
324315
|---|---|
325316
|MAX_TRACE_LENGTH|Maximum length of a trace that is output of the algorithm.|

docs/15_streaming_process_mining.md

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -472,6 +472,8 @@ if __name__ == "__main__":
472472
```
473473

474474

475+
476+
475477
|Parameter Key|Type|Default|Description|
476478
|---|---|---|---|
477479
|Parameters.CASE_ID_KEY|string|case:concept:name|The attribute to use as case ID.|

0 commit comments

Comments
 (0)