|
1 | 1 | --- |
2 | | -title: 'NiaARM: A minimalistic framework for numerical association rule mining' |
| 2 | +title: 'NiaARM: A minimalistic framework for Numerical Association Rule Mining' |
3 | 3 | tags: |
4 | 4 | - Python |
5 | 5 | - association rule mining |
@@ -32,41 +32,40 @@ Association Rule Mining (ARM) is a data mining method intended for discovering r |
32 | 32 | transaction databases in the form of implications [@agrawal_fast_1994; @fister_jr_brief_2020]. Traditional |
33 | 33 | approaches, such as the Apriori algorithm [@agrawal_fast_1994] or ECLAT [@zaki_scalable_2000], |
34 | 34 | require the attributes in the database to be discretized. This can result in the incorporation of noise into data, |
35 | | -and potentially obtained associations may not fully reveal the story [@varol2020performance]. On the contrary, |
36 | | -Numerical association rule mining (NARM) is an extension of ARM that allows handling numerical attributes without |
| 35 | +and potentially the obtained associations may not reveal the story fully [@varol2020performance]. On the contrary, |
| 36 | +Numerical Association Rule Mining (NARM) is an extension of ARM that allows handling numerical attributes without |
37 | 37 | discretization [@fister_jr_improved_2021; @kaushik2020potential]. Thus, an algorithm can operate directly, not only with |
38 | 38 | categorical but also with numerical attributes concurrently. Interestingly, most NARM algorithms are based on |
39 | 39 | stochastic population-based nature-inspired algorithms, which proved to be very efficient in searching for association |
40 | 40 | rules [@alatas2008modenar; @kaushik2021systematic]. |
41 | 41 |
|
42 | | -The NiaARM framework is an implementation of the ARM-DE algorithm [@fister_differential_2018; @fister_jr_improved_2021], where |
43 | | -numeric association rule mining is modeled as a single objective, continuous optimization problem, where the fitness is a |
44 | | -weighted sum of the support and confidence of the built rule. The approach is extended by allowing the use of any optimization |
45 | | -algorithm from the related NiaPy framework [@vrbancic_niapy_2018], as well as having the option to select various interest |
| 42 | +The NiaARM framework is an extended implementation of the ARM-DE algorithm [@fister_differential_2018; @fister_jr_improved_2021], where |
| 43 | +Numerical Association Rule Mining is modeled as a single objective, continuous optimization problem, where the fitness is a weighted sum of the support and confidence of the built rule. The approach is extended by allowing the use of any optimization |
| 44 | +algorithm from the related NiaPy framework [@vrbancic_niapy_2018] and having the option to select various interest |
46 | 45 | measures and their corresponding weights for the fitness function. |
47 | 46 |
|
48 | 47 | The flow of the NiaARM framework is shown in \autoref{fig:NiaARM}. Users have the option to construct a dataset either from a |
49 | | -CSV file or a pandas DataFrame. The dataset is then used to build the optimization problem, along with user selected interest |
| 48 | +CSV file or a pandas Dataframe. The dataset is then used to build the optimization problem, along with user selected interest |
50 | 49 | measures to be used in the computation of the fitness function. Then the optimization problem can be solved using any algorithm |
51 | | -in the NiaPy library to mine association rules from the dataset. The rules can be exported to a CSV file, statistically |
52 | | -analysed or visualized using the visualization methods implemented in the framework, such as the hill slopes method |
| 50 | +in the NiaPy library to mine association rules from the dataset. The rules can be exported to a CSV file, |
| 51 | +analyzed statistically, or visualized using the visualization methods implemented in the framework, such as the hill slopes method |
53 | 52 | [@fister_visualization_2020]. A simple command-line interface for mining rules is also provided. |
54 | 53 |
|
55 | 54 |  |
56 | 55 |
|
57 | 56 | # Statement of need |
58 | 57 |
|
59 | | -Numerical association rule mining plays a vital role in the data revolution era [@telikani_survey_2020]. Several research |
| 58 | +Numerical Association Rule Mining plays a vital role in the data revolution era [@telikani_survey_2020]. Several research |
60 | 59 | papers that present NARM methods exist, but universal software where all primary tasks of NARM, i.e., preprocessing, searching |
61 | | -for association rules, and visualization, are lacking. The NiaARM framework provides users with methods that allow them to |
| 60 | +for association rules, and visualization, is lacking. The NiaARM framework provides users with methods that allow them to |
62 | 61 | preprocess their data, implement several interest measures, and powerful visualization techniques. In a nutshell, the benefits |
63 | 62 | of the NiaARM framework are: |
64 | 63 |
|
65 | | -1. Simple way to mine association rules on numerical, categorical, or mixed attribute-type datasets. |
| 64 | +1. A simple way to mine association rules on numerical, categorical, or mixed attribute-type datasets. |
66 | 65 |
|
67 | 66 | 2. Combined with the NiaPy library, it allows testing out the proposed approach using arbitrary nature-inspired algorithms. |
68 | 67 |
|
69 | | -3. A vast collection of implemented popular interest measures to measure mined rules' quality. |
| 68 | +3. A vast collection of implemented popular interest measures to measure the mined rules' quality. |
70 | 69 |
|
71 | 70 | 4. Powerful visualization methods. |
72 | 71 |
|
|
0 commit comments