You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+30-14Lines changed: 30 additions & 14 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -72,29 +72,45 @@ For the patterns output files, each pattern contains one more line than the grap
72
72
73
73
## Datasets
74
74
The datasets used in the paper are available in the `data` folder. The following datasets are available:
75
-
*`MUTAG` : MUTAG dataset, available [here](https://ls11-www.cs.tu-dortmund.de/staff/morris/graphkerneldatasets).
76
-
*`NCI1` : NCI1 dataset, available [here](https://ls11-www.cs.tu-dortmund.de/staff/morris/graphkerneldatasets).
77
-
*`PTC` : PTC dataset, available [here](https://ls11-www.cs.tu-dortmund.de/staff/morris/graphkerneldatasets).
78
-
*`DD` : DD dataset, available [here](https://ls11-www.cs.tu-dortmund.de/staff/morris/graphkerneldatasets).
75
+
*`MUTAG` : MUTAG dataset, representing chemical compounds and their mutagenic properties [[D'91](#references)],
76
+
*`NCI1` : NCI1 dataset, representing molecules and classified according to carcinogenicity [[W'06](#references)],
77
+
*`PTC` : PTC dataset, representing molecules and classified according to carcinogenicity [[T'03](#references)],
78
+
*`DD` : DD dataset, representing amino acids and their interactions [[D'03](#references)],
79
+
80
+
Each of these datasets can be found [here](https://www.philippe-fournier-viger.com/spmf/index.php?link=datasets.php).
79
81
*`FOPPA` : dataset extracted from FOPPA, a database of French public procurement notices [[P'22](#references)].
80
82
# Dependencies
81
-
Tested with `SPMF` version 2.54, and `python` version 3.8.0 with the following packages:
82
-
*[`pandas`](https://pypi.org/project/pandas/): version 1.3.5
83
-
*[`numpy`](https://pypi.org/project/numpy/): version 1.22.4
84
-
*[`networkx`](https://pypi.org/project/numpy/): version 2.6.3
85
-
*[`sklearn`](https://pypi.org/project/numpy/): version 0.0
86
-
*[`matplotlib`](https://pypi.org/project/numpy/): version 3.6.0
87
-
83
+
Tested with `SPMF` version 2.54, and `python` version 3.6.13 with the following packages:
84
+
*[`pandas`](https://pypi.org/project/pandas/): version 1.1.5
85
+
*[`numpy`](https://pypi.org/project/numpy/): version 1.19.5
86
+
*[`networkx`](https://pypi.org/project/numpy/): version 2.5.1
87
+
*[`sklearn`](https://pypi.org/project/numpy/): version 0.24.2
88
+
*[`matplotlib`](https://pypi.org/project/numpy/): version 3.3.4
89
+
*[`grakel`](https://pypi.org/project/numpy/): version 0.1.8
90
+
*[`karateclub`](https://pypi.org/project/numpy/): version 1.3.3
91
+
*[`stellargraph`](https://pypi.org/project/numpy/): version 1.2.1
88
92
89
93
90
94
The VF2 and ISMAGS algortihms are included in the [`Networkx` library](https://networkx.org/)
91
95
92
96
For the baselines:
93
-
* The WL and WLOA algorithms are included in the Grakel library, available [here](https://ysig.github.io/GraKeL/0.1a8/benchmarks.html)
94
-
* Graph2Vec is included in the karateclub library, available [here](https://karateclub.readthedocs.io/en/latest/)
95
-
* DGCNN is included in the stellargraph library, available [here](https://stellargraph.readthedocs.io/en/stable/).
97
+
* The WL and WLOA algorithms are included in the Grakel library, documentation available [here](https://ysig.github.io/GraKeL/0.1a8/benchmarks.html)
98
+
* Graph2Vec is included in the karateclub library, documentation available [here](https://karateclub.readthedocs.io/en/latest/)
99
+
* DGCNN is included in the stellargraph library, documentation available [here](https://stellargraph.readthedocs.io/en/stable/).
96
100
* We use the implementation of CORK from Marisa Thoma. This implementation is available in the `CORKcpp.zip` archive.
97
101
98
102
99
103
# References
100
104
***[P'22]** L. Potin, V. Labatut, R. Figueiredo, C. Largeron, P.-H. Morand. *FOPPA: A database of French Open Public Procurement Award notices*, Technical Report, Avignon University, 2022. [⟨hal-03796734⟩](https://hal.archives-ouvertes.fr/hal-03796734)
105
+
***[D'91]** A.S. Debnath, R.L. Lopez, G. Debnath, A. Shusterman, C. Hansch. *Structure-
106
+
activity relationship of mutagenic aromatic and heteroaromatic nitro compounds.
107
+
correlation with molecular orbital energies and hydrophobicity*, Journal of Medic-
108
+
inal Chemistry 34(2), 786–797, 1991.
109
+
***[W'06]** N.Wale, G. Karypis. *Comparison of descriptor spaces for chemical compound
110
+
retrieval and classification*, 6th International Conference on Data Mining, pp.
111
+
678–689, 2006.
112
+
***[T'03]** H . Toivonen, A. Srinivasan, R.D. King, S. Kramer, C. Helma.*Statistical eval-
113
+
uation of the predictive toxicology challenge 2000-2001*, Bioinformatics 19(10),
114
+
1183–1193, 2003.
115
+
***[D'03]** P.D. Dobson, A.J. Doig. *Distinguishing enzyme structures from non-enzymes
116
+
without alignments*, Journal of Molecular Biology 330(4), 771–783 ,2003.
0 commit comments