Commit d4da475

ReadME+requirements.txt
1 parent c9997d3 commit d4da475

1 file changed

+30
-14
lines changed

README.md

Lines changed: 30 additions & 14 deletions
@@ -72,29 +72,45 @@ For the patterns output files, each pattern contains one more line than the grap
7272

7373
## Datasets
7474
The datasets used in the paper are available in the `data` folder. The following datasets are available:
75-
* `MUTAG` : MUTAG dataset, available [here](https://ls11-www.cs.tu-dortmund.de/staff/morris/graphkerneldatasets).
76-
* `NCI1` : NCI1 dataset, available [here](https://ls11-www.cs.tu-dortmund.de/staff/morris/graphkerneldatasets).
77-
* `PTC` : PTC dataset, available [here](https://ls11-www.cs.tu-dortmund.de/staff/morris/graphkerneldatasets).
78-
* `DD` : DD dataset, available [here](https://ls11-www.cs.tu-dortmund.de/staff/morris/graphkerneldatasets).
75+
* `MUTAG` : MUTAG dataset, representing chemical compounds and their mutagenic properties [[D'91](#references)],
76+
* `NCI1` : NCI1 dataset, representing chemical compounds classified according to their activity against non-small cell lung cancer cell lines [[W'06](#references)],
77+
* `PTC` : PTC dataset, representing molecules classified according to their carcinogenicity [[T'03](#references)],
78+
* `DD` : DD dataset, representing amino acids and their interactions [[D'03](#references)],
79+
80+
Each of these datasets can be found [here](https://www.philippe-fournier-viger.com/spmf/index.php?link=datasets.php).
7981
* `FOPPA` : dataset extracted from FOPPA, a database of French public procurement notices [[P'22](#references)].
8082
# Dependencies
81-
Tested with `SPMF` version 2.54, and `python` version 3.8.0 with the following packages:
82-
* [`pandas`](https://pypi.org/project/pandas/): version 1.3.5
83-
* [`numpy`](https://pypi.org/project/numpy/): version 1.22.4
84-
* [`networkx`](https://pypi.org/project/numpy/): version 2.6.3
85-
* [`sklearn`](https://pypi.org/project/numpy/): version 0.0
86-
* [`matplotlib`](https://pypi.org/project/numpy/): version 3.6.0
87-
83+
Tested with `SPMF` version 2.54, and `python` version 3.6.13 with the following packages:
84+
* [`pandas`](https://pypi.org/project/pandas/): version 1.1.5
85+
* [`numpy`](https://pypi.org/project/numpy/): version 1.19.5
86+
* [`networkx`](https://pypi.org/project/networkx/): version 2.5.1
87+
* [`sklearn`](https://pypi.org/project/scikit-learn/): version 0.24.2
88+
* [`matplotlib`](https://pypi.org/project/matplotlib/): version 3.3.4
89+
* [`grakel`](https://pypi.org/project/grakel/): version 0.1.8
90+
* [`karateclub`](https://pypi.org/project/karateclub/): version 1.3.3
91+
* [`stellargraph`](https://pypi.org/project/stellargraph/): version 1.2.1
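
The versions listed above can be turned into pip-style pins, or compared against an existing environment. The following is a minimal sketch, not the content of the repository's `requirements.txt` (which is not shown here). The helper names `requirement_lines` and `find_mismatches` are hypothetical, and the PyPI name `scikit-learn` is assumed for the package listed as `sklearn`.

```python
# Versions copied from the dependency list above.
# Assumption: `sklearn` is installed from the PyPI project `scikit-learn`.
PINNED = {
    "pandas": "1.1.5",
    "numpy": "1.19.5",
    "networkx": "2.5.1",
    "scikit-learn": "0.24.2",
    "matplotlib": "3.3.4",
    "grakel": "0.1.8",
    "karateclub": "1.3.3",
    "stellargraph": "1.2.1",
}


def requirement_lines(pins):
    """Return one pip-style `name==version` line per package."""
    return [f"{name}=={version}" for name, version in pins.items()]


def find_mismatches(pins, installed):
    """Return {name: (expected, found)} for every package whose version differs.

    `installed` maps package names to version strings; a package that is
    missing from `installed` is reported with `found` set to None.
    """
    return {
        name: (expected, installed.get(name))
        for name, expected in pins.items()
        if installed.get(name) != expected
    }


if __name__ == "__main__":
    # Print the pins in a form that pip can read from a requirements file.
    print("\n".join(requirement_lines(PINNED)))
```

The `installed` mapping is passed in explicitly so that the comparison stays independent of how versions are collected, for example from the output of `pip freeze`.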
8892

8993

9094
The VF2 and ISMAGS algorithms are included in the [`NetworkX` library](https://networkx.org/).
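
As an illustration of the two matchers, the sketch below checks whether a small pattern occurs in a graph with both VF2 (`GraphMatcher`) and `ISMAGS` from `networkx.algorithms.isomorphism`. How this repository actually calls them is not shown here; the toy graphs and the helper names `contains_pattern_vf2` and `contains_pattern_ismags` are assumptions made for the example.

```python
import networkx as nx
from networkx.algorithms import isomorphism


def contains_pattern_vf2(graph, pattern):
    """Return True if `pattern` matches a subgraph of `graph`, using VF2."""
    matcher = isomorphism.GraphMatcher(graph, pattern)
    return matcher.subgraph_is_isomorphic()


def contains_pattern_ismags(graph, pattern):
    """Return True if `pattern` matches a subgraph of `graph`, using ISMAGS."""
    matcher = isomorphism.ISMAGS(graph, pattern)
    return matcher.subgraph_is_isomorphic()


# Toy data: a 4-cycle as the host graph, with two candidate patterns.
graph = nx.cycle_graph(4)
path = nx.path_graph(3)          # three nodes in a line: occurs in the cycle
triangle = nx.complete_graph(3)  # three mutually adjacent nodes: does not occur

if __name__ == "__main__":
    for name, pattern in [("path", path), ("triangle", triangle)]:
        print(name,
              contains_pattern_vf2(graph, pattern),
              contains_pattern_ismags(graph, pattern))
```

Both helpers ignore node and edge labels. Labelled matching would require passing `node_match` and `edge_match` callables to the matchers.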
9195

9296
For the baselines:
93-
* The WL and WLOA algorithms are included in the Grakel library, available [here](https://ysig.github.io/GraKeL/0.1a8/benchmarks.html)
94-
* Graph2Vec is included in the karateclub library, available [here](https://karateclub.readthedocs.io/en/latest/)
95-
* DGCNN is included in the stellargraph library, available [here](https://stellargraph.readthedocs.io/en/stable/).
97+
* The WL and WLOA algorithms are included in the Grakel library, documentation available [here](https://ysig.github.io/GraKeL/0.1a8/benchmarks.html)
98+
* Graph2Vec is included in the karateclub library, documentation available [here](https://karateclub.readthedocs.io/en/latest/)
99+
* DGCNN is included in the stellargraph library, documentation available [here](https://stellargraph.readthedocs.io/en/stable/).
96100
* We use the implementation of CORK from Marisa Thoma. This implementation is available in the `CORKcpp.zip` archive.
97101

98102

99103
# References
100104
* **[P'22]** L. Potin, V. Labatut, R. Figueiredo, C. Largeron, P.-H. Morand. *FOPPA: A database of French Open Public Procurement Award notices*, Technical Report, Avignon University, 2022. [⟨hal-03796734⟩](https://hal.archives-ouvertes.fr/hal-03796734)
105+
* **[D'91]** A.S. Debnath, R.L. Lopez, G. Debnath, A. Shusterman, C. Hansch. *Structure-activity
106+
relationship of mutagenic aromatic and heteroaromatic nitro compounds.
107+
Correlation with molecular orbital energies and hydrophobicity*, Journal of Medicinal
108+
Chemistry 34(2), 786–797, 1991.
109+
* **[W'06]** N. Wale, G. Karypis. *Comparison of descriptor spaces for chemical compound
110+
retrieval and classification*, 6th International Conference on Data Mining, pp.
111+
678–689, 2006.
112+
* **[T'03]** H. Toivonen, A. Srinivasan, R.D. King, S. Kramer, C. Helma. *Statistical evaluation
113+
of the predictive toxicology challenge 2000-2001*, Bioinformatics 19(10),
114+
1183–1193, 2003.
115+
* **[D'03]** P.D. Dobson, A.J. Doig. *Distinguishing enzyme structures from non-enzymes
116+
without alignments*, Journal of Molecular Biology 330(4), 771–783, 2003.
