This package can give predictions on stability of new perovskite compoounds using machine learning approach.
- New compounds info should be put in the
newCompound.xlsxfile, with the specified format as described in the excel file - The training data should be put in the
perovskite_DFT_EaH_FormE.xlsx. This file already contains over 1900 compounds. Users can add more compounds to this training set. - Run python script
energy_prediction.pyusing command:python energy_prediction.py. Requirements:
-
python version >= 3.5
-
sklearn, pandas, numpy.
- During the running of the python script,
temporary files *.csvwill be generated, which contain the generated features of the compounds in the training set and the new compounds to be predicted. The temporary files are generated byfeature_vector.py, called by theenergy_prediction.py. - Results are put in the
prediction_result.xlsxfile. In the stability column, 1 stands for stable, 0 stands for unstable.
The energy_prediction embeds the selected best classification model (extra trees) and the best regression model (kernel ridge regression).
The elemental property database is provided as continousproperty.xlsx, discrictproperty.xlsx and shannon_perovskite.xlsx.
Feature inportance list is stored by *.txt files.