Energy-GNoME

AI-Driven Screening and Prediction for Selected Advanced Energy Materials

This repository contains the database, documentation, Python library (coming soon), and notebooks used to build the Energy-GNoME database.

The purpose of this repository is to enable reproducibility and, more importantly, to support the ongoing addition of your own data points for model training, since Energy-GNoME is designed as a living database.

For further details, refer to the associated article:

De Angelis P., Barletta G., Trezza G., Asinari P., Chiavazzo E. "Energy-GNoME: A living database of selected materials for energy applications". Energy and AI 22, 100605, 2025. doi: 10.1016/j.egyai.2025.100605.

How to cite

If you find this project valuable, please consider citing the following article:

De Angelis P., Barletta G., Trezza G., Asinari P., Chiavazzo E. "Energy-GNoME: A living database of selected materials for energy applications". Energy and AI 22, 100605, 2025. doi: 10.1016/j.egyai.2025.100605.

@article{DEANGELIS2025100605,
title = {Energy-GNoME: A living database of selected materials for energy applications},
journal = {Energy and AI},
volume = {22},
pages = {100605},
year = {2025},
issn = {2666-5468},
doi = {10.1016/j.egyai.2025.100605},
url = {https://www.sciencedirect.com/science/article/pii/S2666546825001375},
author = {Paolo {De Angelis} and Giulio Barletta and Giovanni Trezza and Pietro Asinari and Eliodoro Chiavazzo},
keywords = {Energy materials, Artificial Intelligence, Machine Learning, Deep Learning, Thermoelectric, Battery, Perovskite},
abstract = {Artificial Intelligence (AI) in materials science is driving significant advancements in the discovery of advanced materials for energy applications. The recent GNoME protocol identifies over 380,000 novel stable crystals. From this, we identify over 38,500 materials with potential as energy materials forming the core of the Energy-GNoME database. Our unique combination of Machine Learning (ML) and Deep Learning (DL) tools mitigates cross-domain data bias using feature spaces, thus identifying potential candidates for thermoelectric materials, novel battery cathodes, and novel perovskites. First, classifiers with both structural and compositional features detect domains of applicability, where we expect enhanced reliability of regressors. Here, regressors are trained to predict key materials properties, like thermoelectric figure of merit (zT), band gap (Eg), and cathode voltage (ΔVc). This method significantly narrows the pool of potential candidates, serving as an efficient guide for experimental and computational chemistry investigations and accelerating the discovery of materials suited for electricity generation, energy storage and conversion.}
}

Additional articles to cite:

GNoME Database: please also consider citing the foundational GNoME database work:

Merchant, A., Batzner, S., Schoenholz, S.S. et al. "Scaling deep learning for materials discovery". Nature 624, 80-85, 2023. doi: 10.1038/s41586-023-06735-9.

E(3)NN Model: and the work introducing the E(3)NN graph neural network model:

Chen Z., Andrejevic N., Smidt T. et al. "Direct Prediction of Phonon Density of States With Euclidean Neural Networks". Advanced Science 8 (12), 2004214, 2021. doi: 10.1002/advs.202004214.

Project Status

Detailed TODO list:

Project Organization

├── LICENSE            <- Open-source license if one is chosen
├── Makefile           <- Makefile with convenience commands like `make data` or `make train`
├── README.md          <- The top-level README for developers using this project.
├── data
│   ├── external       <- Data from third party sources.
│   ├── final          <- The screened candidates, along with predictions on their properties of interest.
│   ├── interim        <- Intermediate data that has been transformed.
│   ├── processed      <- The final, canonical data sets for modeling.
│   └── raw            <- The original, immutable data dump.
│
├── docs               <- A default mkdocs project; see www.mkdocs.org for details
│
├── models             <- Trained and serialized models, model predictions, or model summaries
│
├── notebooks          <- Jupyter notebooks demonstrating example usage of `energy-gnome`.
│
├── pyproject.toml     <- Project configuration file with package metadata for
│                         energy_gnome and configuration for tools like black
│
├── references         <- Data dictionaries, manuals, and all other explanatory materials.
│
├── reports            <- Generated analysis as HTML, PDF, LaTeX, etc.
│   └── figures        <- Generated graphics and figures to be used in reporting
│
├── requirements.txt   <- The requirements file for reproducing the analysis environment, e.g.
│                         generated with `pip freeze > requirements.txt`
│
├── setup.cfg          <- Configuration file for flake8
│
└── energy_gnome       <- Source code for use in this project.
    │
    ├── __init__.py    <- Makes energy_gnome a Python module
    │
    ├── config.py      <- Store useful variables and configuration
    │
    ├── dataset        <- Scripts to handle data and features for modeling
    │
    ├── models         <- Scripts to handle ML models
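
The layout above can be navigated programmatically. The following is a minimal sketch, not part of the `energy_gnome` library: the helper names `data_dirs` and `list_stage_files` are hypothetical, and the only thing taken from this README is the set of sub-folder names under `data`. No assumption is made about the file names or formats stored inside each folder.

```python
from pathlib import Path

# Data stage folder names, copied from the project tree above.
DATA_STAGES = ("external", "final", "interim", "processed", "raw")


def data_dirs(project_root):
    """Map each data stage name to its directory under <project_root>/data.

    Hypothetical helper for illustration; it only builds paths and does not
    check that the directories exist.
    """
    root = Path(project_root)
    return {stage: root / "data" / stage for stage in DATA_STAGES}


def list_stage_files(project_root, stage):
    """Return the sorted files found in one data stage.

    Returns an empty list when the stage directory is missing, for example
    in a fresh clone where large data files have not been downloaded.
    """
    folder = data_dirs(project_root)[stage]
    if not folder.is_dir():
        return []
    return sorted(p for p in folder.rglob("*") if p.is_file())


if __name__ == "__main__":
    # Print how many files each stage holds, relative to the current directory.
    for stage in DATA_STAGES:
        print(stage, len(list_stage_files(".", stage)))
```

Run from the repository root, this prints one line per data stage with the number of files it contains, which is a quick way to check which stages are populated before opening the notebooks.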

Contributing

⚠️ Work in Progress

We are actively working on improving testing and refining the API to support the seamless integration of new models and datasets. Our goal is to keep the project aligned with the latest advancements in computational materials science.

How You Can Contribute

While the contribution process is still under development, you’re welcome to get involved by:

Reviewing the contribution guidelines and our Code of Conduct.

Forking the repository and creating a feature branch.

Adding your model or dataset (note: the test suite is still under construction).

Submitting a pull request for review.

For Larger Contributions

If you're interested in integrating new material descriptors or machine learning models (for regression or classification), we recommend:

Opening an issue to discuss your proposal, or

Contacting us directly at paolo.deangelis@polito.it for guidance and support.

We are happy to assist with integration and discuss potential research collaborations using the protocol or database.
