Predicting polymer performance from chemical structure using machine learning
Accelerating sustainable materials discovery with open data
Polymers are the backbone of innovation in medicine, electronics, and sustainability. Our goal is to predict a polymer's real-world performance directly from its chemical structure.
This project supports the Open Polymer Prediction 2025 initiative by leveraging machine learning on a large-scale, open-source dataset β ten times larger than any existing resource.
We predict five key physical properties from SMILES strings:
- Density
- Thermal conductivity (Tc)
- Glass transition temperature (Tg)
- Radius of gyration (Rg)
- Fractional free volume (FFV)
.
βββ data/ # Input datasets (SMILES, labels)
βββ src/
β βββ preprocessing.py # Molecular parsing and feature extraction
β βββ models/ # ML/DL model definitions
β βββ train.py # Training pipeline
β βββ evaluate.py # Evaluation scripts
βββ notebooks/ # Jupyter notebooks for EDA and prototyping
βββ requirements.txt # Python dependencies
βββ README.md # Project documentation