-
Notifications
You must be signed in to change notification settings - Fork 24
Add LSTMDoubleFit model for low-dimensional perovskite design #205
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
base: develop
Are you sure you want to change the base?
Add LSTMDoubleFit model for low-dimensional perovskite design #205
Conversation
update code from origin repo
move data_utils.py to dataset/utils.py delete useless code
fix bug
* fix: fix chgnet model download link * fix: set nan to 0
* feat: add task readme * fix error * update logo
* fix: update reshape * fix: fix
* feat: add task readme * fix error * update logo * Add files via upload * Update README.md * Add files via upload * Update README.md
* feat: add task readme * fix error * update logo * Add files via upload * Update README.md * Add files via upload * Update README.md * Add files via upload * Update README.md * Add files via upload * Update README.md * Delete docs/paddlematerial_overview_en.png * Delete docs/paddlematerial_overview_ch.png
* feat: add task readme * fix error * update logo * Add files via upload * Update README.md * Add files via upload * Update README.md * Add files via upload * Update README.md * Add files via upload * Update README.md * Delete docs/paddlematerial_overview_en.png * Delete docs/paddlematerial_overview_ch.png * Delete docs/logo_ppmat.png * Delete docs/ppmat_overview_en.png * Add files via upload * Update README.md * Update README.md * Update README.md * fix conflict
* feat: add task readme * fix error * update logo * Add files via upload * Update README.md * Add files via upload * Update README.md * Add files via upload * Update README.md * Add files via upload * Update README.md * Delete docs/paddlematerial_overview_en.png * Delete docs/paddlematerial_overview_ch.png * Delete docs/logo_ppmat.png * Delete docs/ppmat_overview_en.png * Add files via upload * Update README.md * Update README.md * Update README.md * fix conflict * fix words error
* Update README.md * Update README.md
* matbench_dataset * 训练文件 * Delete megnet_matbench_bulk_modulus_t_20250731_041800_s_42 directory * Delete megnet_matbench_shear_modulus_t_20250731_041740_s_42 directory * matbench数据集适配 * 修改PR * jarvis数据集适配 * megnet_readme修改 * 修改requirements,修改jarvis_dataset
* add DiffNMR * fix bugs * fix bugs * fix bugs * fix bugs * fix bugs of diffprior * fix bug * fix bugs
…set name=alex_mp_20 for mattergen training with alex_mp20 dataset. (PaddlePaddle#200) * fix diffnmr model and config. * fix AlexMP20MatterGenDataset name=alex_mp_20 for mattergen training with alex_mp20 dataset.
|
Thanks for your contribution! |
|
Thanks for your contribution! |
leeleolay
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
please revise this PR
leeleolay
left a comment
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
please revise this PR
✅ Description
📘 Overview
This PR contributes the Feature-Guided Inverse Design (LSTMDoubleFit) model for the inverse design of organic A-site cations in low-dimensional perovskites.
The project integrates descriptor calculation, LSTM-based generative learning, and feature-constrained molecular optimization into a unified Paddle-based workflow.
This work reproduces and extends the study:
🧠 Model Workflow
Descriptor Calculation (
Cal.py)Modeldata/.Dataset Preparation
Modeldata/directory into a single dataset:Modeldata.csvwill serve as the unified training dataset.Model Training and Generation (
Best_Seq2seq.py)Feature-Guided DoubleFit Model (
MolecularDoubleFitting.py)Postprocessing
📁 Directory Structure
project/
└── Feature-Guided Inverse Design of LDPs/
├── Best_Seq2seq.py # Main LSTM model: training & molecular generation
├── Cal_ATSC1pe_MATS2c.py # Descriptor calculator (ATSC1pe, MATS2c)
├── Cal_SlogP_VSA2.py # Descriptor calculator (SlogP_VSA2)
├── MolecularDoubleFitting.py # Feature-guided molecular fitting model
├── MSEcalculation.py # Evaluation metrics
├── ModelandDataAnalysis.py # Dataset statistics & analysis
├── Modeldata/ # Folder containing split CSV datasets
├── GreatMolecular.xlsx # High-quality generated molecules
├── NewMolecules.xlsx # Newly generated candidates
├── README.md # Project documentation
└── data_parts/ # (Optional) Split dataset parts (<100 MB each)
⚙️ How to Run
1. Environment