[BUG] out of memory by se_e3 #2723
QuantumMisaka
announced in
Announcement
Replies: 1 comment
Bug summary
An out-of-memory error occurs when running DeePMD-kit with the se_e3 descriptor instead of se_e2_r.
DeePMD-kit Version
2.2.2
TensorFlow Version
2.9.0
How did you download the software?
conda
Input Files, Running Commands, Error Log, etc.
Input file
{
  "model": {
    "descriptor": {
      "type": "se_e3",
      "sel": [80, 120, 40, 40],
      "rcut_smth": 0.5,
      "rcut": 6.0,
      "neuron": [25, 50, 100],
      "resnet_dt": false,
      "seed": 19090,
      "_activation_function": "tanh"
    },
    "fitting_net": {
      "neuron": [240, 240, 240],
      "resnet_dt": true,
      "_coord_norm": true,
      "_type_fitting_net": false,
      "seed": 2375417769,
      "_activation_function": "tanh"
    },
    "type_map": ["Fe", "C", "H", "O"]
  },
  "learning_rate": {
    "type": "exp",
    "start_lr": 0.001,
    "decay_steps": 80000,
    "stop_lr": 5e-08,
    "_decay_rate": 0.95
  },
  "loss": {
    "start_pref_e": 0.02,
    "limit_pref_e": 2,
    "start_pref_f": 1000,
    "limit_pref_f": 1,
    "start_pref_v": 0.01,
    "limit_pref_v": 1
  },
  "training": {
    "training_data": {
      "systems": "../dpdata_23265/",
      "batch_size": "auto"
    },
    "numb_steps": 16000000,
    "_comment": "that's all",
    "disp_file": "lcurve.out",
    "disp_freq": 2000,
    "numb_test": 1,
    "save_freq": 20000,
    "save_ckpt": "model.ckpt",
    "disp_training": true,
    "time_training": true,
    "profiling": false,
    "profiling_file": "timeline.json"
  }
}
Running by
Error message:
Steps to Reproduce
Run dp train on an A100-40G machine.
Further Information, Files, and Links
The same settings work with the se_e2_r descriptor, using 5000 MiB of memory.
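For a rough sense of why the two descriptors can differ so much in memory, here is a back-of-the-envelope sketch. It assumes (this is not confirmed anywhere in this report) that the three-body descriptor runs its embedding network on pairs of neighbors for each pair of neighbor types, while the radial descriptor runs it once per neighbor. The function names are made up for illustration. The counts cover only embedding-net inputs per atom, not actual memory use.

```python
from itertools import combinations_with_replacement

# Values copied from the input file in this report.
sel = [80, 120, 40, 40]      # max neighbors per type: Fe, C, H, O
neuron = [25, 50, 100]       # embedding-net layer widths

def radial_entries(sel):
    """Embedding-net inputs per atom if each neighbor is embedded once
    (assumed behaviour of a radial two-body descriptor)."""
    return sum(sel)

def three_body_entries(sel):
    """Embedding-net inputs per atom if every neighbor pair is embedded,
    looping over unordered type pairs (i <= j). This is an assumption
    about how a three-body descriptor scales, not a measurement."""
    return sum(a * b for a, b in combinations_with_replacement(sel, 2))

n_radial = radial_entries(sel)
n_three_body = three_body_entries(sel)

print("radial entries per atom:    ", n_radial)
print("three-body entries per atom:", n_three_body)
print("ratio:                      ", n_three_body / n_radial)

# Size of the last embedding layer's activations per atom, assuming
# 8-byte floats. Gradients and earlier layers would add to this.
bytes_per_atom = n_three_body * neuron[-1] * 8
print("last-layer activations per atom (MiB):", bytes_per_atom / 2**20)
```

If that assumption holds, the count grows with the square of the sel values, so reducing sel (or rcut, which allows a smaller sel) should shrink the footprint much faster for se_e3 than for se_e2_r.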