AttributeError: 'NoneType' object has no attribute 'model_checkpoint_path' #679
Unanswered
wangxinxiny
asked this question in
Q&A
Replies: 1 comment 6 replies
-
|
Beta Was this translation helpful? Give feedback.
6 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
-
When i run the dpgen,i meet a error:/home/customer/.local/lib/python3.9/site-packages/gromacs/init.py:286: GromacsImportWarning: Some Gromacs commands were NOT found; maybe source GMXRC first? The following are missing:
['release']
warnings.warn("Some Gromacs commands were NOT found; "
INFO:dpgen:-------------------------iter.000000 task 01--------------------------
/home/customer/.local/lib/python3.9/site-packages/dpgen/generator/run.py:584: UserWarning: the dpdispatcher will be updated to new version.And the interface may be changed. Please check the documents for more details
warnings.warn(f"the dpdispatcher will be updated to new version."
INFO:dpgen:cannot find key "batch" in machine file, try to use deprecated key "machine_type"
/home/customer/.local/lib/python3.9/site-packages/paramiko/client.py:837: UserWarning: Unknown ssh-ed25519 host key for 172.16.10.20: b'bd9729f31ac9072764c4a091c0c5b3e0'
warnings.warn(
INFO:dpgen:new submission of fb8ed06d-10d2-4eb5-b2e5-81afdb672131 for chunk 8aefb06c426e07a0a671a1e2488b4858d694a730
INFO:dpgen:new submission of cd139241-38a9-4a22-9fa1-d999333ecbb8 for chunk e193a01ecf8d30ad0affefd332ce934e32ffce72
INFO:dpgen:new submission of a94eca34-18d9-433d-872c-e24e72e387b6 for chunk 6fc978af728d43c59faa400d5f6e0471ac850d4c
INFO:dpgen:new submission of 79a2c2c8-38e3-4d2d-a530-5eb2191dda22 for chunk 221407c03ae5c73109cce71d27e24637824f3333
INFO:dpgen:job fb8ed06d-10d2-4eb5-b2e5-81afdb672131 finished
WARNING:dpgen:Get error code 2 in calling tar czfh fb8ed06d-10d2-4eb5-b2e5-81afdb672131.tar.gz 000/frozen_model.pb 000/lcurve.out 000/train.log 000/model.ckpt.meta 000/model.ckpt.index 000/model.ckpt.data-00000-of-00001 000/checkpoint through ssh with job: fb8ed06d-10d2-4eb5-b2e5-81afdb672131 . message: tar: 000/frozen_model.pb:无法 stat: 没有那个文件或目录
tar: 000/lcurve.out:无法 stat: 没有那个文件或目录
tar: 000/model.ckpt.meta:无法 stat: 没有那个文件或目录
tar: 000/model.ckpt.index:无法 stat: 没有那个文件或目录
tar: 000/model.ckpt.data-00000-of-00001:无法 stat: 没有那个文件或目录
tar: 000/checkpoint:无法 stat: 没有那个文件或目录
tar: 由于前次错误,将以上次的错误状态退出
WARNING:dpgen:Sleep 60 s and retry the command...
WARNING:dpgen:Get error code 2 in calling tar czfh fb8ed06d-10d2-4eb5-b2e5-81afdb672131.tar.gz 000/frozen_model.pb 000/lcurve.out 000/train.log 000/model.ckpt.meta 000/model.ckpt.index 000/model.ckpt.data-00000-of-00001 000/checkpoint through ssh with job: fb8ed06d-10d2-4eb5-b2e5-81afdb672131 . message: tar: 000/frozen_model.pb:无法 stat: 没有那个文件或目录
tar: 000/lcurve.out:无法 stat: 没有那个文件或目录
tar: 000/model.ckpt.meta:无法 stat: 没有那个文件或目录
tar: 000/model.ckpt.index:无法 stat: 没有那个文件或目录
tar: 000/model.ckpt.data-00000-of-00001:无法 stat: 没有那个文件或目录
tar: 000/checkpoint:无法 stat: 没有那个文件或目录
tar: 由于前次错误,将以上次的错误状态退出
WARNING:dpgen:Sleep 60 s and retry the command...
WARNING:dpgen:Get error code 2 in calling tar czfh fb8ed06d-10d2-4eb5-b2e5-81afdb672131.tar.gz 000/frozen_model.pb 000/lcurve.out 000/train.log 000/model.ckpt.meta 000/model.ckpt.index 000/model.ckpt.data-00000-of-00001 000/checkpoint through ssh with job: fb8ed06d-10d2-4eb5-b2e5-81afdb672131 . message: tar: 000/frozen_model.pb:无法 stat: 没有那个文件或目录
tar: 000/lcurve.out:无法 stat: 没有那个文件或目录
tar: 000/model.ckpt.meta:无法 stat: 没有那个文件或目录
tar: 000/model.ckpt.index:无法 stat: 没有那个文件或目录
tar: 000/model.ckpt.data-00000-of-00001:无法 stat: 没有那个文件或目录
tar: 000/checkpoint:无法 stat: 没有那个文件或目录
tar: 由于前次错误,将以上次的错误状态退出
WARNING:dpgen:Sleep 60 s and retry the command...
Traceback (most recent call last):
File "/home/customer/.local/bin/dpgen", line 8, in
sys.exit(main())
File "/home/customer/.local/lib/python3.9/site-packages/dpgen/main.py", line 175, in main
args.func(args)
File "/home/customer/.local/lib/python3.9/site-packages/dpgen/generator/run.py", line 2997, in gen_run
run_iter (args.PARAM, args.MACHINE)
File "/home/customer/.local/lib/python3.9/site-packages/dpgen/generator/run.py", line 2962, in run_iter
run_train (ii, jdata, mdata)
File "/home/customer/.local/lib/python3.9/site-packages/dpgen/generator/run.py", line 587, in run_train
dispatcher.run_jobs(mdata['train_resources'],
File "/home/customer/.local/lib/python3.9/site-packages/dpgen/dispatcher/Dispatcher.py", line 95, in run_jobs
while not self.all_finished(job_handler, mark_failure) :
File "/home/customer/.local/lib/python3.9/site-packages/dpgen/dispatcher/Dispatcher.py", line 230, in all_finished
rjob['context'].download(task_chunks[idx], backward_task_files)
File "/home/customer/.local/lib/python3.9/site-packages/dpgen/dispatcher/SSHContext.py", line 185, in download
self._get_files(file_list)
File "/home/customer/.local/lib/python3.9/site-packages/dpgen/dispatcher/SSHContext.py", line 310, in _get_files
self.block_checkcall('tar czfh %s %s' % (of, " ".join(files)))
File "/home/customer/.local/lib/python3.9/site-packages/dpgen/dispatcher/SSHContext.py", line 201, in block_checkcall
return self.block_checkcall(cmd, retry=retry+1)
File "/home/customer/.local/lib/python3.9/site-packages/dpgen/dispatcher/SSHContext.py", line 201, in block_checkcall
return self.block_checkcall(cmd, retry=retry+1)
File "/home/customer/.local/lib/python3.9/site-packages/dpgen/dispatcher/SSHContext.py", line 201, in block_checkcall
return self.block_checkcall(cmd, retry=retry+1)
File "/home/customer/.local/lib/python3.9/site-packages/dpgen/dispatcher/SSHContext.py", line 202, in block_checkcall
raise RuntimeError("Get error code %d in calling %s through ssh with job: %s . message: %s" %
RuntimeError: Get error code 2 in calling tar czfh fb8ed06d-10d2-4eb5-b2e5-81afdb672131.tar.gz 000/frozen_model.pb 000/lcurve.out 000/train.log 000/model.ckpt.meta 000/model.ckpt.index 000/model.ckpt.data-00000-of-00001 000/checkpoint through ssh with job: fb8ed06d-10d2-4eb5-b2e5-81afdb672131 . message: tar: 000/frozen_model.pb:无法 stat: 没有那个文件或目录
tar: 000/lcurve.out:无法 stat: 没有那个文件或目录
tar: 000/model.ckpt.meta:无法 stat: 没有那个文件或目录
tar: 000/model.ckpt.index:无法 stat: 没有那个文件或目录
tar: 000/model.ckpt.data-00000-of-00001:无法 stat: 没有那个文件或目录
tar: 000/checkpoint:无法 stat: 没有那个文件或目录
tar: 由于前次错误,将以上次的错误状态退出
the train.log:
WARNING:tensorflow:From /home/customer/Software/anaconda/anaconda3/envs/deepmd/lib/python3.9/site-packages/tensorflow/python/compat/v2_compat.py:96: disable_resource_variables (from tensorflow.python.ops.variable_scope) is deprecated and will be removed in a future version.
Instructions for updating:
non-resource variables are not supported in the long term
WARNING:root:Environment variable KMP_BLOCKTIME is empty. Use the default value 0
WARNING:root:Environment variable KMP_AFFINITY is empty. Use the default value granularity=fine,verbose,compact,1,0
/home/customer/Software/anaconda/anaconda3/envs/deepmd/lib/python3.9/importlib/init.py:169: UserWarning: The NumPy module was reloaded (imported a second time). This can in some cases result in small but subtle issues and is discouraged.
_bootstrap.exec(spec, module)
/home/customer/Software/anaconda/anaconda3/envs/deepmd/lib/python3.9/site-packages/deepmd/utils/compat.py:316: UserWarning: It seems that you are using a deepmd-kit input of version 1.x.x, which is deprecated. we have converted the input to >2.0.0 compatible, and output it to file input_v2_compat.json
warnings.warn(msg)
Traceback (most recent call last):
File "/home/customer/Software/anaconda/anaconda3/envs/deepmd/bin/dp", line 10, in
sys.exit(main())
File "/home/customer/Software/anaconda/anaconda3/envs/deepmd/lib/python3.9/site-packages/deepmd/entrypoints/main.py", line 437, in main
train_dp(**dict_args)
File "/home/customer/Software/anaconda/anaconda3/envs/deepmd/lib/python3.9/site-packages/deepmd/entrypoints/train.py", line 90, in train
jdata = update_sel(jdata)
File "/home/customer/Software/anaconda/anaconda3/envs/deepmd/lib/python3.9/site-packages/deepmd/entrypoints/train.py", line 329, in update_sel
descrpt_data = update_one_sel(jdata, descrpt_data)
File "/home/customer/Software/anaconda/anaconda3/envs/deepmd/lib/python3.9/site-packages/deepmd/entrypoints/train.py", line 305, in update_one_sel
tmp_sel = get_sel(jdata, rcut)
File "/home/customer/Software/anaconda/anaconda3/envs/deepmd/lib/python3.9/site-packages/deepmd/entrypoints/train.py", line 268, in get_sel
, max_nbor_size = get_nbor_stat(jdata, rcut)
File "/home/customer/Software/anaconda/anaconda3/envs/deepmd/lib/python3.9/site-packages/deepmd/entrypoints/train.py", line 253, in get_nbor_stat
train_data = get_data(jdata["training"]["training_data"], max_rcut, type_map, None)
File "/home/customer/Software/anaconda/anaconda3/envs/deepmd/lib/python3.9/site-packages/deepmd/entrypoints/train.py", line 197, in get_data
data = DeepmdDataSystem(
File "/home/customer/Software/anaconda/anaconda3/envs/deepmd/lib/python3.9/site-packages/deepmd/utils/data_system.py", line 79, in init
DeepmdData(
File "/home/customer/Software/anaconda/anaconda3/envs/deepmd/lib/python3.9/site-packages/deepmd/utils/data.py", line 60, in init
atom_type = [type_map.index(self.type_map[ii]) for ii in self.atom_type]
File "/home/customer/Software/anaconda/anaconda3/envs/deepmd/lib/python3.9/site-packages/deepmd/utils/data.py", line 60, in
atom_type = [type_map.index(self.type_map[ii]) for ii in self.atom_type]
ValueError: 'Re' is not in list
WARNING:tensorflow:From /home/customer/Software/anaconda/anaconda3/envs/deepmd/lib/python3.9/site-packages/tensorflow/python/compat/v2_compat.py:96: disable_resource_variables (from tensorflow.python.ops.variable_scope) is deprecated and will be removed in a future version.
Instructions for updating:
non-resource variables are not supported in the long term
WARNING:root:Environment variable KMP_BLOCKTIME is empty. Use the default value 0
WARNING:root:Environment variable KMP_AFFINITY is empty. Use the default value granularity=fine,verbose,compact,1,0
/home/customer/Software/anaconda/anaconda3/envs/deepmd/lib/python3.9/importlib/init.py:169: UserWarning: The NumPy module was reloaded (imported a second time). This can in some cases result in small but subtle issues and is discouraged.
_bootstrap._exec(spec, module)
Traceback (most recent call last):
File "/home/customer/Software/anaconda/anaconda3/envs/deepmd/bin/dp", line 10, in
sys.exit(main())
File "/home/customer/Software/anaconda/anaconda3/envs/deepmd/lib/python3.9/site-packages/deepmd/entrypoints/main.py", line 439, in main
freeze(**dict_args)
File "/home/customer/Software/anaconda/anaconda3/envs/deepmd/lib/python3.9/site-packages/deepmd/entrypoints/freeze.py", line 141, in freeze
input_checkpoint = checkpoint.model_checkpoint_path
AttributeError: 'NoneType' object has no attribute 'model_checkpoint_path'
How can i solve it?
Beta Was this translation helpful? Give feedback.
All reactions