Skip to content
Discussion options

You must be logged in to vote

using 4 GPU effective increase the batch size by 4 times, but it does not always leads to faster decay of the error by 4 times. One may have to test case by case

for dpa-1 and se_a descriptors, smaller batch size (like auto:32) is usually preferred, while for dpa-2 and dpa-3, larger batch size may speedup the training.

Replies: 1 comment 3 replies

Comment options

You must be logged in to vote
3 replies
@roger13231
Comment options

@wanghan-iapcm
Comment options

Answer selected by roger13231
@roger13231
Comment options

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants