Replies: 1 comment 3 replies
The data is shuffled within a "set". In your case this means that data from different temperatures will not be mixed. If you want to mix data from different temperatures, you may use the |
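To illustrate the point about shuffling within a set, here is a minimal sketch in plain Python. It does not call dpdata or DeePMD-kit; it only mimics cutting an ordered trajectory into fixed-size sets. The frame counts are scaled down from 20,000 to 20 per temperature, the names `split_into_sets` and `temperatures_in` are hypothetical helpers, and it assumes (as in the question) that frames are stored in temperature order and that the final set serves as the testing set.

```python
import random

TEMPERATURES = [1500, 1300, 1100, 900, 700, 500]  # K, as in the question
FRAMES_PER_TEMP = 20  # scaled down from 20,000 for illustration
SET_SIZE = 20         # scaled down from set_size=20000

# labels[i] is the temperature at which frame i was sampled;
# frames are assumed to be stored in the order they were generated.
labels = [t for t in TEMPERATURES for _ in range(FRAMES_PER_TEMP)]


def split_into_sets(indices, set_size):
    """Cut a list of frame indices into consecutive chunks of set_size."""
    return [indices[i:i + set_size] for i in range(0, len(indices), set_size)]


def temperatures_in(frame_set):
    """Return the distinct temperatures present in one set, hottest first."""
    return sorted({labels[i] for i in frame_set}, reverse=True)


# Case 1: split the ordered trajectory directly.
# Shuffling inside a set afterwards cannot change which temperatures it holds.
ordered = list(range(len(labels)))
contiguous_sets = split_into_sets(ordered, SET_SIZE)

# Case 2: shuffle all frame indices once, then split.
rng = random.Random(0)  # fixed seed so the sketch is reproducible
shuffled = ordered[:]
rng.shuffle(shuffled)
mixed_sets = split_into_sets(shuffled, SET_SIZE)

print("last set, contiguous split:", temperatures_in(contiguous_sets[-1]))
print("last set, shuffled first:  ", temperatures_in(mixed_sets[-1]))
```

With the contiguous split, the last set contains only the 500 K frames, so a test on that set says nothing about the higher temperatures. Shuffling the frame indices before splitting is one way to spread the temperatures across every set; whether that suits a given workflow is a judgment call.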
Hi all,
I'm wondering whether there is a more flexible way to prepare raw data.
I noticed in #460 that the kit can shuffle the order of frames automatically.
In my calculation, I collected output data from a quenching process, from 1500 K down to 500 K at intervals of 200 K.
The total number of frames is 120,000, i.e., 20,000 frames at each temperature.
The raw data is pre-processed with the dpdata package. Here, I use set_size=20000 to divide my raw data into 5 training sets and 1 testing set.
I want to know whether the shuffling done by DeePMD-kit would put data from all temperatures into the testing set. In other words, is this simple setting reasonable?
By the way, should the data from the cooling stages themselves (e.g., the ramp from 1500 K to 1300 K) be included in training? To the best of my knowledge it usually is not, at least in the literature I have read.
Any feedback is welcome.