atomic energy after linear regression #4069
Unanswered
chtchelkatchev
asked this question in
Q&A
Replies: 1 comment 1 reply
-
Hi @chtchelkatchev , thank you for your question. Indeed, the documentation for these parameters needs to be improved. However, I have a concern: theoretically, only rank 0 should perform the statistics, but it appears that all ranks are doing this in your case. Could you please provide your execution command, code version (and an input example if possible)? Have you followed the multi-GPU training documentation properly? |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
Hi,
At the beginning of the finetuning of a deepmd model on two GPUs I get the following info in the logs:
[1] DEEPMD rank:1 INFO RMSE of atomic energy after linear regression is: 0.023360175044598894 eV/atom.
[1] DEEPMD rank:1 INFO Change energy bias of ['B', 'S', 'Se'] from [-11.62286998 -1.07731472 21.40182854] to [-11.62282221 -1.04902143 21.44841567].
[0] DEEPMD rank:0 INFO RMSE of atomic energy after linear regression is: 0.029234642187003657 eV/atom.
[0] DEEPMD rank:0 INFO Change energy bias of ['B', 'S', 'Se'] from [-11.62286998 -1.07731472 21.40182854] to [-11.5774448 -1.28134359 21.26522962].
The problem is that calculated energy shifts are very inaccurate and they depend on the task number. This problem is probably related to insufficient statistics (my dataset is good). I found the parameters "data_stat_nch", "data_protect" and "bias_sample" in the manual. By increasing "bias_nsample" to 1000, I got better statistics and less different energy shifts. Can you please expand on the meaning of these parameters "data_stat_nch", "data_protect" and "bias_sample" in the manual and provide recommendations on how to optimize accuracy in calculating energy shifts?
A big request for the next version of Deepmd is to ensure that, by default, the energy shift is calculated as accurately as possible during tuning (across the entire dataset).
Beta Was this translation helpful? Give feedback.
All reactions