Inconsistent results from qmca when analysing multiple files #4556

prckent · 2023-04-17T17:45:24Z

Describe the bug
While checking #4549 I noticed that at some recent point we have reordered the scalar.dat output. The columns are correctly labeled. Unfortunately, when files of different versions are mixedm qmca doesn't completely handle the situation and gives inconsistent results . They are nearly OK which suggests the correct data is being used for the most part.

To Reproduce

Analyze different versions of scalar files simultaneously. In the following notice the debug* results are reported consistently but the orig* ones are not. Quantities other than the energy are suspect, e.g. samples, although I noticed this first by the energy.

Archive of the respective files attached. Used qmca from develop.

$ qmca -q e */*/*.scalar.dat
debug/vmc_perf_pbe_u_None_3x3x1_1x1x1_4990_ccecparep_legacy_500_500_1.5_50_True_hf/vmc  series 0  LocalEnergy           =  -197.290314 +/- 0.029507 
debug/vmc_perf_pbe_u_None_3x3x1_1x1x1_5000_ccecparep_legacy_500_500_1.5_50_True_hf/vmc  series 0  LocalEnergy           =  -197.336379 +/- 0.093084 
debug/vmc_perf_pbe_u_None_3x3x1_1x1x1_5010_ccecparep_legacy_500_500_1.5_50_True_hf/vmc  series 0  LocalEnergy           =  -197.176166 +/- 0.030095 
orig/vmc_perf_pbe_u_None_3x3x1_1x1x1_4990_ccecparep_legacy_500_500_1.5_50_True_hf/vmc  series 0  LocalEnergy           =  -198.202521 +/- 0.026945 
orig/vmc_perf_pbe_u_None_3x3x1_1x1x1_5000_ccecparep_legacy_500_500_1.5_50_True_hf/vmc  series 0  LocalEnergy           =  -197.273603 +/- 0.026549 
orig/vmc_perf_pbe_u_None_3x3x1_1x1x1_5010_ccecparep_legacy_500_500_1.5_50_True_hf/vmc  series 0  LocalEnergy           =  -198.168970 +/- 0.031373 
$ qmca -q e d*/*/*.scalar.dat
debug/vmc_perf_pbe_u_None_3x3x1_1x1x1_4990_ccecparep_legacy_500_500_1.5_50_True_hf/vmc  series 0  LocalEnergy           =  -197.290314 +/- 0.029507 
debug/vmc_perf_pbe_u_None_3x3x1_1x1x1_5000_ccecparep_legacy_500_500_1.5_50_True_hf/vmc  series 0  LocalEnergy           =  -197.336379 +/- 0.093084 
debug/vmc_perf_pbe_u_None_3x3x1_1x1x1_5010_ccecparep_legacy_500_500_1.5_50_True_hf/vmc  series 0  LocalEnergy           =  -197.176166 +/- 0.030095 
$ qmca -q e o*/*/*.scalar.dat
orig/vmc_perf_pbe_u_None_3x3x1_1x1x1_4990_ccecparep_legacy_500_500_1.5_50_True_hf/vmc  series 0  LocalEnergy           =  -198.197432 +/- 0.026929 
orig/vmc_perf_pbe_u_None_3x3x1_1x1x1_5000_ccecparep_legacy_500_500_1.5_50_True_hf/vmc  series 0  LocalEnergy           =  -197.274287 +/- 0.026617 
orig/vmc_perf_pbe_u_None_3x3x1_1x1x1_5010_ccecparep_legacy_500_500_1.5_50_True_hf/vmc  series 0  LocalEnergy           =  -198.168970 +/- 0.031373 
$ head -1 */*/*.scalar.dat
==> debug/vmc_perf_pbe_u_None_3x3x1_1x1x1_4990_ccecparep_legacy_500_500_1.5_50_True_hf/vmc.s000.scalar.dat <==
#   index    LocalEnergy         LocalEnergy_sq      LocalPotential      Kinetic             LocalECP            NonLocalECP         ElecElec            IonIon              MPC                 KEcorr              BlockWeight         BlockCPU            AcceptRatio         

==> debug/vmc_perf_pbe_u_None_3x3x1_1x1x1_5000_ccecparep_legacy_500_500_1.5_50_True_hf/vmc.s000.scalar.dat <==
#   index    LocalEnergy         LocalEnergy_sq      LocalPotential      Kinetic             LocalECP            NonLocalECP         ElecElec            IonIon              MPC                 KEcorr              BlockWeight         BlockCPU            AcceptRatio         

==> debug/vmc_perf_pbe_u_None_3x3x1_1x1x1_5010_ccecparep_legacy_500_500_1.5_50_True_hf/vmc.s000.scalar.dat <==
#   index    LocalEnergy         LocalEnergy_sq      LocalPotential      Kinetic             LocalECP            NonLocalECP         ElecElec            IonIon              MPC                 KEcorr              BlockWeight         BlockCPU            AcceptRatio         

==> orig/vmc_perf_pbe_u_None_3x3x1_1x1x1_4990_ccecparep_legacy_500_500_1.5_50_True_hf/vmc.s000.scalar.dat <==
#   index    LocalEnergy         LocalEnergy_sq      LocalPotential      Kinetic             ElecElec            IonIon              LocalECP            NonLocalECP         MPC                 KEcorr              BlockWeight         BlockCPU            AcceptRatio         

==> orig/vmc_perf_pbe_u_None_3x3x1_1x1x1_5000_ccecparep_legacy_500_500_1.5_50_True_hf/vmc.s000.scalar.dat <==
#   index    LocalEnergy         LocalEnergy_sq      LocalPotential      Kinetic             ElecElec            IonIon              LocalECP            NonLocalECP         MPC                 KEcorr              BlockWeight         BlockCPU            AcceptRatio         

==> orig/vmc_perf_pbe_u_None_3x3x1_1x1x1_5010_ccecparep_legacy_500_500_1.5_50_True_hf/vmc.s000.scalar.dat <==
#   index    LocalEnergy         LocalEnergy_sq      LocalPotential      Kinetic             ElecElec            IonIon              LocalECP            NonLocalECP         MPC                 KEcorr              BlockWeight         BlockCPU            AcceptRatio

Expected behavior

Either abort in this scenario (easiest) or give consistent results.

System:

Develop, runs & analysis on nitrogen2 with amdclang cpu nightly configuration.

Additional context

scalar.tar.zip

Add any other context about the problem here.

The text was updated successfully, but these errors were encountered:

ye-luo · 2023-04-17T18:23:24Z

FYI. If you lock the equlibration steps, consistent results can be obtained.

yeluo@yeluo-thinkpad-x13:~/Downloads/scalar$ qmca -q e -e 5 */*/*.scalar.dat
debug/vmc_perf_pbe_u_None_3x3x1_1x1x1_4990_ccecparep_legacy_500_500_1.5_50_True_hf/vmc  series 0  LocalEnergy           =  -197.288722 +/- 0.029702 
debug/vmc_perf_pbe_u_None_3x3x1_1x1x1_5000_ccecparep_legacy_500_500_1.5_50_True_hf/vmc  series 0  LocalEnergy           =  -197.337095 +/- 0.093857 
debug/vmc_perf_pbe_u_None_3x3x1_1x1x1_5010_ccecparep_legacy_500_500_1.5_50_True_hf/vmc  series 0  LocalEnergy           =  -197.174814 +/- 0.030334 
orig/vmc_perf_pbe_u_None_3x3x1_1x1x1_4990_ccecparep_legacy_500_500_1.5_50_True_hf/vmc  series 0  LocalEnergy           =  -198.214318 +/- 0.025597 
orig/vmc_perf_pbe_u_None_3x3x1_1x1x1_5000_ccecparep_legacy_500_500_1.5_50_True_hf/vmc  series 0  LocalEnergy           =  -197.276979 +/- 0.026209 
orig/vmc_perf_pbe_u_None_3x3x1_1x1x1_5010_ccecparep_legacy_500_500_1.5_50_True_hf/vmc  series 0  LocalEnergy           =  -198.174400 +/- 0.031409 
yeluo@yeluo-thinkpad-x13:~/Downloads/scalar$ qmca -q e -e 5 d*/*/*.scalar.dat
debug/vmc_perf_pbe_u_None_3x3x1_1x1x1_4990_ccecparep_legacy_500_500_1.5_50_True_hf/vmc  series 0  LocalEnergy           =  -197.288722 +/- 0.029702 
debug/vmc_perf_pbe_u_None_3x3x1_1x1x1_5000_ccecparep_legacy_500_500_1.5_50_True_hf/vmc  series 0  LocalEnergy           =  -197.337095 +/- 0.093857 
debug/vmc_perf_pbe_u_None_3x3x1_1x1x1_5010_ccecparep_legacy_500_500_1.5_50_True_hf/vmc  series 0  LocalEnergy           =  -197.174814 +/- 0.030334 
yeluo@yeluo-thinkpad-x13:~/Downloads/scalar$ qmca -q e -e 5 o*/*/*.scalar.dat
orig/vmc_perf_pbe_u_None_3x3x1_1x1x1_4990_ccecparep_legacy_500_500_1.5_50_True_hf/vmc  series 0  LocalEnergy           =  -198.214318 +/- 0.025597 
orig/vmc_perf_pbe_u_None_3x3x1_1x1x1_5000_ccecparep_legacy_500_500_1.5_50_True_hf/vmc  series 0  LocalEnergy           =  -197.276979 +/- 0.026209 
orig/vmc_perf_pbe_u_None_3x3x1_1x1x1_5010_ccecparep_legacy_500_500_1.5_50_True_hf/vmc  series 0  LocalEnergy           =  -198.174400 +/- 0.031409

jtkrogel · 2023-04-17T18:33:58Z

So this relates just to the default method of selecting the equilibration length? If so, the observed behavior is not a bug. That algorithm uses a random number generator to avoid bias in the selection process.

ye-luo · 2023-04-17T18:38:47Z

I would agree with @prckent that qmca should not give a different set of results.

equilibration should be independent per scalar.dat.
do we really have benefit from adding a random number?

jtkrogel · 2023-04-17T18:43:37Z

If you choose either edge of the selection window, you will systematically bias the result to be high or low. If you choose the middle, you are biasing the result toward the mean of the last half of the data.

Up to you/choose your bias.

markdewing · 2023-04-17T18:45:03Z

While I appreciate the need to have a random number to avoid bias, the randomness has caught me out several times when looking at run-to-run reproducibility from qmca.

There's always the xkcd solution :)
https://xkcd.com/221/

jtkrogel · 2023-04-17T18:45:54Z

If you don't want to choose, but like the aesthetics of same-input/same-output, we could just make a hash of the input data and use that to seed the rng.

jtkrogel · 2023-04-18T13:54:41Z

Thoughts here?

prckent · 2023-04-18T14:05:35Z

I haven't had any chance since yesterday to examine this but I think at least a simple fix is needed here. First of all, the use of randomness isn't documented or at least I totally missed it and was surprised. This could be fixed with a few extra sentences in the help docs. Second, and I consider this an actual bug, my understanding is that the analysis results today depend on the order that the files are specified in or if they are analyzed separately. One solution would be to use the same seed for each file. Third, if we are going to use randomness at all, there needs to be a way of setting the seed. I favor the XKCD solution suggested by Mark as the default with an option for using a robust initialization from recent numpy. If the randomness is only used to find the equilibration period, I am pretty sure that there are several published/peer-reviewed schemes that do not require any randomness.

jtkrogel · 2023-04-18T14:15:05Z

The equilibration detection algorithm is heuristic and meant for convenience (good faith attempt at statistically valid result) rather than to be relied on in all cases for production. I will implement a modification that sets the seed reproducibly for each file.

In general, a more deeply vetted scheme is desirable. If one is proposed here, I will look into implementing it.

jtkrogel · 2023-04-18T14:50:45Z

See #4557

markdewing · 2023-04-18T15:24:24Z

I'm concerned about creating a situation where the inconsistent results are more subtle. The hash fix reduces the chances of running into inconsistent results, but doesn't eliminate them completely (will give details on the PR).

Is there any way to get the equilibration time as an output from qmca? It shows up in the trace graph, but is there any textual output? Should be something printed by default, at least with "-e auto"?. The cost is more complicated output, but it might be a good reminder that the equilibration time is being computed and removed from the data.

prckent added the bug label Apr 17, 2023

prckent changed the title ~~Inconsistent results from qmca when using different scalar.dat versions~~ Inconsistent results from qmca when analysing multiple files Apr 18, 2023

jtkrogel mentioned this issue Apr 18, 2023

Nexus: use data hashing in heuristic equilibration detection algorithm #4557

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Inconsistent results from qmca when analysing multiple files #4556

Inconsistent results from qmca when analysing multiple files #4556

prckent commented Apr 17, 2023 •

edited

Loading

ye-luo commented Apr 17, 2023

jtkrogel commented Apr 17, 2023

ye-luo commented Apr 17, 2023

jtkrogel commented Apr 17, 2023

markdewing commented Apr 17, 2023

jtkrogel commented Apr 17, 2023 •

edited

Loading

jtkrogel commented Apr 18, 2023

prckent commented Apr 18, 2023 •

edited

Loading

jtkrogel commented Apr 18, 2023 •

edited

Loading

jtkrogel commented Apr 18, 2023

markdewing commented Apr 18, 2023

Inconsistent results from qmca when analysing multiple files #4556

Inconsistent results from qmca when analysing multiple files #4556

Comments

prckent commented Apr 17, 2023 • edited Loading

ye-luo commented Apr 17, 2023

jtkrogel commented Apr 17, 2023

ye-luo commented Apr 17, 2023

jtkrogel commented Apr 17, 2023

markdewing commented Apr 17, 2023

jtkrogel commented Apr 17, 2023 • edited Loading

jtkrogel commented Apr 18, 2023

prckent commented Apr 18, 2023 • edited Loading

jtkrogel commented Apr 18, 2023 • edited Loading

jtkrogel commented Apr 18, 2023

markdewing commented Apr 18, 2023

prckent commented Apr 17, 2023 •

edited

Loading

jtkrogel commented Apr 17, 2023 •

edited

Loading

prckent commented Apr 18, 2023 •

edited

Loading

jtkrogel commented Apr 18, 2023 •

edited

Loading