-
Notifications
You must be signed in to change notification settings - Fork 139
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
DLA implementation and speed #1973
Comments
@ye-luo can comment on the speed vs LA or TM. To use Summit with good efficiency will require the new batched drivers (e.g. #1947, done), various dependencies such as population control (e.g. #1819 ), plus updates for observables. We are targeting the new INCITE year for this and will check with you about exactly what you need first. |
DLA should not significantly impact performance. However if it affects the random walking a lot and acceptance ratio is increased, the whole code can be slower. |
I think we can close this provided the results are correct and we are not doing any extra work (redundant wavefunction evaluations, ratios etc.). OK to close @zenandrea ? |
@prckent |
@zenandrea |
@zenandrea We do print a line if DLA is enabled.
What is your DLA input line? |
@jtkrogel Fantastic, many thanks @ye-luo Ah, now I see it. It is in the section of the ECP potential. Just to know, does that mean that if I set DLA="yes" and T-moves (i.e., nonlocalmoves=v1 or v3) it does the tmove with the determinant only (which is what I called DLTM in the manuscript)? |
@zenandrea In this case, "Using Locality Approximation" is a bit misleading. I need to think about how deliver more precise messages.
Yes. DLA and T-move can work together. T-move uses T probability computed during NLPP calculation. |
@ye-luo @prckent
I am testing the DLA implementation (from the develop version of Oct 1st, as v.3.8.0 does not implement it), and I am observing that DLA is around 20% slower than LA or TM.
This is quite unexpected, as DLA should perform a smaller number of operations than LA and TM, so it should be systematically faster.
Am I missing something?
By the way, I have seen that DLA works on the GPU implementation, so I could in principle use it in Summit.
However, I have noticed in
https://cdash.qmcpack.org/CDash/index.php?project=QMCPACK
that there are some failures in the tests of summit.olcf.ornl.gov/gcc6.4-Real-SoA-Release and on some other cluster (e.g. oxygen.ornl.gov) with CUDA-Release.
Thus, I wonder if it is save to use on summit the development version for DMC calculations with spline and DLA?
The text was updated successfully, but these errors were encountered: