Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Option to store Choleskies in shared memory #236

Open
shufay opened this issue Jun 20, 2023 · 1 comment
Open

Option to store Choleskies in shared memory #236

shufay opened this issue Jun 20, 2023 · 1 comment
Assignees
Labels
enhancement New feature or request

Comments

@shufay
Copy link
Collaborator

shufay commented Jun 20, 2023

With MPI and sufficiently large basis set sizes, storing copies of Choleskies in each task can lead to out-of-memory errors. Would it be possible to implement the option of storing 1 copy of Choleskies in shared memory accessible to all tasks?

@shufay shufay added the enhancement New feature or request label Jun 20, 2023
@fdmalone
Copy link
Collaborator

Could you provide some more information? Where do you see the problem? What size are the choleskies? how much memory per node do you have? How many walkers are you using? An example script would be very helpful here, or at least a minimal problem which reproduces it.

Currently we DO store the choleskies in shared memory (one copy of the full thing + the half rotated a/b tensors per node) using MPI3 or at least we did, but with all the infrastructure changes it's possible something was modified, or maybe you're hitting some other limit / issue.

On a separate note we should print out some more detailed (dynamic) memory consumption information.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

3 participants