-
Notifications
You must be signed in to change notification settings - Fork 3k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[CPU][Kernel] Single socket spmm #3024
Merged
Merged
Changes from 17 commits
Commits
Show all changes
23 commits
Select commit
Hold shift + click to select a range
7c944c0
optimizations of spmm for CPU
sanchit-misra e68fa07
Added names of contributors
sanchit-misra 30e3951
Minor code cleanup
sanchit-misra a797833
Moved the spmm optimization code to a new header file
sanchit-misra 78cece7
Moved to DGL's logging method
sanchit-misra e47c0c0
removed duplicate code between SpMMSumCsr and SpMMCmpCsr
sanchit-misra ad78519
Changes made to follow Google coding style
sanchit-misra d51301f
Fixed lint errors in spmm.h
sanchit-misra bed06d6
Fixed some lint errors from spmm_blocking_libxsmm.h
sanchit-misra 9c941ba
Fixed lint errors from spmm_blocking_libxsmm.h
sanchit-misra e11f8e8
Added comments to SpMMCreateLibxsmmKernel
sanchit-misra 3eaa153
to enable building of tests, and other cosmetic changes
sanchit-misra ffc8520
disabling libxsmm on windows
sanchit-misra 5dec62f
Put a condition to avoid opt impl for FP64 as libxsmm does not have F…
sanchit-misra 7b8781b
Merge branch 'master' into single-socket-spmm
jermainewang 7c0a9c3
cosmetic changes and documentation
sanchit-misra 5f4ce36
Merge branch 'single-socket-spmm' of https://github.com/sanchit-misra…
sanchit-misra ada9e41
cosmetic changes
sanchit-misra 1e32045
to pass lint tests
sanchit-misra e1288f2
replaced multiple allocations for buffers of indices and edges with a…
sanchit-misra a1c3734
Merge branch 'master' into single-socket-spmm
jermainewang dae56cc
Merge branch 'master' into single-socket-spmm
jermainewang 64fbcad
Merge branch 'master' into single-socket-spmm
jermainewang File filter
Filter by extension
Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Oops, something went wrong.
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
How should users set a proper
DGL_CPU_LLC_SIZE
value?There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
This is the size of the LLC of the CPU the code is run on. This macro here is just failsafe. The code automatically gets the LLC size using sysconf() in the function getLLCSize(). Only if that fails, it uses this number. The number I have used as default here is quite safe for most server class CPUs. Only if sysconf fails and this number is also bigger than the user's LLC, then they will have to set the LLC size here.
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
In this case, will the program crash or just run with under-optimal config?
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
It won't crash. Will just run with under-optimal config.