forked from PaddlePaddle/Paddle
-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Flatten kernel refactor #8
Closed
YuanRisheng
wants to merge
9
commits into
chenwhql:op2func_refactor
from
YuanRisheng:kernel_refactor_demo
Closed
Flatten kernel refactor #8
YuanRisheng
wants to merge
9
commits into
chenwhql:op2func_refactor
from
YuanRisheng:kernel_refactor_demo
Commits on Sep 18, 2021
-
Configuration menu - View commit details
-
Copy full SHA for e761751 - Browse repository at this point
Copy the full SHA e761751View commit details
Commits on Sep 19, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 9f88d32 - Browse repository at this point
Copy the full SHA 9f88d32View commit details -
Optimization of pool2d grad (PaddlePaddle#35389)
* Optimization of pool2d grad, first commit. * remove useless print codes * refine codes * refine codes * seal more operation into template specialization * fix template struct error in MaxPool2dGrad. * Fix header including error * refine code with comment * Seal the param-preparation codes into function for common use. * Seal the param-preparation codes into function for common use. * Seal the param-preparation into funciton and make it common for other kernels * polish code and erase useless template speicalization * Rerun triger * rerun trigger
Configuration menu - View commit details
-
Copy full SHA for 8668519 - Browse repository at this point
Copy the full SHA 8668519View commit details
Commits on Sep 20, 2021
-
Reuse OneDNN handler for SGD and SUM for SelectedRows input tensors. (P…
…addlePaddle#35510) * Create stateful OneDNNAXPYHandler object. This makes it possible to call it multiple times without recreating the oneDNN primitives every time. * Prepare SGDOpKernel to reuse its implementation from OneDNN kernel. * OneDNN SGD kernel. * Update call to use new OneDNNAXPYHandler object api. * Setup seed in proper place. * Enable OneDNN kernel only for single case. * For dense param and sparse grad. * Small refactor. * Enable oneDNN by op attr or by cmd line flag. * Use int64_t type for number of elements. * Support dense param and grad from OneDNN kernel. * Enable SGD OneDNN kernel when use MP BF16 optimizer. * Force non-copyable/movable OneDNNAXPYHandler. * Reuse OneDNNAXPYHandler for spare tensors in SUM op. * Fix SFINAE rules. * Remove recording event inside AXPY. * Get rid of internal primitive caching. * Stop use PP cache mechanims to store mem and primitive obj. * Handler obj store and reuse needed desc & prim * Do not derive from MKLDNNHandlerT
Configuration menu - View commit details
-
Copy full SHA for 799f386 - Browse repository at this point
Copy the full SHA 799f386View commit details
Commits on Sep 21, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 087c23a - Browse repository at this point
Copy the full SHA 087c23aView commit details
Commits on Sep 22, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 7713430 - Browse repository at this point
Copy the full SHA 7713430View commit details -
Configuration menu - View commit details
-
Copy full SHA for 12ab017 - Browse repository at this point
Copy the full SHA 12ab017View commit details -
Merge commit 'refs/pull/34425/head' of https://github.com/PaddlePaddl…
…e/Paddle into kernel_refactor_demo
Configuration menu - View commit details
-
Copy full SHA for 3c0fea7 - Browse repository at this point
Copy the full SHA 3c0fea7View commit details
Commits on Sep 28, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 4b498c4 - Browse repository at this point
Copy the full SHA 4b498c4View commit details
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.