
Flatten kernel refactor #8

Closed

Commits on Sep 18, 2021

  1. e761751

Commits on Sep 19, 2021

  1. 9f88d32
  2. Optimization of pool2d grad (PaddlePaddle#35389)

    * Optimization of pool2d grad, first commit.
    
    * Remove useless print code
    
    * Refine code
    
    * Refine code
    
    * Seal more operations into template specialization
    
    * Fix template struct error in MaxPool2dGrad.
    
    * Fix header inclusion error
    
    * Refine code per review comments
    
    * Seal the param-preparation code into a function for common use.
    
    * Seal the param-preparation code into a function for common use.
    
    * Seal the param-preparation into a function and make it common for other kernels
    
    * Polish code and erase useless template specialization
    
    * Rerun trigger
    
    * Rerun trigger
    JamesLim-sy committed Sep 19, 2021 · 8668519
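The "seal ... into template specialization" commits above follow a common pattern in pooling kernels: keep one generic window loop and specialize only the per-pooling-type gradient rule at compile time. A minimal 1-D sketch of that pattern, purely illustrative (the functor names and signature here are assumptions, not the PR's actual code):

```cpp
#include <cassert>

// Illustrative only: per-pooling-type gradient rule sealed into a
// template specialization, so callers write one generic loop and the
// Max/Avg behavior is selected at compile time.
struct MaxPool {};
struct AvgPool {};

template <typename T, typename PoolType>
struct PoolGradFunctor;

// Max pooling: the gradient flows only to the input element that
// produced the window's maximum.
template <typename T>
struct PoolGradFunctor<T, MaxPool> {
  void operator()(const T* in, const T* out, const T* out_grad,
                  T* in_grad, int win_start, int win_end, int out_idx) const {
    for (int i = win_start; i < win_end; ++i) {
      if (in[i] == out[out_idx]) {
        in_grad[i] += out_grad[out_idx];
        break;  // only the argmax position receives the gradient
      }
    }
  }
};

// Average pooling: the gradient is shared evenly across the window.
template <typename T>
struct PoolGradFunctor<T, AvgPool> {
  void operator()(const T* /*in*/, const T* /*out*/, const T* out_grad,
                  T* in_grad, int win_start, int win_end, int out_idx) const {
    T share = out_grad[out_idx] / static_cast<T>(win_end - win_start);
    for (int i = win_start; i < win_end; ++i) in_grad[i] += share;
  }
};
```

Sealing the rule this way lets the shared loop, bounds handling, and param preparation live in one place, which is also what the "make it common for other kernels" commit aims at.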

Commits on Sep 20, 2021

  1. Reuse OneDNN handler for SGD and SUM for SelectedRows input tensors. (PaddlePaddle#35510)
    
    * Create stateful OneDNNAXPYHandler object.
    
    This makes it possible to call it multiple times without recreating the
    oneDNN primitives every time.
    
    * Prepare SGDOpKernel to reuse its implementation from OneDNN kernel.
    
    * OneDNN SGD kernel.
    
    * Update call to use new OneDNNAXPYHandler object api.
    
    * Setup seed in proper place.
    
    * Enable OneDNN kernel only for single case.
    
    * For dense param and sparse grad.
    
    * Small refactor.
    
    * Enable oneDNN by op attr or by cmd line flag.
    
    * Use int64_t type for number of elements.
    
    * Support dense param and grad from OneDNN kernel.
    
    * Enable SGD OneDNN kernel when use MP BF16 optimizer.
    
    * Force non-copyable/movable OneDNNAXPYHandler.
    
    * Reuse OneDNNAXPYHandler for sparse tensors in SUM op.
    
    * Fix SFINAE rules.
    
    * Remove recording event inside AXPY.
    
    * Get rid of internal primitive caching.
    
    * Stop using the PP cache mechanism to store memory and primitive objects
    
    * Handler object stores and reuses the needed descriptors & primitives
    
    * Do not derive from MKLDNNHandlerT
    arogowie-intel committed Sep 20, 2021 · 799f386
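Several of the bullets above (stateful handler object, `int64_t` element counts, forced non-copyable/movable) describe one idea: pay the expensive oneDNN setup once in a handler's constructor, then call it many times without recreating primitives. A minimal sketch of that shape with a plain-C++ compute body standing in for the oneDNN primitive (the class name and interface are assumptions modeled loosely on `OneDNNAXPYHandler`, not the PR's actual API):

```cpp
#include <cassert>
#include <cstdint>

// Illustrative stateful AXPY handler: y += alpha * x. In the real kernel
// the constructor would build and cache the oneDNN primitive and memory
// descriptors; operator() then reuses them on every call instead of
// going through a per-call cache lookup.
template <typename T>
class AxpyHandler {
 public:
  AxpyHandler(int64_t n, T alpha) : n_(n), alpha_(alpha) {
    // Expensive one-time setup would happen here.
  }

  // Non-copyable and non-movable, mirroring the "force
  // non-copyable/movable" commit: the cached state is never duplicated.
  AxpyHandler(const AxpyHandler&) = delete;
  AxpyHandler& operator=(const AxpyHandler&) = delete;
  AxpyHandler(AxpyHandler&&) = delete;
  AxpyHandler& operator=(AxpyHandler&&) = delete;

  // Reuses the state prepared in the constructor; callable repeatedly.
  void operator()(const T* x, T* y) const {
    for (int64_t i = 0; i < n_; ++i) y[i] += alpha_ * x[i];
  }

 private:
  int64_t n_;  // element count as int64_t, per the commit above
  T alpha_;
};
```

Deleting the special members is what lets the handler safely own long-lived oneDNN objects: there is exactly one instance holding them, so SGD and the SelectedRows SUM path can share the same construction logic without risking dangling or duplicated primitives.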

Commits on Sep 21, 2021

  1. 087c23a

Commits on Sep 22, 2021

  1. 7713430
  2. 12ab017
  3. 3c0fea7

Commits on Sep 28, 2021

  1. Refactor flatten kernel

    YuanRisheng committed Sep 28, 2021 · 4b498c4