Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Cherry-pick 2.3] Autotune the workspace and kernel choosing of conv #41833

Merged
merged 3 commits into from
Apr 19, 2022

Conversation

Xreki
Copy link
Contributor

@Xreki Xreki commented Apr 15, 2022

PR types

Performance optimization

PR changes

OPs

Describe

Cherry-pick #40338 #41741 #41313

JamesLim-sy and others added 3 commits April 15, 2022 03:09
* Using the maximum workspace_size of all alogirhms to limit the workspace size in exhaustive search mode.

* Use the system cudaMalloc and cudaFree to allocate workspace during searching.

* Enable switch of two kind of workspace setting methods.

Co-authored-by: Liu Yiqun <liuyiqun01@baidu.com>
* change cudnn helper for auto-tune

* Add FLAGS_use_autotune to set the global status of autotune and change the order of choosing algorithm.

* Fix the bug in calculating and printing current step cache hit rate.

* Improve the autotune cache and fix unittest.

* Change the key from AlgorithmType to int64_t.

* Fix unittest for cpu-only env.

* change ChooseAlgoByWorkspace for heuristic mode

Co-authored-by: Liu Yiqun <liuyiqun01@baidu.com>
@paddle-bot-old
Copy link

paddle-bot-old bot commented Apr 15, 2022

✅ This PR's description meets the template requirements!
Please wait for other CI results.

@paddle-bot-old
Copy link

你的PR提交成功,感谢你对开源项目的贡献!
请关注后续CI自动化测试结果,详情请参考Paddle-CI手册
Your PR has been submitted. Thanks for your contribution!
Please wait for the result of CI firstly. See Paddle CI Manual for details.

@PaddlePaddle PaddlePaddle locked and limited conversation to collaborators Apr 19, 2022
@PaddlePaddle PaddlePaddle unlocked this conversation Apr 19, 2022
@Xreki Xreki closed this Apr 19, 2022
@Xreki Xreki reopened this Apr 19, 2022
@lanxianghit lanxianghit merged commit b4adbe5 into PaddlePaddle:release/2.3 Apr 19, 2022
@Xreki Xreki deleted the release/2.3 branch April 19, 2022 08:49
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants