-
Notifications
You must be signed in to change notification settings - Fork 269
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
【Hackathon4 No.35】为 Paddle 优化 prelu op 在 GPU 上的计算性能 #370
Conversation
PR描述写错了吧,是prelu |
是的 写错了,谢谢提醒。 |
目前的实现有一定的性能优化空间,可以加入一些性能优化的技巧。当前forward性能如下表: | ||
|
||
| Case No. | device | input_shape | input_type | weight_type | Paddle Perf(ms) | | ||
|---|---|---|---|---|---| |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
请补充下FP16的性能测试数据,优化时要考虑FP16的性能情况。
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
@JamesLim-sy 已补充
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
LGTM
为 Paddle 优化 prelu op 在 GPU 上的计算性能
任务:PaddlePaddle/Paddle#50657 (comment)