Skip to content
This repository has been archived by the owner on Nov 17, 2023. It is now read-only.

[PERF] Improve add_bias kernel for small bias length #19744

Merged
merged 1 commit into from
Feb 21, 2021

Conversation

ptrendx
Copy link
Member

@ptrendx ptrendx commented Jan 12, 2021

Description

This PR improves performance of the add_bias kernel for FullyConnected layer when the bias length is small, by computing multiple rows of the result in a single thread block.

Checklist

Essentials

  • PR's title starts with a category (e.g. [BUGFIX], [MODEL], [TUTORIAL], [FEATURE], [DOC], etc)
  • Changes are complete (i.e. I finished coding on this PR)
  • All changes have test coverage
  • Code is well-documented

@mxnet-bot
Copy link

Hey @ptrendx , Thanks for submitting the PR
All tests are already queued to run once. If tests fail, you can trigger one or more tests again with the following commands:

  • To trigger all jobs: @mxnet-bot run ci [all]
  • To trigger specific jobs: @mxnet-bot run ci [job1, job2]

CI supported jobs: [clang, miscellaneous, centos-cpu, edge, centos-gpu, unix-gpu, unix-cpu, windows-gpu, sanity, website, windows-cpu]


Note:
Only following 3 categories can trigger CI :PR Author, MXNet Committer, Jenkins Admin.
All CI tests must pass before the PR can be merged.

@lanking520 lanking520 added pr-awaiting-testing PR is reviewed and waiting CI build and test pr-awaiting-review PR is waiting for code review and removed pr-awaiting-testing PR is reviewed and waiting CI build and test labels Jan 12, 2021
@szha szha merged commit 2a60f0d into apache:master Feb 21, 2021
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
pr-awaiting-review PR is waiting for code review
Projects
None yet
Development

Successfully merging this pull request may close these issues.

4 participants