Add benchmark and reduce GPU memory for cudnn_conv and speed up cudnn_conv. #217

Closed
wants to merge 4 commits into from

Commits on Oct 17, 2016

  1. a8714a7
  2. fa0075a
  3. Add unit test for ConvProjection.

    1. Add a unit test in test_LayerGrad.
    2. Compare ConvProjection with CudnnConvLayer, and compare concat_layer+img_conv_layer with concat_layer_conv_projection.
    qingqing01 committed Oct 17, 2016
    bf28e21
  4. Reduce cudnn_conv memory and add benchmark document.

    1. Use TmpMatrix as the workspace in cudnn_conv to reduce GPU memory usage. This lowers memory consumption considerably.
    2. Add benchmark document.
    3. Fix smallnet_mnist_cifar.py in paddle.
    qingqing01 committed Oct 17, 2016
    91d9bf1
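
The last commit message says that using a temporary matrix as the cudnn_conv workspace reduces GPU memory. The diff itself is not shown here, so the following is only a rough sketch of the general idea, assuming the saving comes from layers sharing one scratch buffer instead of each holding its own. `SharedWorkspace` and the request sizes are hypothetical names and numbers, not Paddle's TmpMatrix API.

```python
# Hypothetical illustration only; this is not Paddle's TmpMatrix implementation.
class SharedWorkspace:
    """One scratch buffer reused by every layer, grown to the largest request."""

    def __init__(self):
        self._buf = bytearray()

    def get(self, nbytes):
        # Grow only when a request exceeds the current capacity.
        if nbytes > len(self._buf):
            self._buf = bytearray(nbytes)
        return memoryview(self._buf)[:nbytes]

    @property
    def capacity(self):
        return len(self._buf)


# Workspace sizes (in bytes) requested by three made-up conv layers.
requests = [4096, 16384, 8192]

# Per-layer allocation: every layer keeps its own workspace alive.
per_layer_total = sum(requests)

# Shared allocation: layers run one at a time and reuse the same buffer.
shared = SharedWorkspace()
for n in requests:
    scratch = shared.get(n)
shared_total = shared.capacity

print(per_layer_total, shared_total)  # 28672 16384
```

With per-layer workspaces the total grows with the sum of all requests, while a shared buffer only needs to be as large as the biggest single request. This works only if the layers do not need their workspaces at the same time.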