-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
merge paddle develop #2
Commits on Mar 2, 2021
-
Add attrs
deformable_groups
for deformable_conv API (#31335)* add attrs deformable_groups
Configuration menu - View commit details
-
Copy full SHA for 1cbccfa - Browse repository at this point
Copy the full SHA 1cbccfaView commit details
Commits on Mar 3, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 946dbda - Browse repository at this point
Copy the full SHA 946dbdaView commit details -
Configuration menu - View commit details
-
Copy full SHA for 13e4280 - Browse repository at this point
Copy the full SHA 13e4280View commit details -
Configuration menu - View commit details
-
Copy full SHA for c1bc223 - Browse repository at this point
Copy the full SHA c1bc223View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6626c6a - Browse repository at this point
Copy the full SHA 6626c6aView commit details -
Configuration menu - View commit details
-
Copy full SHA for e312a1f - Browse repository at this point
Copy the full SHA e312a1fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 32211fe - Browse repository at this point
Copy the full SHA 32211feView commit details -
Configuration menu - View commit details
-
Copy full SHA for db50fb6 - Browse repository at this point
Copy the full SHA db50fb6View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3b9db17 - Browse repository at this point
Copy the full SHA 3b9db17View commit details -
[ROCM] update fluid operators for rocm (part3), test=develop (#31213)
* [ROCM] update fluid operators for rocm (part3), test=develop * fix clang format error, test=develop
Configuration menu - View commit details
-
Copy full SHA for 84639b6 - Browse repository at this point
Copy the full SHA 84639b6View commit details -
[ROCM] update fluid elementwise op for rocm (part10), test=develop (#…
…31361) * [ROCM] update fluid elementwise op for rocm (part10), test=develop * update, test=develop * address review comments, test=develop
Configuration menu - View commit details
-
Copy full SHA for 7cdf6ea - Browse repository at this point
Copy the full SHA 7cdf6eaView commit details
Commits on Mar 4, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 5b4f8aa - Browse repository at this point
Copy the full SHA 5b4f8aaView commit details -
[Dy2stat] Fix Read-Only Attribute as while_loop Output (#31415)
Fix Read-Only Attribute as while_loop Output: Usually, our convert_while_loop will be like: ``` [a, b, c] = paddle.jit.dy2static.convert_while_loop( condition_name, body_name, [a, b, c]) ``` where a, b, c are in loop_var_names. However, if loop_var_names contains property such as foo.x, we cannot assign the attribute as output of convert_while_loop because Python property is a kind of read-only attribute. To handle the case, we replace the attributes which are output of convert_while_loop with generated variables, then if we know the attribute is not read-only at runtime, we assign the attribute. The created statements are like: ``` [a, b, __attribute_variable_1] = paddle.jit.dy2static.convert_while_loop( condition_name, body_name, [a, b, foo.x]) if not isinstance(getattr(type(foo), x, None), property): foo.x = __attribute_variable_1 ```
Configuration menu - View commit details
-
Copy full SHA for 6bf02a1 - Browse repository at this point
Copy the full SHA 6bf02a1View commit details -
Configuration menu - View commit details
-
Copy full SHA for c40b98e - Browse repository at this point
Copy the full SHA c40b98eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 0fff930 - Browse repository at this point
Copy the full SHA 0fff930View commit details -
Configuration menu - View commit details
-
Copy full SHA for 4d6d2db - Browse repository at this point
Copy the full SHA 4d6d2dbView commit details -
improve performance of depthwise_conv2d (#31099)
* improve performance of depthwise_conv2d * add unittest
Configuration menu - View commit details
-
Copy full SHA for dcce54e - Browse repository at this point
Copy the full SHA dcce54eView commit details -
fix modified_retry_method_only_win (#31404)
* fix modified_retry_method_only_win * fix bug * fix retry bug on windows
Configuration menu - View commit details
-
Copy full SHA for 3a8ef10 - Browse repository at this point
Copy the full SHA 3a8ef10View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7d95e59 - Browse repository at this point
Copy the full SHA 7d95e59View commit details -
prepare remove grad script and update PADDLE_CI_INFERENCE pipeline (#…
…31149) prepare remove grad op and kernel script. update Paddle_CI_Inference pipeline.
Configuration menu - View commit details
-
Copy full SHA for c9a7bfe - Browse repository at this point
Copy the full SHA c9a7bfeView commit details -
fix python full coverage decrease issue (#31429)
* fix python full coverage decrease issue * fix
Configuration menu - View commit details
-
Copy full SHA for 62289fc - Browse repository at this point
Copy the full SHA 62289fcView commit details -
Configuration menu - View commit details
-
Copy full SHA for 522c91e - Browse repository at this point
Copy the full SHA 522c91eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 4d647ec - Browse repository at this point
Copy the full SHA 4d647ecView commit details
Commits on Mar 5, 2021
-
[Kunlun]Multi xpu dygraph performance optimization , add distributed.…
…spawn support for multi xpu and some bug-fixes (#31130)
Configuration menu - View commit details
-
Copy full SHA for 9ebf05b - Browse repository at this point
Copy the full SHA 9ebf05bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 1321c47 - Browse repository at this point
Copy the full SHA 1321c47View commit details -
Configuration menu - View commit details
-
Copy full SHA for 30717a6 - Browse repository at this point
Copy the full SHA 30717a6View commit details -
Configuration menu - View commit details
-
Copy full SHA for 8491ae9 - Browse repository at this point
Copy the full SHA 8491ae9View commit details -
Configuration menu - View commit details
-
Copy full SHA for bc7632b - Browse repository at this point
Copy the full SHA bc7632bView commit details
Commits on Mar 8, 2021
-
Configuration menu - View commit details
-
Copy full SHA for ffdd5b7 - Browse repository at this point
Copy the full SHA ffdd5b7View commit details -
[CustomOp] Automatically specify PADDLE_WITH_MKLDNN & Remove Interpre…
…ter argument (#31391) * auto specify PADDLE_WITH_MKLDNN and remove Interpretper * remove print * fix check abi * fix windows * fix compile flags
Configuration menu - View commit details
-
Copy full SHA for fadabbe - Browse repository at this point
Copy the full SHA fadabbeView commit details -
Configuration menu - View commit details
-
Copy full SHA for f937796 - Browse repository at this point
Copy the full SHA f937796View commit details -
Configuration menu - View commit details
-
Copy full SHA for 133a914 - Browse repository at this point
Copy the full SHA 133a914View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5f62132 - Browse repository at this point
Copy the full SHA 5f62132View commit details
Commits on Mar 9, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 39a5424 - Browse repository at this point
Copy the full SHA 39a5424View commit details -
Configuration menu - View commit details
-
Copy full SHA for b85c8e0 - Browse repository at this point
Copy the full SHA b85c8e0View commit details -
Configuration menu - View commit details
-
Copy full SHA for e03e467 - Browse repository at this point
Copy the full SHA e03e467View commit details -
Configuration menu - View commit details
-
Copy full SHA for 50af0c2 - Browse repository at this point
Copy the full SHA 50af0c2View commit details -
Configuration menu - View commit details
-
Copy full SHA for 43d6abf - Browse repository at this point
Copy the full SHA 43d6abfView commit details -
Configuration menu - View commit details
-
Copy full SHA for 634a12b - Browse repository at this point
Copy the full SHA 634a12bView commit details -
Prec on windows exclude check_added_ut (#31372)
* add precision test for windows ci exclude check_added_ut * fix error * added PRECISION_TEST parameters * fix format error
Configuration menu - View commit details
-
Copy full SHA for 390cebe - Browse repository at this point
Copy the full SHA 390cebeView commit details -
[oneDNN] bumpup onednn 2.2 fixup version (#31473)
* - introduced fix onednn 2.2 version * - compilation fix
Configuration menu - View commit details
-
Copy full SHA for 23d96cf - Browse repository at this point
Copy the full SHA 23d96cfView commit details -
* add precision on mac * added judge * match file_ut.json on mac * fix code format error * fix code format error * fix error caused by length of ut_lists exceeds the limit * fix format error,notest,test=cpu * fix code format error * add windows judge on get_pr_ut
Configuration menu - View commit details
-
Copy full SHA for 0b3c229 - Browse repository at this point
Copy the full SHA 0b3c229View commit details
Commits on Mar 10, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 45c7d90 - Browse repository at this point
Copy the full SHA 45c7d90View commit details -
Configuration menu - View commit details
-
Copy full SHA for f57739b - Browse repository at this point
Copy the full SHA f57739bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 416e47e - Browse repository at this point
Copy the full SHA 416e47eView commit details -
* bugfix for test_cholesky_op * bugfix for test_compare_op * bugfix for lookup_table_op * bugfix for affine_channel_op
Configuration menu - View commit details
-
Copy full SHA for 910f377 - Browse repository at this point
Copy the full SHA 910f377View commit details -
Configuration menu - View commit details
-
Copy full SHA for c8ae837 - Browse repository at this point
Copy the full SHA c8ae837View commit details -
remove the send/recv of tensor size (#31460)
* remove the send/recv of tensor size, but users have to specify the shape of the received var explicitly.
lilong12 authoredMar 10, 2021 Configuration menu - View commit details
-
Copy full SHA for 0205e9f - Browse repository at this point
Copy the full SHA 0205e9fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 83a2fb1 - Browse repository at this point
Copy the full SHA 83a2fb1View commit details
Commits on Mar 11, 2021
-
[save/load] Fix bug with input_spec=dict[InputSpec] in jit.save (#31517)
* fix bug with jit.save * refine code
Configuration menu - View commit details
-
Copy full SHA for f3959e9 - Browse repository at this point
Copy the full SHA f3959e9View commit details -
add softmax_switch for softmax_with_cross_entropy_op, test=develop (#…
…31428) * add softmax_switch for softmax_with_cross_entropy_op, test=develop * delete using EigenMatrix in softmax_with_cross_entropy_op.h, test=develop * add REGISTER_OP_VERSION for softmax_switch attr of softmax_with_cross_entropy_op, test=develop
Configuration menu - View commit details
-
Copy full SHA for 6148b87 - Browse repository at this point
Copy the full SHA 6148b87View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3789a69 - Browse repository at this point
Copy the full SHA 3789a69View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9ed6c89 - Browse repository at this point
Copy the full SHA 9ed6c89View commit details -
[Bug fix] Different machine generate different binary file, remove md…
…5 check (#31482) * Different machine generate different binary file, remove md5 check * remove unnecessary functions
Configuration menu - View commit details
-
Copy full SHA for 0f1e7e3 - Browse repository at this point
Copy the full SHA 0f1e7e3View commit details -
Update comments for API
RandomResizedCrop
(#31539)* update comments
Configuration menu - View commit details
-
Copy full SHA for ac493f2 - Browse repository at this point
Copy the full SHA ac493f2View commit details -
Configuration menu - View commit details
-
Copy full SHA for 49c3d2a - Browse repository at this point
Copy the full SHA 49c3d2aView commit details -
[Dy2stat]Fix bug with static_convert_var_shape in locals scope (#31556)
* Fix bug with static_convert_var_shape * replace dot with dash
Configuration menu - View commit details
-
Copy full SHA for def27bc - Browse repository at this point
Copy the full SHA def27bcView commit details
Commits on Mar 12, 2021
-
[CustomOp] Support duplicable op input and output (#31535)
* support duplicable op inout * add costom concat op test
Configuration menu - View commit details
-
Copy full SHA for 95cceb2 - Browse repository at this point
Copy the full SHA 95cceb2View commit details -
help timeout ut debug (#31500)
* To help timeout_ut debug * To help timeout_ut debug * added show information
Configuration menu - View commit details
-
Copy full SHA for f302bb4 - Browse repository at this point
Copy the full SHA f302bb4View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3d5aa9d - Browse repository at this point
Copy the full SHA 3d5aa9dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 99dcd66 - Browse repository at this point
Copy the full SHA 99dcd66View commit details -
Configuration menu - View commit details
-
Copy full SHA for da9dda5 - Browse repository at this point
Copy the full SHA da9dda5View commit details -
Support loading parameters from checkpoint to save quantized model (#…
…31419) * Support loading parameters from checkpoint to save quantized model * Fix the unittest test_moving_average_abs_max_scale_op * Add unittest of save_quantized_model from checkpoint * Add comments to explain the function
Configuration menu - View commit details
-
Copy full SHA for ef0dd3e - Browse repository at this point
Copy the full SHA ef0dd3eView commit details -
Trt elementwise plugin serialize (#31587)
* add serialize unittest * fix element_op trt plugin serialize bug
Configuration menu - View commit details
-
Copy full SHA for 50ac7db - Browse repository at this point
Copy the full SHA 50ac7dbView commit details -
[Paddle-TRT] Fix engine key in trt int8 calibration (#31513)
* fix engine key in trt int8 calibration * fix unit test
Configuration menu - View commit details
-
Copy full SHA for cac9635 - Browse repository at this point
Copy the full SHA cac9635View commit details
Commits on Mar 15, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 30a627a - Browse repository at this point
Copy the full SHA 30a627aView commit details -
DataLoader supprot dict str (#31481)
* add dict/str/list supprot for DataLoader. test=develop
Configuration menu - View commit details
-
Copy full SHA for a32e8bf - Browse repository at this point
Copy the full SHA a32e8bfView commit details -
Configuration menu - View commit details
-
Copy full SHA for 9066b74 - Browse repository at this point
Copy the full SHA 9066b74View commit details -
[CustomOp] Remove the dependence of the underlying data types on eigen (
#31602) * init commit * move eigen of bfloat16 * add complex header
Configuration menu - View commit details
-
Copy full SHA for 027b574 - Browse repository at this point
Copy the full SHA 027b574View commit details -
Configuration menu - View commit details
-
Copy full SHA for c3634c6 - Browse repository at this point
Copy the full SHA c3634c6View commit details -
Fix summary bug when calaculating output shape (#31549)
* fix summary bug
Configuration menu - View commit details
-
Copy full SHA for 7543312 - Browse repository at this point
Copy the full SHA 7543312View commit details
Commits on Mar 16, 2021
-
Configuration menu - View commit details
-
Copy full SHA for da10c5c - Browse repository at this point
Copy the full SHA da10c5cView commit details -
fix wget with no proxy on windows (#31505)
* fix wget with no proxy on windows * modified import packages * fix format error * fix bug * fix format error * fix format error
Configuration menu - View commit details
-
Copy full SHA for 580442c - Browse repository at this point
Copy the full SHA 580442cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 9c624b1 - Browse repository at this point
Copy the full SHA 9c624b1View commit details -
Configuration menu - View commit details
-
Copy full SHA for c1b1ccf - Browse repository at this point
Copy the full SHA c1b1ccfView commit details -
Optimize compilation with Ninja (#31449)
* Optimize compilation with Ninja, notest, test=windows_ci, test=windows_op * no cache on windows ci, notest, test=windows_ci, test=windows_op * delete /Zc:inline compiled in NVCC, notest, test=windows_ci, test=windows_op * fix test_warpctc_op, notest, test=windows_ci * remove test code, test=develop
Configuration menu - View commit details
-
Copy full SHA for 41e9ecf - Browse repository at this point
Copy the full SHA 41e9ecfView commit details -
Second optimization of retry method (#31646)
* Second optimization of retry method * fix show_ut_retry_result repeat execuate
Configuration menu - View commit details
-
Copy full SHA for 1a6e3b0 - Browse repository at this point
Copy the full SHA 1a6e3b0View commit details -
Configuration menu - View commit details
-
Copy full SHA for d9b50f6 - Browse repository at this point
Copy the full SHA d9b50f6View commit details -
Configuration menu - View commit details
-
Copy full SHA for cdc5a55 - Browse repository at this point
Copy the full SHA cdc5a55View commit details
Commits on Mar 17, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 4c0c55b - Browse repository at this point
Copy the full SHA 4c0c55bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 19592d2 - Browse repository at this point
Copy the full SHA 19592d2View commit details -
[CustomOp] Remove Eigen dependencies of float16 (#31669)
* remove eigen deps dof float16 * add cstdlib header * replace stdlib header by cmath
Configuration menu - View commit details
-
Copy full SHA for 2fbe9b0 - Browse repository at this point
Copy the full SHA 2fbe9b0View commit details -
In __getitem__, convert integers to int64 Tensor not int32 to be comp…
…atible with Lite(#31658)
Configuration menu - View commit details
-
Copy full SHA for 402288a - Browse repository at this point
Copy the full SHA 402288aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 7f50bb7 - Browse repository at this point
Copy the full SHA 7f50bb7View commit details
Commits on Mar 18, 2021
-
remove useless import (#31700)
* remove useless import. test=develop
Configuration menu - View commit details
-
Copy full SHA for 740359e - Browse repository at this point
Copy the full SHA 740359eView commit details -
【Paddle.Fleet】Fix one ps gradient clip (#31664)
* fix one ps gradient clip
Configuration menu - View commit details
-
Copy full SHA for 09482dd - Browse repository at this point
Copy the full SHA 09482ddView commit details -
Configuration menu - View commit details
-
Copy full SHA for d4282ea - Browse repository at this point
Copy the full SHA d4282eaView commit details -
[Paddle-TRT] support batch axis concatenation when using dynamic shape (
#31627) * support batch axis concatenation when using dynamic shape * opteller can't return true early, or some test will not be executed
Configuration menu - View commit details
-
Copy full SHA for 4ea3427 - Browse repository at this point
Copy the full SHA 4ea3427View commit details -
[Paddle-TRT] gather converter (#31640)
* trt gather converter * add trt gather unit_test
Configuration menu - View commit details
-
Copy full SHA for fe241fd - Browse repository at this point
Copy the full SHA fe241fdView commit details -
[CustomOp] Support complex dtype in custom op (#31657)
* support custom complex op * fix detail error * add inference support * fix setup windows failed
Configuration menu - View commit details
-
Copy full SHA for 8785261 - Browse repository at this point
Copy the full SHA 8785261View commit details
Commits on Mar 19, 2021
-
[ROCM] fix layer_norm, norm, p_norm, test_sequence_softmax_op, test_m…
…ath_op_patch_var_base (#31709)
Configuration menu - View commit details
-
Copy full SHA for 420527f - Browse repository at this point
Copy the full SHA 420527fView commit details -
[dgraph qat] Refine calculating output scale of dygraph qat (#31710)
* Refine calculating output scale of dygraph qat, test=develop
Configuration menu - View commit details
-
Copy full SHA for 1d197f6 - Browse repository at this point
Copy the full SHA 1d197f6View commit details -
remove redundant sync, set collect/dist kernel to context stream, sub…
…_lod memcpy opt (#31641)
Configuration menu - View commit details
-
Copy full SHA for 50cafa0 - Browse repository at this point
Copy the full SHA 50cafa0View commit details -
NMS Performance Optimization (#31634)
* replace mask vector to raw ptr * launch nms on context stream * remove redundant mask declaration
Configuration menu - View commit details
-
Copy full SHA for c86e771 - Browse repository at this point
Copy the full SHA c86e771View commit details -
Configuration menu - View commit details
-
Copy full SHA for a4a2b77 - Browse repository at this point
Copy the full SHA a4a2b77View commit details -
[CustomOp] Support attribute in infershape function (#31713)
* support attribute in infershape * polish details
Configuration menu - View commit details
-
Copy full SHA for e429deb - Browse repository at this point
Copy the full SHA e429debView commit details -
Configuration menu - View commit details
-
Copy full SHA for 1c67cf0 - Browse repository at this point
Copy the full SHA 1c67cf0View commit details -
Configuration menu - View commit details
-
Copy full SHA for c9e1d9d - Browse repository at this point
Copy the full SHA c9e1d9dView commit details -
[CustomOp] Support float16 in custom op (#31725)
* support float16 in custom op * fix failed unittests
Configuration menu - View commit details
-
Copy full SHA for 878e117 - Browse repository at this point
Copy the full SHA 878e117View commit details -
Configuration menu - View commit details
-
Copy full SHA for 25fc2a1 - Browse repository at this point
Copy the full SHA 25fc2a1View commit details
Commits on Mar 21, 2021
-
Configuration menu - View commit details
-
Copy full SHA for a45c8ca - Browse repository at this point
Copy the full SHA a45c8caView commit details -
Configuration menu - View commit details
-
Copy full SHA for 8c19d7a - Browse repository at this point
Copy the full SHA 8c19d7aView commit details -
Configuration menu - View commit details
-
Copy full SHA for ed7956a - Browse repository at this point
Copy the full SHA ed7956aView commit details
Commits on Mar 22, 2021
-
[3D-parallel] add 1f1b scheduler for pipeline (#31566)
* add 1f1b scheduler for pp, test=develop
lilong12 authoredMar 22, 2021 Configuration menu - View commit details
-
Copy full SHA for a501a7b - Browse repository at this point
Copy the full SHA a501a7bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 7ccf6b6 - Browse repository at this point
Copy the full SHA 7ccf6b6View commit details -
[Paddle-TRT] nearest_interp op (#31626)
* nearest_interp op converter w/ dynamic/static * fix data_layout include * add trt nearest unit_test * add nearest_interp NHWC test * update trt nearest interp nhwc testcase * remove asterisk for python2 compatibility * add empty line to prevent conflict * nearest_interp op converter w/ dynamic/static * fix data_layout include * add trt nearest unit_test * add nearest_interp NHWC test * update trt nearest interp nhwc testcase * remove asterisk for python2 compatibility * add empty line to prevent conflict * change the priority of out_h, out_w
Configuration menu - View commit details
-
Copy full SHA for bfced39 - Browse repository at this point
Copy the full SHA bfced39View commit details
Commits on Mar 23, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 032de0b - Browse repository at this point
Copy the full SHA 032de0bView commit details -
fix launch ps ut test=develop (#31771)
fix launch ps ut test=develop
Configuration menu - View commit details
-
Copy full SHA for f72d197 - Browse repository at this point
Copy the full SHA f72d197View commit details -
Configuration menu - View commit details
-
Copy full SHA for 46dd1d4 - Browse repository at this point
Copy the full SHA 46dd1d4View commit details -
fix tensorrt output varible reshape (#31733)
* fix tensorrt output varible reshape * move padding shape x 1 x 1 in ernie to qkv and fc * update layer name * fix softmax when input is dynamic, fc not padding any more * fix varlen * move fc x_dim assert to op_teller
Configuration menu - View commit details
-
Copy full SHA for 9d04ef7 - Browse repository at this point
Copy the full SHA 9d04ef7View commit details -
Delete fast_check_nan_inf (#31788)
* Delete fast_check_nan_inf * Delete run_fast_nan_inf_debug
Configuration menu - View commit details
-
Copy full SHA for 513641e - Browse repository at this point
Copy the full SHA 513641eView commit details -
update scale collection and propagation algorithm (#31783)
Wojciech Uss authoredMar 23, 2021 Configuration menu - View commit details
-
Copy full SHA for 814b38e - Browse repository at this point
Copy the full SHA 814b38eView commit details -
add relu forward kernel and backward kernel (#31613)
* add relu forward kernel and backward kernel
Configuration menu - View commit details
-
Copy full SHA for 372ac08 - Browse repository at this point
Copy the full SHA 372ac08View commit details -
Configuration menu - View commit details
-
Copy full SHA for f4d9212 - Browse repository at this point
Copy the full SHA f4d9212View commit details -
Update windows compiler and CI from VS2015 to VS2017 (#31652)
* modify windows CI to VS2017 * modify windows CI to VS2017 * modify windows CI to VS2017
Configuration menu - View commit details
-
Copy full SHA for a70de87 - Browse repository at this point
Copy the full SHA a70de87View commit details -
add coalesce_tensor into white list when checking re-creation of para…
…meters (#31800)
Feiyu Chan authoredMar 23, 2021 Configuration menu - View commit details
-
Copy full SHA for 4046f13 - Browse repository at this point
Copy the full SHA 4046f13View commit details -
Configuration menu - View commit details
-
Copy full SHA for 3f66e7d - Browse repository at this point
Copy the full SHA 3f66e7dView commit details -
Configuration menu - View commit details
-
Copy full SHA for 1eb927f - Browse repository at this point
Copy the full SHA 1eb927fView commit details
Commits on Mar 24, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 270699e - Browse repository at this point
Copy the full SHA 270699eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 68497e7 - Browse repository at this point
Copy the full SHA 68497e7View commit details -
[dygraph qat] Refine saving output scale to infer program (#31784)
* Refine saving output scale to infer program
Configuration menu - View commit details
-
Copy full SHA for 84a5513 - Browse repository at this point
Copy the full SHA 84a5513View commit details -
Configuration menu - View commit details
-
Copy full SHA for f2cfc0f - Browse repository at this point
Copy the full SHA f2cfc0fView commit details -
fix cache key in concat oneDNN kernel (#31820)
* fix cache key in concat oneDNN kernel * key simplified
Wojciech Uss authoredMar 24, 2021 Configuration menu - View commit details
-
Copy full SHA for e5f7a83 - Browse repository at this point
Copy the full SHA e5f7a83View commit details -
[Dy2stat] Fix the bug that loop_body_func may return single element (#…
…31806) Our old `loop_body` function may return single element when `loop_vars` just contains only 1 element, which can cause bug. The key point of this PR is forcing `loop_body` functions always return tuple.
Configuration menu - View commit details
-
Copy full SHA for 649868f - Browse repository at this point
Copy the full SHA 649868fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 5d89ec3 - Browse repository at this point
Copy the full SHA 5d89ec3View commit details -
Configuration menu - View commit details
-
Copy full SHA for e7f28d6 - Browse repository at this point
Copy the full SHA e7f28d6View commit details
Commits on Mar 25, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 6472d62 - Browse repository at this point
Copy the full SHA 6472d62View commit details -
Configuration menu - View commit details
-
Copy full SHA for 511e204 - Browse repository at this point
Copy the full SHA 511e204View commit details -
Polish two error messages (#31852)
* polish two error messages * polish details
Configuration menu - View commit details
-
Copy full SHA for 27f2d8d - Browse repository at this point
Copy the full SHA 27f2d8dView commit details -
Configuration menu - View commit details
-
Copy full SHA for bf09dcb - Browse repository at this point
Copy the full SHA bf09dcbView commit details -
Configuration menu - View commit details
-
Copy full SHA for f58cb01 - Browse repository at this point
Copy the full SHA f58cb01View commit details
Commits on Mar 26, 2021
-
delete include framework.pb.h (#31859)
* delete include framework.pb.h * fix error
Configuration menu - View commit details
-
Copy full SHA for e804f08 - Browse repository at this point
Copy the full SHA e804f08View commit details -
Configuration menu - View commit details
-
Copy full SHA for 70b67f1 - Browse repository at this point
Copy the full SHA 70b67f1View commit details -
[Paddle-TRT] multiclass nms (#31742)
* add multiclass_nms * add multiclass_nms unittest * add default enable_tensorrt_oss option * refine multiclas nms unittest and add serialization/dynamic test * change super to InferencePassTest for python2 compatibility * refine multiclass nms unittest * move out dynamic shape test due to ci timelimit
Configuration menu - View commit details
-
Copy full SHA for 01aa252 - Browse repository at this point
Copy the full SHA 01aa252View commit details -
[3D-parallel] Reformat pipeline parallel (#31786)
* update, test=develop
lilong12 authoredMar 26, 2021 Configuration menu - View commit details
-
Copy full SHA for c3974d0 - Browse repository at this point
Copy the full SHA c3974d0View commit details -
[dygraph qat] Use layer to calculate output scale (#31861)
* Use layer to calculate output scale * add backward for moving_average_abs_max_scale and save output scales to op's attr
Configuration menu - View commit details
-
Copy full SHA for b47478e - Browse repository at this point
Copy the full SHA b47478eView commit details
Commits on Mar 29, 2021
-
[Paddle-TRT] trt affine channel converter (#31628)
* trt affine channel converter * add trt affine channel base test * add trt affine channel NHWC * remove asterisk for python2 compatibility * trt affine channel converter * add trt affine channel base test * add trt affine channel NHWC * remove asterisk for python2 compatibility * fix rebase * move LodTensor to Tensor * add dbg info * affine channel converter only support NCHW * scale,bias are parameters, use create_parameters api * reduce test input size to not exceed the timelimit of ci * refine affine channel unittest and add serialization/dynamic test * change super to InferencePassTest for python2 compatibility * change super to InferencePassTest for python2 compatibility * fix affine channel fp16 serialize setting
Configuration menu - View commit details
-
Copy full SHA for bfb5cf5 - Browse repository at this point
Copy the full SHA bfb5cf5View commit details -
[Paddle-TRT] roi_align_plugin (#31732)
* add roi_align_plugin * add roi align unit_test * add roi align serialization * remove roi align static plugin because of batch dim issue * refine roi align unittest and add fp16/serialization * add trt roi align condition to op_teller * refine error message * remove unnecessary reshape layer
Configuration menu - View commit details
-
Copy full SHA for e3a38d7 - Browse repository at this point
Copy the full SHA e3a38d7View commit details -
[CustomOP] Add shape related constructor for Tensor (#31681)
* give shape related contructor and reshape warning * change line num to fit ut * change ut to fit * remove useless code * call resize directly in constructor
Configuration menu - View commit details
-
Copy full SHA for 51eb29d - Browse repository at this point
Copy the full SHA 51eb29dView commit details -
* fix cmake model path * update cmake * fix unittest * fix unittest
Configuration menu - View commit details
-
Copy full SHA for 61805d8 - Browse repository at this point
Copy the full SHA 61805d8View commit details -
Configuration menu - View commit details
-
Copy full SHA for 123949e - Browse repository at this point
Copy the full SHA 123949eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 525c32e - Browse repository at this point
Copy the full SHA 525c32eView commit details -
modify API nn.Bilinear's doc (#31889)
* modify API nn.Bilinear's doc, test=develop * modify API nn.Bilinear's doc, test=develop
Configuration menu - View commit details
-
Copy full SHA for b48841b - Browse repository at this point
Copy the full SHA b48841bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 8829a30 - Browse repository at this point
Copy the full SHA 8829a30View commit details -
Configuration menu - View commit details
-
Copy full SHA for a71d72d - Browse repository at this point
Copy the full SHA a71d72dView commit details
Commits on Mar 30, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 17030ff - Browse repository at this point
Copy the full SHA 17030ffView commit details -
Fix segment Fault from set_value (#31891)
* Avoid raising warning while import paddle * fix segment fault of set_value * fix code style
Configuration menu - View commit details
-
Copy full SHA for c4b60ef - Browse repository at this point
Copy the full SHA c4b60efView commit details -
* yolobox converter and plugin * yolobox unittest * add dynamic shape restriction * fix git merge log
Configuration menu - View commit details
-
Copy full SHA for 64ee255 - Browse repository at this point
Copy the full SHA 64ee255View commit details -
fix batchnorm when inpu dims < 3 (#31933)
* fix batchnorm when inpu dims < 3 * add unittest for batchnorm dims = 2
Configuration menu - View commit details
-
Copy full SHA for 8084b75 - Browse repository at this point
Copy the full SHA 8084b75View commit details -
add deprecated for softmax_with_cross_entropy (#31722)
* add deprecated for softmax_with_cross_entropy, test=develop * test for deprecated in english doc, test=develop * test deprecated for softmax_with_cross_entropy in english doc, test=develop * fix readme and English doc for cross_entropy, test=develop * rm test for softmax_with_cross_entropy deprecated, test=develop * update readme for CrossEntropyLoss, test=develop * fix readme format, test=develop * fix readme format, test=develop * fix readme format for cross_entropy, test=develop * add softmax_switch and fix softlabel for cross_entropy, test=develop * 1)recovery softmax_with_cross_entropy in fluid 2) change softmax_switch to use_softmax 3) add example for softlabel for cross_entropy, test=develop * fix Example number for cross_entropy, test=develop * fix code format, test=develop * fix for CI-Coverage, test=develop * fix for CI-Coverage, test=develop * fix ci-coverage for Non-ASCII character '\xe2' in file, test=develop * fix ci-coverage for Non-ASCII character '\xe2' in nn.layer.loss.py, test=develop * update description for doc when use_softmax=Fasle, test=develop * fix some docs and code example for cross_entropy, test=develop * delete redundant description for soft_label parameter of cross_entropy, test=develop * fix some comment for test_cross_entropy_loss.py, test=develop
Configuration menu - View commit details
-
Copy full SHA for 73a6fa3 - Browse repository at this point
Copy the full SHA 73a6fa3View commit details -
Configuration menu - View commit details
-
Copy full SHA for fe28486 - Browse repository at this point
Copy the full SHA fe28486View commit details -
[Custom OP]Remove old custom OP and reduce whl package volume (#31813)
* Remove old custom OP to reduce whl package volume * [Custom OP]Remove old custom OP to reduce whl package volume
Configuration menu - View commit details
-
Copy full SHA for 04a49b0 - Browse repository at this point
Copy the full SHA 04a49b0View commit details -
Configuration menu - View commit details
-
Copy full SHA for e50bc2c - Browse repository at this point
Copy the full SHA e50bc2cView commit details -
Configuration menu - View commit details
-
Copy full SHA for e1f9316 - Browse repository at this point
Copy the full SHA e1f9316View commit details -
Configuration menu - View commit details
-
Copy full SHA for 245252b - Browse repository at this point
Copy the full SHA 245252bView commit details -
[Paddle-TRT] TRT inference support for BERT/Transformer in paddle 2.0…
… api (#31744) * support multihead_matmul_fuse_pass_v3 * fix compile problems * embedding_eltwise_ln pass support lookup_table_v2 * suppoort matmul and matmul_v2 in qkv matmul
Configuration menu - View commit details
-
Copy full SHA for 14b7e3c - Browse repository at this point
Copy the full SHA 14b7e3cView commit details -
Configuration menu - View commit details
-
Copy full SHA for 6dca7a1 - Browse repository at this point
Copy the full SHA 6dca7a1View commit details -
Configuration menu - View commit details
-
Copy full SHA for a37a7f6 - Browse repository at this point
Copy the full SHA a37a7f6View commit details -
Configuration menu - View commit details
-
Copy full SHA for 98e803e - Browse repository at this point
Copy the full SHA 98e803eView commit details -
Configuration menu - View commit details
-
Copy full SHA for 0fa6c8a - Browse repository at this point
Copy the full SHA 0fa6c8aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 57d4288 - Browse repository at this point
Copy the full SHA 57d4288View commit details -
Configuration menu - View commit details
-
Copy full SHA for 95f808c - Browse repository at this point
Copy the full SHA 95f808cView commit details
Commits on Mar 31, 2021
-
[ROCM] Add ROCm support for warpctc op (#31817)
* bugfix for warpctc * fix warpctc commit id * fix warpctc commit id * fix warpctc commit id * fix warpctc commit id * fix warpctc commit id * fix WARPCTC_WITH_HIP invalid * Add logs to find out why can not dlopen libwarpctc.so * fix warpctc commit id * fix unit test test_warpctc_op * Optime failed log for dlopen * Optime failed log for dlopen * Delete extra changes * fix warpctc commit id * fix warpctc commit id * Add is_compiled_with_rocm for test_warpctc_op * fix warpctc commit id * Cancel optimize dlopen failed reason, move to next pr, due to it makes windows ci failed * Cancel optimize dlopen failed reason, move to next pr, due to it makes windows ci failed * Cancel optimize dlopen failed reason, move to next pr, due to it makes windows ci failed * fix code style problems
Configuration menu - View commit details
-
Copy full SHA for ef8323d - Browse repository at this point
Copy the full SHA ef8323dView commit details -
support minus-int idx to LayerList (#31750)
* support minus-int idx to LayerList * update layerlist test
Configuration menu - View commit details
-
Copy full SHA for 5394194 - Browse repository at this point
Copy the full SHA 5394194View commit details -
Configuration menu - View commit details
-
Copy full SHA for 52b05ba - Browse repository at this point
Copy the full SHA 52b05baView commit details -
update cmake minimum version to 3.15 (#31807)
* update cmake minimum version to 3.15, test=develop * fix compilation error on Windows, test=develop * fix compilation error on Windows, test=develop * fix compilation error on Windows, test=develop
Configuration menu - View commit details
-
Copy full SHA for 3a95a0b - Browse repository at this point
Copy the full SHA 3a95a0bView commit details -
Configuration menu - View commit details
-
Copy full SHA for 393b3bd - Browse repository at this point
Copy the full SHA 393b3bdView commit details -
fix whl package push pypi (#31585)
* fix whl package push pypi * add rst
Configuration menu - View commit details
-
Copy full SHA for b09c1ce - Browse repository at this point
Copy the full SHA b09c1ceView commit details -
update compilation with C++14 (#31815)
* update compilation with C++14, test=develop * fix compilation error in eigen, test=develop
Configuration menu - View commit details
-
Copy full SHA for 587d99a - Browse repository at this point
Copy the full SHA 587d99aView commit details -
Update eigen version to f612df27 (#31832)
* update eigen version to f612df27, test=develop * fix compilation error, test=develop * remove patch command in eigen, test=develop * fix compilation error caused by call Eigen function with float16 and bfloat16, test=develop * fix unittest error, test=develop * fix unittest error caused by precision, test=develop * remove patch files used by old version eigen, test=develop
Configuration menu - View commit details
-
Copy full SHA for 495e7f9 - Browse repository at this point
Copy the full SHA 495e7f9View commit details -
Polish tensor pipeline (#31701)
* polish tensor pipeline. test=develop
Configuration menu - View commit details
-
Copy full SHA for e973bd7 - Browse repository at this point
Copy the full SHA e973bd7View commit details -
Configuration menu - View commit details
-
Copy full SHA for ea738dd - Browse repository at this point
Copy the full SHA ea738ddView commit details -
fix one error massage (#31904)
* fix one error massage * fix a error message * new fix three error messages * new fix three error messages * new fix some error * new fix one error message
Configuration menu - View commit details
-
Copy full SHA for 6f85e24 - Browse repository at this point
Copy the full SHA 6f85e24View commit details -
Adjust pipeline optimizer for 3d parallelism (#31939)
* update, test=develop
lilong12 authoredMar 31, 2021 Configuration menu - View commit details
-
Copy full SHA for 695dd37 - Browse repository at this point
Copy the full SHA 695dd37View commit details -
[Parallel UT]Improve Parallel UT level on Windows/Linux (#31377)
* [Parallel UT]improve Parallel UT level on Windows/Linux * [Parallel UT]improve Parallel UT level on Windows/Linux * [Parallel UT]Improve Parallel UT level on Windows/Linux * [Parallel UT]Improve Parallel UT level on Windows/Linux * fix CI
Configuration menu - View commit details
-
Copy full SHA for b05f614 - Browse repository at this point
Copy the full SHA b05f614View commit details -
Delete legacy C++ training user-interface (#31949)
* delete include framework.pb.h * fix error * delete fluid_train
Configuration menu - View commit details
-
Copy full SHA for d5b5004 - Browse repository at this point
Copy the full SHA d5b5004View commit details
Commits on Apr 1, 2021
-
Configuration menu - View commit details
-
Copy full SHA for eb3199f - Browse repository at this point
Copy the full SHA eb3199fView commit details -
* fix en doc for emb, test=document_fix; Change-Id: I4757e67caacd7189f068493ed45a7445f87ffb40
Configuration menu - View commit details
-
Copy full SHA for 6b74486 - Browse repository at this point
Copy the full SHA 6b74486View commit details -
Refactor and simplify hook design & add Tensor.register_hook API (#31775
) * refactor and simplify hook design * fix reducer add hook error * add Tensor.register_hook basic impl * refine prepare data impl * revert prepare data change * support register_hook for Tensor * add hook test in model * polish tests and doc example * fix double grad test failed * remove reduce hook func * fix set empty error * polish code by comments * change reduce_hook to mutable_hook * remove useless tmp_ins * fix shape code format error * fix shape code format error
Configuration menu - View commit details
-
Copy full SHA for dbeb3ea - Browse repository at this point
Copy the full SHA dbeb3eaView commit details -
Configuration menu - View commit details
-
Copy full SHA for 0774159 - Browse repository at this point
Copy the full SHA 0774159View commit details -
Configuration menu - View commit details
-
Copy full SHA for 980227f - Browse repository at this point
Copy the full SHA 980227fView commit details -
Configuration menu - View commit details
-
Copy full SHA for 4acc87b - Browse repository at this point
Copy the full SHA 4acc87bView commit details -
[Paddle-TRT] add anchor generator op plugin (#31730)
* add anchor generator op plugin * add anchor generator unit_test * remove dbg info * remove redundant line * replace assertion with paddle enforce * dynamic plugin replaces assertion with paddle enforce * anchor generator support dynamic shape on spatial axis * anchor generator test with fp16, dynamic shape * add anchor generator test all * add back main * reduce test input size to not exceed the timelimit of ci * change super to InferencePassTest for python2 compatibility * reuse paddle operator anchor generator * move creator construct to header with default * add cuda ifdef * reduce line * change super to InferencePassTest for python2 compatibility * fix anchor generator fp16 serialize setting * split unittest from test_all * restrict anchor generator input format before version 7234 * anchor generator only support greater than trt7.1 * change min_graph_size to 2 * min_graph size to 3 if dynamic shape * reduce dynamic shape size to avoid trt search tactic too long to exceed time limit * remove anchor from fetch list * anchor generator support all trt version * fix memory not allocated but if serialized
Configuration menu - View commit details
-
Copy full SHA for b807e40 - Browse repository at this point
Copy the full SHA b807e40View commit details -
* upgrade vlog * train from dataset fetch optimize
Configuration menu - View commit details
-
Copy full SHA for 0589ed2 - Browse repository at this point
Copy the full SHA 0589ed2View commit details -
Configuration menu - View commit details
-
Copy full SHA for 9c5d028 - Browse repository at this point
Copy the full SHA 9c5d028View commit details -
add custom init grad for backward function (#31540)
* add custom init grad for backward function * add custom init grad for backward function * handle when the grad_tensor is none * handle when the grad_tensor is none * fix the args type error on windows platform * modify the args order and doc * format code * add grad_tensor to xpu * modify the grad_tensor type check * add paddle.backward api to support multi tensors gradient compute * add paddle.backward api to support multi tensors gradient compute * add paddle.atuograd module and backward api * change tensor.backward func args * modify tensor backward api * remove create_graph intputs args * add doc and examplex code for backward api * when have the same tensor, throw error * modify test Init func args * modify the execute.Init func args in test files * add paddle.autograd package in setup.py.in * modify error msg, remove _run_backward method in class Tensor * add test cases for backward api
Configuration menu - View commit details
-
Copy full SHA for 83b953f - Browse repository at this point
Copy the full SHA 83b953fView commit details -
fix doc of Pooling layers (#31977)
* fix doc of MaxPool1D * fix doc * fix doc format error * dbg * fix doc * dbg doc format test=document_fix * fix format test=document_fix * test doc * remove - from doc * fix indent * remove space before bracket * dbg format * fix indent test=document_fix * remove new line * fix descrip of Shape test=document_fix * add description for default value test=document_fix * fix bug test=document_fix
Configuration menu - View commit details
-
Copy full SHA for 40e6c57 - Browse repository at this point
Copy the full SHA 40e6c57View commit details -
Support control flow in DataParallel (#31625)
* support control flow * supoort sync_parameters_buffers * fix the bug of sparse embedding
Configuration menu - View commit details
-
Copy full SHA for 8460698 - Browse repository at this point
Copy the full SHA 8460698View commit details -
Configuration menu - View commit details
-
Copy full SHA for 1b6c1d3 - Browse repository at this point
Copy the full SHA 1b6c1d3View commit details -
Configuration menu - View commit details
-
Copy full SHA for 68e7de2 - Browse repository at this point
Copy the full SHA 68e7de2View commit details -
Configuration menu - View commit details
-
Copy full SHA for a4b30a1 - Browse repository at this point
Copy the full SHA a4b30a1View commit details -
Configuration menu - View commit details
-
Copy full SHA for df5aff8 - Browse repository at this point
Copy the full SHA df5aff8View commit details -
Configuration menu - View commit details
-
Copy full SHA for 0e52cdf - Browse repository at this point
Copy the full SHA 0e52cdfView commit details
Commits on Apr 2, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 0b42f48 - Browse repository at this point
Copy the full SHA 0b42f48View commit details -
add leaky_relu forward and backward in activation_op.cu (#31841)
* add leaky_relu forward and backward in activation_op.cu
Configuration menu - View commit details
-
Copy full SHA for 4490e8a - Browse repository at this point
Copy the full SHA 4490e8aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 9e06a64 - Browse repository at this point
Copy the full SHA 9e06a64View commit details -
* graph engine demo * upload unsaved changes * fix dependency error * fix shard_num problem * py client * remove lock and graph-type * add load direct graph * add load direct graph * add load direct graph * batch random_sample * batch_sample_k * fix num_nodes size * batch brpc * batch brpc * add test * add test * add load_nodes; change add_node function * change sample return type to pair * resolve conflict * resolved conflict * resolved conflict * separate server and client * merge pair type * fix * resolved conflict * fixed segment fault; high-level VLOG for load edges and load nodes * random_sample return 0 * rm useless loop * test:load edge * fix ret -1 * test: rm sample * rm sample * random_sample return future * random_sample return int * test fake node * fixed here * memory leak * remove test code * fix return problem * add common_graph_table * random sample node &test & change data-structure from linkedList to vector * add common_graph_table * sample with srand * add node_types * optimize nodes sample * recover test * random sample * destruct weighted sampler * GraphEdgeBlob * WeightedGraphEdgeBlob to GraphEdgeBlob * WeightedGraphEdgeBlob to GraphEdgeBlob * pybind sample nodes api * pull nodes with step * fixed pull_graph_list bug; add test for pull_graph_list by step * add graph table;name * add graph table;name * add pybind * add pybind * add FeatureNode * add FeatureNode * add FeatureNode Serialize * add FeatureNode Serialize * get_feat_node * avoid local rpc * fix get_node_feat * fix get_node_feat * remove log * get_node_feat return py:bytes * merge develop with graph_engine * fix threadpool.h head * fix * fix typo * resolve conflict * fix conflict * recover lost content * fix pybind of FeatureNode * recover cmake * recover tools * resolve conflict * resolve linking problem * code style * change test_server port * fix code problems * remove shard_num config * remove redundent threads * optimize start server * remove logs * fix code problems by reviewers' suggestions Co-authored-by: Huang Zhengjie <270018958@qq.com> Co-authored-by: Weiyue Su <weiyue.su@gmail.com> Co-authored-by: suweiyue <suweiyue@baidu.com> Co-authored-by: luobin06 <luobin06@baidu.com> Co-authored-by: liweibin02 <liweibin02@baidu.com>
Configuration menu - View commit details
-
Copy full SHA for 94736d6 - Browse repository at this point
Copy the full SHA 94736d6View commit details -
update trt engine addplugin name. (#32018)
* update trt engine addplugin name. * update
Configuration menu - View commit details
-
Copy full SHA for d918786 - Browse repository at this point
Copy the full SHA d918786View commit details -
Configuration menu - View commit details
-
Copy full SHA for ed49b41 - Browse repository at this point
Copy the full SHA ed49b41View commit details -
Configuration menu - View commit details
-
Copy full SHA for cd74b20 - Browse repository at this point
Copy the full SHA cd74b20View commit details -
Configuration menu - View commit details
-
Copy full SHA for bf10d56 - Browse repository at this point
Copy the full SHA bf10d56View commit details -
support save/load single tensor (#31756)
* support save/load single tensor * compatibility modification according to unnittest * Some python2.7 don't have 'copyreg' modules * Handle a syntax error. * Dealing with compatibility problems on Mac. * Dealing with compatibility problems on Mac. * edit unittest to improve coverage. * Modify the code according to the review comments * Reduce redundant code. * support for static graph loading dygraph state_dict * edit code according to CI * edit unittest * edit unnittest * delete redundant file * edit code according to Comments * edit english doc * edit english doc * edit English DOC. * get/set_tensor->get/set_value; return_numpy=False * get/set_tensor->get/set_value; return_numpy=False * edit unnittest * edit unnittest * polish code.
Configuration menu - View commit details
-
Copy full SHA for 43367e4 - Browse repository at this point
Copy the full SHA 43367e4View commit details -
Configuration menu - View commit details
-
Copy full SHA for 69c874f - Browse repository at this point
Copy the full SHA 69c874fView commit details -
use busybox run test on windows openblas (#31728)
* use busybox run test on windows openblas * fix error * fix disable_quick and nightly lable issue * add retry on windows openblas * fix bug * use one file to run cpu and gpu tests * fix with grep warning * fix syntax error * change run_unittest to run_unittest_gpu * Update run_unittests.sh fix error
Configuration menu - View commit details
-
Copy full SHA for 290be88 - Browse repository at this point
Copy the full SHA 290be88View commit details
Commits on Apr 3, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 36687d7 - Browse repository at this point
Copy the full SHA 36687d7View commit details -
Configuration menu - View commit details
-
Copy full SHA for 1e52f32 - Browse repository at this point
Copy the full SHA 1e52f32View commit details
Commits on Apr 6, 2021
-
Configuration menu - View commit details
-
Copy full SHA for a3b08ba - Browse repository at this point
Copy the full SHA a3b08baView commit details -
[Hybrid Parallel] Add Topology for hybrid communicate (#32011)
* support hyparallel, add topology * fix utest
Configuration menu - View commit details
-
Copy full SHA for 2e82b6c - Browse repository at this point
Copy the full SHA 2e82b6cView commit details -
fix two error message (#32039)
* fix two error message * fix two error message * fix error * fix error * fix error * fix error
Configuration menu - View commit details
-
Copy full SHA for 9e8f903 - Browse repository at this point
Copy the full SHA 9e8f903View commit details -
Configuration menu - View commit details
-
Copy full SHA for 6d6ea56 - Browse repository at this point
Copy the full SHA 6d6ea56View commit details -
[PaddleTRT] Yolov3 bugfix (#32064)
* fix yolobox teller condition * fix cuda double free bug
Configuration menu - View commit details
-
Copy full SHA for b17e36a - Browse repository at this point
Copy the full SHA b17e36aView commit details -
fix test of affine_grid with rocm (#32047)
* fix test of affine_grid with rocm * fix test of affine_grid with rocm
Configuration menu - View commit details
-
Copy full SHA for 78af100 - Browse repository at this point
Copy the full SHA 78af100View commit details -
Configuration menu - View commit details
-
Copy full SHA for 187bf41 - Browse repository at this point
Copy the full SHA 187bf41View commit details -
Configuration menu - View commit details
-
Copy full SHA for a17c369 - Browse repository at this point
Copy the full SHA a17c369View commit details -
Configuration menu - View commit details
-
Copy full SHA for b8b82b7 - Browse repository at this point
Copy the full SHA b8b82b7View commit details
Commits on Apr 7, 2021
-
Struct SparseValue && Bug Fix (#31721)
* add PullSparseValue for pull sparse * fix bug for PullSparseValue * add test mode in lookuptable * revert API change * add comment for is_training
Configuration menu - View commit details
-
Copy full SHA for a881b4d - Browse repository at this point
Copy the full SHA a881b4dView commit details -
* print build summary * print build summary * print build summary * print build summary
Configuration menu - View commit details
-
Copy full SHA for e625f88 - Browse repository at this point
Copy the full SHA e625f88View commit details -
update name of develop whl package and upgrade gcc 4.8.2 to gcc 5.4 (#…
…31240) * update develop whl package name * distingush cpu and gpu name * fix ref_gcc * change whl name * upgrade gcc 4.8 to 5.4 in ubuntu_dev * update gcc4.8 to 5.4 in centos * Upgrade pip from 18.0 to 20.0.1 * change 2.1.0_dev0 to 2.1.0.dev0 in gpu version
Configuration menu - View commit details
-
Copy full SHA for f5186c3 - Browse repository at this point
Copy the full SHA f5186c3View commit details -
update the TraceLayer.save_inference_model method with add file suffi…
…x automatically (#31989) As the title
Configuration menu - View commit details
-
Copy full SHA for 10af966 - Browse repository at this point
Copy the full SHA 10af966View commit details -
improve performance of DepthwiseConv(NHWC) (#31677)
* improve performance of DepthwiseConv(NWHC)
Configuration menu - View commit details
-
Copy full SHA for 363b25a - Browse repository at this point
Copy the full SHA 363b25aView commit details -
Configuration menu - View commit details
-
Copy full SHA for 1e60a0c - Browse repository at this point
Copy the full SHA 1e60a0cView commit details -
【NPU】Merge ascend GE&distributed code by 0208 from ascendrc (#31957)
* Ascend rc (#30483) * Fix compilcation on CANN20.1 and older (#30494) Fix compilcation on CANN20.1 and older * Add distribution supported (#30578) Add distribution supported * Build praser for Hcom* operators (#30627) Build praser for Hcom* operators * Pass device_ids info from launch to trainer. (#30632) Pass device_ids info from launch to trainer * Add Hccl program group (#30642) Add Hccl program group * Add startup bash files of test_ascend_group. (#30645) Add startup bash files of test_ascend_group * cleanup (#30646) cleanup test_ascend_group.py * [Feature] Build parser to support distributed training (#30658) [Feature] Build parser to support distributed training * fix compilation on ascend-20.1 (#30722) fix compilation on ascend-20.1 * Dev/fix ascend string (#30749) Dev/fix ascend string * code style (#30781) code style * Merge ascend_optimizer and ascend_parser. (#30776) Merge ascend_optimizer and ascend_parser. * Ascendrc add converted op : [range/equal/range/uniform_random/expand/squeeze], fix cast op bug (#30797) Ascendrc add converted op : [range/equal/range/uniform_random/expand/squeeze], fix cast op bug * Add paddle ascend distribution training supported (#30796) Add paddle ascend distribution training supported * pass cxx_flags to gloo cmake (#30857) * Destroy session first. (#30954) Destroy session first. * merge * fix, test=develop * fix, test=develop * fix style, test=develop * fix, test=develop * fix * fix log fatal, test=develop * fix enforce style, test=develop * fix, test=develop * fix, test=develop * fix rccl, test=develop * fix test, test=develop * fix, test=develop * fix, test=develop * fix, test=develop * fix node_num, test=develop * fix ids str, test=develop * fix ids str, test=develop * fix ids str, test=develop * fix, test=develop * fix, test=develop * fix, test=develop * fix, test=develop * fix, test=develop * fix, test=develop * fix, test=develop * fix, test=develop * fix style code, test=develop * fix style code, test=develop * fix style code, test=develop * fix style code, test=develop Co-authored-by: hutuxian <hutuxian2011@sina.cn> Co-authored-by: gongweibao <weibao.gong@gmail.com> Co-authored-by: Void Main <voidmain1313113@gmail.com> Co-authored-by: Leo Chen <chenqiuliang@baidu.com> Co-authored-by: dingsiyu <18369187719@163.com> Co-authored-by: OleNet <olenet@126.com>
Configuration menu - View commit details
-
Copy full SHA for 8c7c53b - Browse repository at this point
Copy the full SHA 8c7c53bView commit details -
Configuration menu - View commit details
-
Copy full SHA for d91faf2 - Browse repository at this point
Copy the full SHA d91faf2View commit details -
Check added ut on windows (#31826)
* added ut check on windows,notest,test=windows_ci * debug,notest,test=windows_ci * debug,notest,test=windows_ci * fix bug,notest,test=windows_ci * added ut check * test for new ut add on windows * test,notest,test=windows_ci * fix bug,notest,test=windows_ci * test * test * test * test,notest,test=windows_ci * test,notest,test=windows_ci * check added ut on windows * only fetch upstream develop * modified according comment * Update run_unittests.sh * Update run_unittests.sh
Configuration menu - View commit details
-
Copy full SHA for e09f4db - Browse repository at this point
Copy the full SHA e09f4dbView commit details -
* graph engine demo * upload unsaved changes * fix dependency error * fix shard_num problem * py client * remove lock and graph-type * add load direct graph * add load direct graph * add load direct graph * batch random_sample * batch_sample_k * fix num_nodes size * batch brpc * batch brpc * add test * add test * add load_nodes; change add_node function * change sample return type to pair * resolve conflict * resolved conflict * resolved conflict * separate server and client * merge pair type * fix * resolved conflict * fixed segment fault; high-level VLOG for load edges and load nodes * random_sample return 0 * rm useless loop * test:load edge * fix ret -1 * test: rm sample * rm sample * random_sample return future * random_sample return int * test fake node * fixed here * memory leak * remove test code * fix return problem * add common_graph_table * random sample node &test & change data-structure from linkedList to vector * add common_graph_table * sample with srand * add node_types * optimize nodes sample * recover test * random sample * destruct weighted sampler * GraphEdgeBlob * WeightedGraphEdgeBlob to GraphEdgeBlob * WeightedGraphEdgeBlob to GraphEdgeBlob * pybind sample nodes api * pull nodes with step * fixed pull_graph_list bug; add test for pull_graph_list by step * add graph table;name * add graph table;name * add pybind * add pybind * add FeatureNode * add FeatureNode * add FeatureNode Serialize * add FeatureNode Serialize * get_feat_node * avoid local rpc * fix get_node_feat * fix get_node_feat * remove log * get_node_feat return py:bytes * merge develop with graph_engine * fix threadpool.h head * fix * fix typo * resolve conflict * fix conflict * recover lost content * fix pybind of FeatureNode * recover cmake * recover tools * resolve conflict * resolve linking problem * code style * change test_server port * fix code problems * remove shard_num config * remove redundent threads * optimize start server * remove logs * fix code problems by reviewers' suggestions * move graph files into a folder * code style change * remove graph operations from base table Co-authored-by: Huang Zhengjie <270018958@qq.com> Co-authored-by: Weiyue Su <weiyue.su@gmail.com> Co-authored-by: suweiyue <suweiyue@baidu.com> Co-authored-by: luobin06 <luobin06@baidu.com> Co-authored-by: liweibin02 <liweibin02@baidu.com> Co-authored-by: tangwei12 <tangwei12@baidu.com>
Configuration menu - View commit details
-
Copy full SHA for 4935b8e - Browse repository at this point
Copy the full SHA 4935b8eView commit details -
add uint8 type for flatten op (#32120)
* add uint8 type for flatten;test=develop
Configuration menu - View commit details
-
Copy full SHA for 297290a - Browse repository at this point
Copy the full SHA 297290aView commit details
Commits on Apr 8, 2021
-
Configuration menu - View commit details
-
Copy full SHA for f74f976 - Browse repository at this point
Copy the full SHA f74f976View commit details -
Configuration menu - View commit details
-
Copy full SHA for 7230203 - Browse repository at this point
Copy the full SHA 7230203View commit details -
The unsupported_fp16_list using in AMP will be created automatically …
…during the runtime. (#32102) * Use the runtime to create the unsupported_fp16_list using in AMP. * Add more infos about supported ops. * Add some comments for the function of OpSupportedInfos. * Fix the unit test of test_multi_precision_fp16_train.
Configuration menu - View commit details
-
Copy full SHA for 6e65fe0 - Browse repository at this point
Copy the full SHA 6e65fe0View commit details -
Configuration menu - View commit details
-
Copy full SHA for 5434496 - Browse repository at this point
Copy the full SHA 5434496View commit details -
* add layerdict class * add docs and test cases for LayerDict class * remove the arguments type in function define * add update inputs type check
Configuration menu - View commit details
-
Copy full SHA for e45c3fa - Browse repository at this point
Copy the full SHA e45c3faView commit details -
Support converting the model from fp32 to fp16 (#32112)
* Support converting the model from fp32 to fp16
Configuration menu - View commit details
-
Copy full SHA for 1bae1e7 - Browse repository at this point
Copy the full SHA 1bae1e7View commit details
Commits on Apr 9, 2021
-
Configuration menu - View commit details
-
Copy full SHA for 3822247 - Browse repository at this point
Copy the full SHA 3822247View commit details -
Configuration menu - View commit details
-
Copy full SHA for dabaca0 - Browse repository at this point
Copy the full SHA dabaca0View commit details -
[Dy2Stat] Support DictCmp and zip grammer (#32159)
* support DictCmp and zip grammar * fix code style
Configuration menu - View commit details
-
Copy full SHA for 55730d9 - Browse repository at this point
Copy the full SHA 55730d9View commit details -
[CustomOp]Support MacOS platform and Remove libpaddle_custom_op.so de…
…pendency (#31976) * Remove old custom OP to reduce whl package volume * [Custom OP]Remove old custom OP to reduce whl package volume * support macos
Configuration menu - View commit details
-
Copy full SHA for d815fbf - Browse repository at this point
Copy the full SHA d815fbfView commit details -
Configuration menu - View commit details
-
Copy full SHA for 95122eb - Browse repository at this point
Copy the full SHA 95122ebView commit details -
[Dy2Stat] Fix undefined var used in For (#32153)
* fix undefind var in For * fix code style
Configuration menu - View commit details
-
Copy full SHA for 4636d13 - Browse repository at this point
Copy the full SHA 4636d13View commit details -
Configuration menu - View commit details
-
Copy full SHA for a73cb67 - Browse repository at this point
Copy the full SHA a73cb67View commit details -
[NPU] cherry-pick basic NPU components/allocator/operator/executor su…
…pports from ascendrc (#32144) * [feature] support npu allocator (#30840) [feature] support npu allocator * [feature] support npu operator (#30951) [feature] support npu operator * [feature] support npu allocator, part 2 (#30972) * support npu allocator * add npu device context * fix some compile problem * fix some compile problem * add npu info * compile ok * fix include dir * support naive_best_fit_allocator * run ut ok, bug failed to exit * call aclrtResetDevice before exit * fix aclFinilize * add system allocatot test * add selected_gpus in gtest * add tensor_test for npu * support npu op, initial commit * add npu stream * add elementwise_add_op * compile ok * fix typo * fix elementwise_add_op_npu_test * support op run * test can run but failed * change aclopExecuteV2 to aclopCompileAndExecute * support parsing ascend rank table file (#31000) support parsing ascend rank table file * Fix reshape on GE graph. (#31084) Fix reshape on GE graph * add npu kernel for elementwise_sub and elementwise_sub_grad (#30973) * add npu sub op * fix typo * rename test * fix bug * fix bug * add fp16 kernel * fix typo * support sub grad op * support elementwise_sub_grad op Co-authored-by: frankwhzhang <frankwhzhang@126.com> * Fix compilation problem (#31100) Fix compilation problem (#31100) * fix compile * fix code stype * remove const_cast * support adding correct npu op in pybind.h (#31143) * support adding correct npu op in pybind.h * refine code * [NPU] Support executor with NPU (#31057) * [NPU] Support executor with NPU * Fix code according to reviews * Fix code * Add unittest for sub op npu * refactor npu device manager (#31154) refactor npu device manager (#31154) * fix selected npus * fix compile * fix reading flags from env * format Co-authored-by: xiayanming <41795079@qq.com> Co-authored-by: gongweibao <weibao.gong@gmail.com> Co-authored-by: frankwhzhang <frankwhzhang@126.com> Co-authored-by: liym27 <33742067+liym27@users.noreply.github.com>
Configuration menu - View commit details
-
Copy full SHA for ccf5709 - Browse repository at this point
Copy the full SHA ccf5709View commit details -
make high precision for avg_pool and adaptive_avg_pool when data_type…
… is float16 (#31887) * make high precision for avg_pool
Configuration menu - View commit details
-
Copy full SHA for ec2ffb6 - Browse repository at this point
Copy the full SHA ec2ffb6View commit details
Commits on Apr 10, 2021
-
Configuration menu - View commit details
-
Copy full SHA for afa3720 - Browse repository at this point
Copy the full SHA afa3720View commit details -
Optimize the performance of the forward of log_softmax when axis is -…
…1 and dim <= 1024 (#31630)
Configuration menu - View commit details
-
Copy full SHA for f8bab5b - Browse repository at this point
Copy the full SHA f8bab5bView commit details
Commits on Apr 12, 2021
-
fix concat_grad on kunlun (#32151)
* fix concat_grad on kunlun * fix concat_grad on kunlun
Configuration menu - View commit details
-
Copy full SHA for a2387ef - Browse repository at this point
Copy the full SHA a2387efView commit details -
Configuration menu - View commit details
-
Copy full SHA for 80698ca - Browse repository at this point
Copy the full SHA 80698caView commit details -
Configuration menu - View commit details
-
Copy full SHA for af374ae - Browse repository at this point
Copy the full SHA af374aeView commit details -
Configuration menu - View commit details
-
Copy full SHA for d8afe40 - Browse repository at this point
Copy the full SHA d8afe40View commit details -
[ROCM] fix some unittests (#32129)
* [ROCM] fix test_gru_rnn_op * [ROCM] fix test_expand_op * [ROCM] fix test_cross_entropy_loss * [ROCM] fix test_conv_nn_grad * [ROCM] fix test_bilinear_tensor_product_op * [ROCM] fix elementwise_op_function * [ROCM] fix test_lstm_cudnn_op * [ROCM] fix test_gpu_package_without_gpu_device * [ROCM] fix test_gru_unit_op * [ROCM] fix test_imperative_optimizer * [ROCM] fix rnn * [ROCM] fix group_norm_op * [ROCM] fix test_pool3d_api * [ROCM] fix test_pool3d_op
Configuration menu - View commit details
-
Copy full SHA for bd2a4e2 - Browse repository at this point
Copy the full SHA bd2a4e2View commit details -
Configuration menu - View commit details
-
Copy full SHA for bb3b790 - Browse repository at this point
Copy the full SHA bb3b790View commit details -
Optimize the process of obtaining prec_list on windows (#32123)
* test,test,notest,test=windows_ci * test,notest,test=windows_ci * test,notest,test=windows_ci * test,notest,test=windows_ci * remove test code * delete some unnecessary logs * fix format error * turn on added ut check on windows
Configuration menu - View commit details
-
Copy full SHA for 8dacfb5 - Browse repository at this point
Copy the full SHA 8dacfb5View commit details