Add flash attn for af2 #8
base: develop
Commits on May 5, 2023
- e85fbac
- b02de1b
- d27f15e
- 2039115: [XPU] Fusion of gather and assign operators to fused_mt op for reducing memory usage (PaddlePaddle#53262)
- 58435ae: remove some [-Wunused-parameter]warning (PaddlePaddle#53397)
  * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop
- 0d9a23b
- d463f8e: Revert "【Hackathon No.52】Implement float16 data type support for the Paddle dist operator (PaddlePaddle#50915)" (PaddlePaddle#53527)
  This reverts commit 9c40653.
Commits on May 6, 2023
- 13e2e10: move UniformRawKernel to legacy (PaddlePaddle#53158)
  * move UniformRawKernel to legacy * Update uniform_kernel.cc * Update uniform_kernel.cu * Update uniform_kernel.cc * Update uniform_kernel.cu * Update uniform_kernel.h * Update uniform_kernel.cc * Empty Commit to setup deployments
- a499731: rem npu in test (PaddlePaddle#53469)
  * rem npu in test * restore some code
- 5a44bf7: Add trt pow converter. (PaddlePaddle#53462)
  * Add trt pow converter. * update to use AddConstantLayer * add dims=0 ut
- 3e7be9c: Rename randint_raw and move it to legacy (PaddlePaddle#53157)
  * Rename randint_raw and move it to legacy * Update fetch_v2_op.cc * Update randint_kernel.cc * Update randint_kernel.cu * Empty Commit to setup deployments
- 12406ca
- 03fe3ce: fix brpc double link (PaddlePaddle#53512)
  * polish * polish * polish * polish * polish * polish * polish * polish * polish * polish * polish
- da963ea: use int64 to calc dim for c softmax (PaddlePaddle#53541)
  * use int64 to calc dim for c softmax * fix complie bug
- 08a8b75
- 6a65ee0
- eda8df7: [XPU] substitute new api kernel for combinatorial adaptive avg_pool2d_grad kernel (PaddlePaddle#53528)
- 4682c0d
- 99399f3
- b729512: Add fused_gate_attention API. (PaddlePaddle#53432)
  * Add fused_gate_attention API. * Implement FusedDropout API. * Fix doc and add unittest. * Skip for non-gpu device. * Add unittest.
- 165afab
- dd2860e
- d91d758: [IR] OpTrait & OpInterface & OpInfo (PaddlePaddle#52846)
  * add OpTrait OpInterface ValueIterator TypeList * refine code * refine code * refine code * add opinfo * add typeid copy constructor * add trait interface construct method for opinfo_impl * add trait interface construct method for opinfo_impl * add trait interface construct method for opinfo_impl * add trait interface construct method for opinfo_impl * add trait interface construct method for opinfo_impl * add create * add member func for opinfo * fix compile bug * add op interface in ircontext * fix compile bug * fix compile bug * refine code * fix compile bug * add ut * refine ut * refine code of opinfo_impl * delete unused code * add dyncast for operation * refine comment * refine opinfo_impl * delete unused code * refine code by comment * refine code * refine code * refine code for registerOp * refine opfin create * refine code of search method of ircontext * refine op attribute * change opinfo_map key from type_id to string
- f5476da
- a5a0e8f: 【prim】Elementwise double grad (PaddlePaddle#53014)
  * add mul doubel grad * add sub_double_grad * add add sub high test * add mutiply test * modify other unsqueeze * delete api.yaml * only for make ci run * midify unsqueeze * modify unsqueeze * tmp * modify operants gen * review modify * modify review * debug * debug * modify ci cross boundary * delete log
- 1d8c82b: fix strided_slice ut (PaddlePaddle#53553)
  * fix strided_slice ut * remove check_dygraph
- ca174ea
- 08b44e6: [inference][trt] add lookup_table op trt converter, use trt gather layer (PaddlePaddle#53554)
  * add lookup_table op trt converter * update
- b65e932: Add PADDLE_THROW in take_along_axis kernel when the datatype of index is wrong. (PaddlePaddle#53556)
Commits on May 8, 2023
- fe80730
- 184cf9a
- a299153
- 65c6ed1
- acefdeb
- 3fd2e76
- 462e36e: Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_flash_attn_for_af2
- 2bf6128
- 3458f8c: Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_flash_attn_for_af2
- fca8595: [Paddle-TRT] add generic plugin for lookup_table_v2(embedding) op (PaddlePaddle#53539)
  * add embedding generic plugin, not enabled
- be44a91: Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_flash_attn_for_af2
- a01b20d
- 2f50338
- a9ba1ba: Merge branch 'add_flash_attn_for_af2' of https://github.com/JamesLim-sy/Paddle into add_flash_attn_for_af2
- c0f497a: Merge branch 'add_flash_attn_for_af2' of https://github.com/JamesLim-sy/Paddle into add_flash_attn_for_af2
- f3f3d57
- bfe5a8c: Merge branch 'develop' of https://github.com/PaddlePaddle/Paddle into add_flash_attn_for_af2
- 0b7fda0: Merge branch 'add_flash_attn_for_af2' of https://github.com/JamesLim-sy/Paddle into add_flash_attn_for_af2
- ac3ff47
- fe91940
- 10f9249: [inference][trt]Unary operation support 0d (PaddlePaddle#53506)
  * fix trt Unary operation do not support 0d when TRT < 8.6 * update unary ut * add rsqrt to unary_list * move rsqrt to act_list
- e988251
- 186f5e0
- 0a59825
- 2aedd9d
- 26c3077
- 7dcf5e5
- 6d396ac
- b6c0407
- f74237c
- 116fcad: 【BugFix】fix err of api to_tensor, which caused by numpy version update (PaddlePaddle#53534)
  * fix * update code * pre-commit * remove scale check (0-D tensor is usable) * fix data dtype err * fix numpy default dtype diff * fix data dtype * fix data dtype * update * fix coverage
- 70180df
- ce937f6
- e522ceb: add complex support for optest (PaddlePaddle#53356)
  * add complex support for optest * add complex grad test * append one * move some debug info * move some debug info * move some debug info * move some debug info * add more complex test * Fix naming ambiguity * Revert "add more complex test" This reverts commit dbcb051. * change backward gradient, add TODO
- e4bf1a8
Commits on May 9, 2023
- 727fa27: remove some [-Wunused-parameter]warning and WITH_DISTRIBUTE flag (PaddlePaddle#53532)
  * test,test=develop * test,test=develop
- 8d340ee
- af2ad8d
- aec4e38: [Paddle-TRT] Del 2 useless pass (PaddlePaddle#53414)
  * delete delete_fill_constant_op_pass and unsqueeze2_eltwise_fuse_pass
- eb12e62
- 6029e02: [Zero-Dim] add 0D test for linalg.norm/linalg.cond (PaddlePaddle#53592)
  * add 0D test for linalg and linalg.cond * remove p_norm test * Update test_zero_dim_tensor.py * Update test_zero_dim_tensor, test=allcase * add 0D op test for cond and pnorm,test=allcase * fix conda error
- 9682b04
- 72cb09e
- ea0abf9
- dd90f10
- 0f1b077
- 14c642c
- 7e9c87c: [PHI kernels] Bind XPU kernels (PaddlePaddle#53336)
  * bind sparse_coo_tensor, reduce_max/max_int32, range/arange_int32, equal_bool, scatter_grad_float32, nearest_interp_int64 kernels * add more unit tests; modify compilation logic of xpu sparse kernels
- a37ef76
- e588f2d: [Zero-Dim] add 0D Tensor UT case for XPU and expand kernel support 0D (PaddlePaddle#53555)
  * [Zero-Dim] add 0D Tensor UT case for XPU * fix comment * remove some unnecessary UT
- bafc346: remove some [-Wunused-parameter]warning (PaddlePaddle#53617)
  * test,test=develop * test,test=develop * test,test=develop * test,test=develop
- 9cd0a5b
- 9244ceb
- eaed168: [static op generation] coalesce_tensor (PaddlePaddle#53570)
  * [phi][api] add autogen code coalesce_tensor * [phi][api]fix args * [phi][api] supplement attrs
- 45ce0ad: [CINN]Adjust Bert unittest loss ground truth (PaddlePaddle#53628)
  [CINN]Adjust Bert unittest loss ground truth, see: PaddlePaddle/CINN#1357
- 4907485: Add compare accuracy api (PaddlePaddle#53430)
  zhangkaihuo committed May 9, 2023
Commits on May 10, 2023
- 3be7a6c
- 26fe2dc
- ee1aa69
- 7a8635d
- 2eea311
- aafaad9: Revert "Optimize the implementation of the argsort operator. (PaddlePaddle#47738)" (PaddlePaddle#53631)
  This reverts commit 9e9b705.
- e077678
- 65e57a7: remove some [-Wunused-parameter] warning and WITH_DISTRIBUT flags (PaddlePaddle#53650)
  * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop
- 6a279df
- f023d42: [NPU] PP for npu (PaddlePaddle#53501)
  * revert p2p communication for xpu * pp for npu * update * update * fix xpuplace * add ut for sync send * Revert "fix xpuplace" This reverts commit f89c1d7. * add ut for pp sync send * rm unusable ut * update
- c828934
- 0f319f8
- 7f39bcd: [LAUNCH] add log overwrite flag (PaddlePaddle#53608)
  * add log overwrite flag * use strtobool
- 4f33f44
- 38d664b: [XPU]Conv transpose fp16 && fix unittest (PaddlePaddle#53626)
  * fix as review, add fp16 conv2d_transpose * fix unittest of bn and reduce_mean * fix bn unittest * fix ci * fix ci
- 65a3a58
- f3393f4: add index_put api (PaddlePaddle#52886)
  * add index_put api * fix value broadcast in backward and add test case in static * add timeout=120s for index_put * add op_compat for index_put * add inplace index_put test * add test case when index tensor in indices is int32 when indices.size less than x.dims * add index_put api backward in cpu place * add backward test case * refactor code to delete some duplicated code * replace reshape with resize for decrease extra memcpy * add datatype flag in backward yaml * fix bug in documentation * Update python/paddle/tensor/manipulation.py --------- Co-authored-by: Ligoml <39876205+Ligoml@users.noreply.github.com>
- 23c108f
Commits on May 11, 2023
- fb8ea98: 【prim】add dygraph error code when close prim flag for op who has composite implement but no grad kernel (PaddlePaddle#53610)
  * add no prim no gradOp error code * delete prim_white_list throw error * delete invoke_forward_api throw error * delete invoke_forward_api throw error * review * review
- ad3f70a: Merge branch 'add_flash_attn_for_af2' of https://github.com/JamesLim-sy/Paddle into add_flash_attn_for_af2
- 00ded2e: Fix div error when dtype is int64 in static mode (PaddlePaddle#53705)
  * Fix div error when dtype is int64 in static mode * Fix out dtype
- 0d45ac7
- 7ff9f5e
- d67d74c: [XPU] update log for bkcl function calls. (PaddlePaddle#53609)
  * [XPU] update log for bkcl function calls. * minor update * revert unnecessary modifications.
- 44aebd4: [XPU] update dependency for xccl. (PaddlePaddle#53697)
  * [XPU] update dependency for xccl. * remove unnecessary codes.
- 49de9de: [Doc] remove execution_strategy doc (PaddlePaddle#53668)
  * remove execution_strategy docstring * remove doc of num_iteration_per_run; test=document_fix
- 6ec8d85: up index warning level (PaddlePaddle#53691)
  * up warning level * numpy still vlog-0
- 08b6f5d: [XPU] add depthwise_conv2d_transpose (PaddlePaddle#53680)
  * add_depthwise_conv2d_transpose * Update test_depthwise_conv2d_transpose_op_xpu.py (remove print statements)
- aebff6d: [Paddle-Inference] Support trt 0dims of expand_as_v2 and mish. (PaddlePaddle#53627)
  * support_expand_mish
- 8075752: [test]mv fluid [controlflow,detection,dlnne,tensorrt] tests to tests (PaddlePaddle#53470)
  * [test]mv fluid controlflow detection dlnne tensorrt tests to tests * [test]clean dlnne * [test] fix test_tensorrt_engine_op * [test] try fix path error * [test] RollBACK test_tensorrt_engine_op * [test] RollBACK test_tensorrt_engine_op * [test]add todo * Empty-Commit; test=document_fix
- 4a97ba5: [KUNLUN]Revert "revert p2p communication for xpu (PaddlePaddle#53496)" (PaddlePaddle#53633)
  * Revert "revert p2p communication for xpu (PaddlePaddle#53496)" This reverts commit eda0c58. * update
- 32dae48
- 314d041
- b4024aa: Revert elementwise (PaddlePaddle#53663)
  * modify concat_grad add sum comp rule * delete default mul_double_grad * delete high grad test * recover yaml * modify yaml
- 9555ae8
- 6f28eb7: [XPU][PHI Kernels] add pad op for xpu (PaddlePaddle#53684)
  * add pad op for xpu * add pad op for xpu * add pad op for xpu
- 793f3b9: move DataLoader code to paddle.io (PaddlePaddle#48699)
  * move DataLoader to paddle.io. test=develop
- 2f56b6d
- dbb6269: remove some [-Wunused-parameter] warning (PaddlePaddle#53683)
  * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop
- 04e5e7b
- 4a69a53
- e92a9bb
- 5417382: fix doc of compare_accuracy (PaddlePaddle#53661)
  zhangkaihuo committed May 11, 2023
- 82c7388: [inference Zero-Dim]prelu trt converter support zero dim tensor (PaddlePaddle#53634)
  * prelu op trt converter support zero dim
- 3888682: add cinn bf16 support (PaddlePaddle#53637)
  Add the BFloat16 type mapping between CINN and the Paddle framework
- dc003fa
- b150b16: [Inference Zero-Dim] Support trt 0dim of gelu, hard_swish, hard_sigmoid and leaky_relu (PaddlePaddle#53714)
  * support_act * delete_silu
Commits on May 12, 2023
-
Configuration menu - View commit details
-
Copy full SHA for 13cdaab - Browse repository at this point
Copy the full SHA 13cdaabView commit details -
Configuration menu - View commit details
-
Copy full SHA for 6cd7609 - Browse repository at this point
Copy the full SHA 6cd7609View commit details -
Configuration menu - View commit details
-
Copy full SHA for 92db839 - Browse repository at this point
Copy the full SHA 92db839View commit details -
fix jacobian and hessian's docstring (PaddlePaddle#53732)
* fix jacobian and hessian's docstring * fix hessian's docstring * fix hessian's docstring
Commit 3e3297c
【Prim】support higher order autodiff for dy2static+composite (PaddlePa…
…ddle#53171) * [Dy2St]Fix x grad names when high order gradient * Polish error msg * Add inputs var to backward in dy2st * Fix error * Get grad names for backward API * Fix save load * Polish code * Add ut * [prim] fix not support optional grad bugs in higher order autodiff * [prim] remove duplicate fill_any_like caused by infershape_for_composite * fix _strip_grad_suffix_ bugs in higher-order autodiff * [prim] create output for test_static_prim.cc --------- Co-authored-by: 0x45f <wangzhen45@baidu.com>
Commit b73594b
Skip fake alloc in static build for some communication OPs (PaddlePad…
…dle#53593) * Skip fake alloc in static build for depend and nop op * Skip communication op * Skip sync op
Commit 58916e3
Revert "[CINN]Adjust Bert unittest loss ground truth (PaddlePaddle#53628)" (PaddlePaddle#53731)
This reverts commit 45ce0ad.
Commit 95ae5d5
fix doc error of index_put in develop (PaddlePaddle#53727)
* fix doc error of index_put in develop * fix doc error for index_put; test=document_fix; test=docs_preview
Commit 4e416c9
move pow2_decay_with_linear_warmup kernel to phi (PaddlePaddle#53741)
* update * update
Commit 348565b
[PHI] update xpu api version; bind reduce_any_bool xpu kernel; remove…
… unnecessary header (PaddlePaddle#53716)
Commit 0603777
sequence_mask functionalization (PaddlePaddle#53478)
* sequence_mask functionalization * fix sequence_mask test
Commit d2b1e3c
[inference zero dim] softmax, stack op trt converter support zero dim (…
…PaddlePaddle#53729) * softmax support * support stack
Commit 05d3fc8
Commit eb97f4f
Commit df8c302
Commit fc3c281
[CustomDevice] add inference MP support, PART0 (PaddlePaddle#53719)
* [CustomDevice] add inference MP support, PART0 * update
Commit d03bbef
【prim】add forward output for Silu grad signature (PaddlePaddle#53632)
* add rules * modify silu_grad input * modify kernel signature * modify kernel signature * code style * review
Commit 3846111
Commit d01c89c
Commit 1019b26
Commit 772b490
test(prim-cinn): split test_resnet and test_bert into three tests (Pa…
…ddlePaddle#53723) * test(prim-cinn): split test_resnet and test_bert into three tests * test(prim-cinn): fix cmake file to run prim test in CINN-CI
Commit 60cf9b5
Commit c497b43
Commit 4d39cc7
【Hackathon 4 No.20】Add i0 / i0e to paddle (PaddlePaddle#52058)
* added base code for i0 and i0e * added grad base code for i0 and i0e * added i0 and i0e python code * added ops and backward yaml config * added i0 and i0e cpu kernel, but not tested * added i0 and i0e code and unittest files * added test files * added i0/i0e gpu implementation code * updated code style * updated code style * fixed unittest code * updated i0 with eigen3 * fixed bug and added more test cases * refactor: fixed static graph bug * refactor: removed i0 and i0e from op_compat * refactor: updated code style * refactor: updated op_compat.yaml * refactor: updated op_compat.yaml * refactor: fixed op name mapping and optimize unittest case * refactor: manually implement i0 / i0e * refactor: added grad kernel for i0 / i0e, not finished * Update math.py * refactor: added equation to doc in English and added comments for computing i0 / i0e gradient * refactor: removed eigen implementation * refactor: finished i0 / i0e cpu and gpu op * refactor: updated code style * fix: found a bug, not yet fixed * fix: incorrect unittest cases * update: updated code style and remove my file * update: updated unittest case * fix: fixed sign error * fix: fixed mistakes when merging * refactor: updated code style * refactor: remove unused code * refactor: updated code style
Commit ce256f7
Commits on May 13, 2023
Revert elementwise add (PaddlePaddle#53745)
* modify concat_grad add sum comp rule * delete default mul_double_grad * delete high grad test * recover yaml * modify yaml * recover add_double_grad prim
Commit b75d8c7
Commits on May 14, 2023
fix build error (PaddlePaddle#53790)
* fix build error * fix build error * fix
Commit 3e90a46
Commits on May 15, 2023
move OneHotRawKernel to legacy (PaddlePaddle#53200)
* move OneHotRawKernel to legacy * fix
Commit 34122e3
Transpose layout (PaddlePaddle#53351)
* update * Update backward.h * Update composite_backward_api.h * Update tensor_utils.cc * Update backward.cc * update * style * update * add ctest * code style
Commit 3dce9f0
Commit 8105607
relocate python/paddle/fluid/regularizer.py (PaddlePaddle#53106)
* relocate regularizer.py * fix bug * fix bug * fix bug * relocate the import * replace _regularization_coeff with coeff * remove the L1DecayRegularizer and L2DecayRegularizer
Commit 00e415d
Commit 359f43a
Commit a822a08
Fix bug of hybrid_parallel_optimizer, amp use scaler.minimize(), (Pad…
…dlePaddle#53773) however it can't deal with group of parameter_list of dict.
Commit 5152971
[PHI]Add Filter for get_kernel_signatures.py (PaddlePaddle#53760)
* delete log * filter some kernel signature
Commit b428e8f
Commit 3d4d7c1
add check ops for prim (PaddlePaddle#52302)
* add check ops for prim * fix pow and concat composite registration * modify log * add note and remove useless code * remove useless code * modify program to check * remove useless note
Commit 3d6bd6a
Commit a9c3e32
remove some [-Wunused-parameter] warning (PaddlePaddle#53681)
* test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop
Commit 96188fc
remove some [-Wunused-parameter] warning (PaddlePaddle#53679)
* test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop
Commit ca2ea16
remove some [-Wunused-parameter] warning (PaddlePaddle#53687)
* test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop
Commit 8ed01e8
remove some [-Wunused-parameter] warning (PaddlePaddle#53689)
* test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop
Commit 3e1fffe
Commit 972daa4
Reduce inference library size and compile time (PaddlePaddle#53369)
* Reduce inference library size and compile time * resolve conflicts
Commit 0ef5180
Silu double grad (PaddlePaddle#53605)
* add rules * modify no kernel yaml parse * success op generate * success test_silu_double * modify bug * modify static error * modify silu_grad input * modify kernel signature * modify kernel signature * code style * code style * review * delete opinfo modify
Commit 94c3880
[inference Zero-Dim][trt] Add Zero-Dim tensor support for clip, cast,…
… flatten_contiguous_range (PaddlePaddle#53769) * [inference Zero-Dim][trt]clip,cast,flatten_contiguous_range trt op converter support zero dim
Commit cc9aeda
Commit e04f8d4
Commit 56fded1
move dequantize kernel to phi (PaddlePaddle#53739)
* update * fix bug * fix output type def
Commit efd410c
[AMP]fix embedding model weight type mismatch error (PaddlePaddle#53770)
* fix embedding model weight type mismatch error * Update fp16_utils.py --------- Co-authored-by: Zhang Ting <zhangting_2017@163.com>
Commit 848deec
Commits on May 16, 2023
Commit 2174e91
Commit 434343c
Commit 926b886
fix simple typos (PaddlePaddle#53783)
* correct 1th to 1st * correct 1th to 1st * fix typo * fix typos
Commit 847c48a
Commit 4f7dfd0
Commit 79c84ba
[phi] move stft to phi - Step 1 (PaddlePaddle#53517)
* [phi]mv StftKernel to phi * [phi] fix KernelSignature * [phi]fix arr error * [phi] Disable check_dygraph * [phi]fix include * [phi] rewrite mutable_data, add output register * [phi] fix Alloc * [phi] fix Alloc again * [phi] fix mutable_data * [phi] fix onesided_out Resize
Commit 00c21ab
[inference][trt]Remove unused code from teller.cc (PaddlePaddle#53758)
* remove unused code
Commit 2a94b81
[AMP] Allow to switch whether to use promote strategy to choose kerne…
…l for O2 training. (PaddlePaddle#53742) * Allow to switch whether to use promote strategy to choose kernel for O2 training. * Fix comparing error and add unittest.
Commit db407bf
[Inference] clean unused code/target for reduce inference so volume (…
…PART I) (PaddlePaddle#53762) * remove prelu and lookup_table plugin, adjust .h include location * clean code and adjust some .h * update
Commit 51ecd93
Commit 98100fd
[dygraph]remove legacy code : _in_eager_mode_ and _in_eager_without_d…
…ygraph_check() (PaddlePaddle#53761) * remove _in_eager_mode_ * remove _in_eager_mode_
Commit b133317
Commit 481511a
Add Japanese README (PaddlePaddle#53726)
* Add Japanese README * Update README_ja.md
Commit ad45b36
[static op generation] InstanceNorm (PaddlePaddle#53340)
* mv InstanceNorm * modify op_version.yaml * modify add Operator:: in get_expected_kernel_func.cc * rm gradexpectedkernel * add extra * add float epsilon=1e-5
Commit 7b81092
Commit b86bbe8
static graph autogen code support for softmax op (PaddlePaddle#53581)
* static graph autogen code support for softmax op * bug fixed * fix PR-CI-Windows error * fix CI error * bug fixed * fix conflicts
Commit 312f018
Move fused batchnorm to Phi (PaddlePaddle#53476)
* trans fused batch norm Compute function * trans batch norm register info to phi * trans fused batch norm grad Compute * trans batch norm grad register info * add sig file * update sig file * Update fused_bn_activation_kernel.cu * Update fused_bn_activation_grad_kernel.cu * fix * Rename fused_bn_activation_kernel_grad.cu to fused_bn_activation_kernel.cu * fix * fix * fix CudnnDataType error * fix * fix include * update * add #if * add fused bn act to cmakelist.txt * update cmakelist * fix #ifdef error * add timeout set * add env set * fix * fix * Update fused_bn_activation_sig.cc
Commit 5e5481d
Commit 0689e2a
Retire Ascend- and Cambricon-related code: retire NPU-related code, part 3 (PaddlePaddle#53699)
* rm npu * rm use_npu * rm npuid * rm use_npu * rm npuid * delete npupinned * roll back sth. * roll back sth. * delete npupinned * roll back sth. * roll back sth. * rm npu * rollback something * rollback npu identity * rollback npu identity
Commit 5b054d2
move cudnn_lstm kernel to phi (PaddlePaddle#53730)
* update * fix bug * test * test * update * update mutable_data * fix bug * update * fix bug * update output type reg * update * update
Commit 52889e3
Commit c2c3bd4
Commit 32e36b1
【static】modify backward prune logic for EmptygradOpMaker (PaddlePaddl…
…e#53746) * add rules * modify no kernel yaml parse * success op generate * success test_silu_double * modify bug * modify static error * modify silu_grad input * modify kernel signature * modify kernel signature * code style * code style * review * delete opinfo modify * modify gradOpMaker * modify gradOpMaker * modify generated-j2 * add approve rules * modify autograd_functional_static_test
Commit 69161a9
Fix some tests for issue 52842 (PaddlePaddle#53795)
* polish * polish
Commit c33ba9d
Commit 0ab7f94
【PaddlePaddle Hackathon 4 No.34】Optimize the performance of the Lerp OP on GPU for Paddle (Paddl…
…ePaddle#53154) * modify lerp_kernel.cu * pre-commit * fix some CI issues * fix some CI issues * fix some CI issues * fix some CI issues * fix some CI issues * fix some CI issues * fix some CI issues * fix some CI issues * Add files via upload fix some CI issues
Commit e592534
remove some [-Wunused-parameter] warning and fix a file to pass cppli…
…nt (PaddlePaddle#53814) * test,test=develop * test,test=develop * test,test=develop * test,test=develop * test,test=develop
Commit 10a38b4
【Hackathon No57】add bf16 for mode (PaddlePaddle#53195)
* add bf16 for mode * remove random seed 666 * try to fix op_type error * test for me * try to fix op_type * fix redundancy code * add fp,bf for lastdim * fix some error * simplify code * fix shape error * optype error * fix skipif bf16
Commit 640cff0
Commit 50f0acc
Commit 74b91bc
Commits on May 17, 2023
Commit d6d3de7
update openblas version (PaddlePaddle#53748)
* update openblas version * update
Commit 8965366
【Hackathon 4 No.21】Add i1 / i1e to paddle (PaddlePaddle#53210)
* Add i1 and i1e op * resolve merge conflicts
Commit a63fb4c
Commit 38e5cd0
Commit 56e8aff
Commit 91a0ea5
[IR] Program & Parameter & PaddleDialect (PaddlePaddle#53557)
* add program parameter dialect_interface * fix op create bug * add ir parameter convert pd variable methods * refine code * fix bug * refine by ut * refine ut * delete unused code * refine code * refine code by comment * reset WITH_NEW_IR * refine op attribute map * refine program and op create * refine program and op create
Commit 78967ad
Commit 2cb2801
Commit f6be954
Supports offline compilation of Paddle third-party libraries (PaddleP…
…addle#53744) * optimize logsumexp in small data scale * fix * fix * add #pragma once * compile protobuf offline * add submodule gflags * check_submodules * check_submodules * add_submodule protobuf * add_submodule_protobuf * add_submodule * add .gitmodules * add_submodules * fix_compiler error * support offline compile * support offline compile * support offline_compile * remove cub * remove brpc * support offline compile * support offline compile * canning patching on cryptopp * modify .gitignore of cryptopp * test * offline compile * add_submodule zlib * modify .gitmodules * modify .gitmodules * fix setup.py bug * delete submodule cryptopp * fix windows compile bug * fix xxhash compile problem --------- Co-authored-by: Asthestarsfalll <1186454801@qq.com> Co-authored-by: Asthestarsfalll <72954905+Asthestarsfalll@users.noreply.github.com>
Commit 734dc44
Commit 9e045ee
Commit 4f1bf19
Commits on May 18, 2023
[AMP]Master grad in static graph (PaddlePaddle#53362)
* add master gradients on static graph * add unit test for bf16 master grad static graph * use float16 as v100 test dtype * only skip GPU which do not support bf16 * use linear layer to test master grad * 1.push master grad creation before all optimizer ops; 2.remove useless unittest; 3.use a function to create master grad states
Commit 972581d
Commit 65ce688
Commit 92121d1
Commit 6d7076c
rm cmake npu (PaddlePaddle#53869)
* rm cmake npu * Update generic.cmake * Update generic.cmake
Commit 79ce3fa
rm tools npu (PaddlePaddle#53870)
* rm tools npu * Update get_pr_ut.py * Update get_pr_ut.py
Commit d294eef
[XPU] do not call check_nccl_version_for_p2p under xpu (PaddlePaddle#…
…53862) * [XPU] do not call check_nccl_version_for_p2p under xpu * refine code.
Commit 5d638fe
Commit 236e742
support auto generate for op layer_norm (PaddlePaddle#53178)
* simplify layer_norm_op.cc * support auto generate for op layer_norm * update unittest for composite_layer_norm * remove layer_norm_op.cc from scripts * replace layer_norm_op with generated_op * add get_expected_kernel for layer_norm * update cmake kernel register function for layer_norm_mkldnn_op
Commit 4f07b65
Commit 1ac28b6
Commit 2d0c694
[Dy2static-Fallback] add set_eval_frame function in pybind. (PaddlePa…
…ddle#52006) * [Dy2static-Fallback] add set_eval_frame function in pybind. 1. add set_eval_frame function in pybind. * add unittest for eval frame hooker. * [support py38] * fix-GeneratorExit error in eval frame hooker * support python == 3.9 * support 3.10 * fix some comments
Commit 7b1695a
Commit acb5039
Commit 2782b29
move sequence_mask op InferShape func (PaddlePaddle#53782)
* move sequence_mask op InferShape func * add dtype infer
Commit a862deb
Commit d8407c5
Commit e916e80
Commit 117e951
Commit 0bed220
Commit 26da689
Fused elementwises kernels and ops (PaddlePaddle#51427)
* Fused elementwises kernels and ops * change fuse pass name * adjust .pbtxt files * adjust quantization attributes * add missing arguments and fix others, review fixed * simplify fused kernel registration * fix elementwise unit tests * reuse one fused elementwise op * adjust proto * Add supported datatypes * Change 'Scale' to 'scale' in tests, change some tests to onednn * Revert breaking changes * Fix unit tests * Delete obsolete test cases * Delete commented out code * Fix codestyle * delete temporary condition * fix conflicts and delete duplicate fusing * Fix code after merge * Move tests to new directory * fix tests volatility * Rename test_elementwise_add_onednn_op.py to test_elementwise_add_mkldnn_op.py * Update CMakeLists.txt add mkldnn op test --------- Co-authored-by: Silv3S <slawomir.siwek@intel.com>
Commit fb4a6ec
Commit d53d8fd
Commit ba84941
Commit c3c8579
Commit bee8537
Commit 3747978