CodeGeeX inference support oneflow backend #65
Conversation
from oneflow.nn.parameter import Parameter

def fast_gelu(x):
Optimization 1: quick_gelu
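The body of `fast_gelu` is truncated in this diff. A common tanh-approximation GELU used in Megatron-style models, and a plausible sketch of what the OneFlow port computes here (not confirmed against the actual PR code), is:

```python
import math

def fast_gelu(x):
    # Tanh approximation of GELU; cheaper than the exact erf-based form.
    # 0.7978845608... = sqrt(2 / pi)
    return 0.5 * x * (1.0 + math.tanh(0.7978845608028654 * x * (1.0 + 0.044715 * x * x)))
```

In the model this would be applied elementwise to a tensor; the scalar version above shows the formula itself.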
# Query, Key, and Value
# =====================

if hasattr(torch._C, 'grouped_matmul_bias'):
Optimization 2: grouped matmul
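To illustrate what a grouped matmul+bias kernel fuses: several independent `x @ w + b` products (here, the Q/K/V projections) are batched into a single kernel launch instead of three. The NumPy reference below (`grouped_matmul_bias_ref` is a hypothetical name, not the OneFlow API) shows the semantics only:

```python
import numpy as np

def grouped_matmul_bias_ref(xs, ws, bs):
    # Reference semantics: run each (x @ w + b) separately.
    # A fused grouped kernel would compute all groups in one
    # launch to cut per-kernel launch overhead.
    return [x @ w + b for x, w, b in zip(xs, ws, bs)]

# Example: Q/K/V projections as three groups sharing one input.
x = np.random.randn(4, 8).astype(np.float32)
wq, wk, wv = (np.random.randn(8, 8).astype(np.float32) for _ in range(3))
bq, bk, bv = (np.zeros(8, dtype=np.float32) for _ in range(3))
q, k, v = grouped_matmul_bias_ref([x, x, x], [wq, wk, wv], [bq, bk, bv])
```

The `hasattr` guard in the diff lets the code fall back to separate matmuls when the fused op is not available in the installed OneFlow build.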
origin_key_layer = key_layer
origin_value_layer = value_layer

if hasattr(torch._C, 'fused_multi_head_attention_inference'):
Optimization 3: fused_fmha (fused multi-head attention)
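As a sketch of what a fused multi-head-attention inference kernel computes: scaled dot-product attention per head, with the matmuls and softmax fused into one op so intermediates are not materialized. The NumPy reference below is illustrative only, not the OneFlow kernel's signature:

```python
import numpy as np

def mha_inference_ref(q, k, v):
    # q, k, v: [batch, heads, seq, head_dim].
    # Computes softmax(q @ k^T / sqrt(d)) @ v; a fused kernel does
    # the same math in a single launch without storing `scores`/`probs`.
    d = q.shape[-1]
    scores = q @ k.transpose(0, 1, 3, 2) / np.sqrt(d)
    scores -= scores.max(axis=-1, keepdims=True)  # numerically stable softmax
    probs = np.exp(scores)
    probs /= probs.sum(axis=-1, keepdims=True)
    return probs @ v
```

Again the `hasattr` guard keeps the unfused path as a fallback for OneFlow builds without the fused op.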
context_length=None,
):

# hidden_states: [sq, b, h]
TopQueryAttention is optimized in the same way as SelfAttention.
from codegeex.oneflow import CodeGeeXModel
from codegeex.tokenizer import CodeGeeXTokenizer
from codegeex.quantization import quantize
os.environ["ONEFLOW_KERNEL_ENABLE_FUSED_LINEAR"] = "1"
Optimization 4: fuse matmul with bias_add.
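Numerically the fusion changes nothing: with `ONEFLOW_KERNEL_ENABLE_FUSED_LINEAR=1`, the matmul and the following bias_add run as one kernel instead of two, so only the launch count drops. A NumPy sketch of the equivalence (illustrative only, not OneFlow code):

```python
import numpy as np

x = np.random.randn(4, 16).astype(np.float32)
w = np.random.randn(16, 32).astype(np.float32)
b = np.random.randn(32).astype(np.float32)

# Unfused path: two separate ops, matmul then bias_add.
unfused = np.add(np.matmul(x, w), b)
# Fused path computes the same x @ w + b in a single kernel launch.
fused = x @ w + b
```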
Below are the FP16 performance results for oneflow, FasterTransformer, and PyTorch:
Because mock torch runs into some issues in this application, the PyTorch version of the code cannot be switched to oneflow with a single flag; instead, we support the oneflow backend for CodeGeeX inference through a separately added script.