[python] Adds built-in DeepSpeed handler #292

frankfliu · 2022-10-16T20:50:02Z

Description

Brief description of what this PR is about

If this change is a backward incompatible change, why must this change be made?
Interesting edge cases to note here

lanking520

I would suggest we revisit the DeepSpeed config to make it more generic. This solution may only apply to small, simple models. For large models that need CPU-memory-saving loading, DeepSpeed checkpoint loading may not work.

lanking520 · 2022-10-16T21:42:06Z

engines/python/setup/djl_python/deepspeed.py

+        model = deepspeed.init_inference(model,
+                                         mp_size=mp_size,
+                                         dtype=model.dtype,
+                                         replace_method='auto',


This line (replace_method='auto') is not always needed. Loading from a checkpoint sometimes requires removing it.
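
One way to make this optional is to build the keyword arguments conditionally. The following is only a sketch: build_init_inference_kwargs is a hypothetical helper that is not part of this PR, and the assumption that replace_method should be dropped whenever a checkpoint is supplied comes from the comment above and has not been verified against a specific DeepSpeed version.

```python
def build_init_inference_kwargs(mp_size, dtype, checkpoint=None):
    """Hypothetical helper: assemble keyword arguments for deepspeed.init_inference."""
    kwargs = {"mp_size": mp_size, "dtype": dtype}
    if checkpoint is None:
        # No checkpoint: keep the automatic module replacement from the diff.
        kwargs["replace_method"] = "auto"
    else:
        # Assumption: when a DeepSpeed checkpoint is given, replace_method is omitted.
        kwargs["checkpoint"] = checkpoint
    return kwargs


# Intended use inside the handler (requires deepspeed and a loaded model):
# model = deepspeed.init_inference(
#     model, **build_init_inference_kwargs(mp_size, model.dtype))
```

The handler would then pass a checkpoint only when one is configured, and the small-model path stays unchanged.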

lanking520 · 2022-10-16T21:43:00Z

engines/python/src/test/resources/gpt2/serving.properties

+option.entryPoint=djl_python.deepspeed
+option.parallel_loading=true
+option.tensor_parallel_degree=2
+option.model_loading_timeout=600


Is this timeout too short?

lanking520 · 2022-10-16T21:43:53Z

engines/python/setup/djl_python/deepspeed.py

+        model = AutoModelForCausalLM.from_pretrained(model_id)
+        tokenizer = AutoTokenizer.from_pretrained(model_id)
+        if data_type == "fp16":
+            model.half()


It may be more efficient to let Hugging Face do the FP16 conversion during loading.
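
A minimal sketch of that suggestion, assuming the handler keeps its model_id and data_type values: pass torch_dtype to from_pretrained so the weights are loaded in half precision directly, instead of loading in FP32 and calling model.half() afterwards. The helper names resolve_dtype_name and load_model, and the extra "fp32" and "bf16" mappings, are illustrative and not part of this PR.

```python
def resolve_dtype_name(data_type):
    """Hypothetical helper: map the handler's data_type option to a torch dtype name."""
    mapping = {"fp16": "float16", "fp32": "float32", "bf16": "bfloat16"}
    # Fall back to full precision for unknown or missing values.
    return mapping.get(data_type, "float32")


def load_model(model_id, data_type):
    # Imports are deferred so resolve_dtype_name works without torch installed.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    torch_dtype = getattr(torch, resolve_dtype_name(data_type))
    # from_pretrained creates the weights in the requested dtype while loading.
    model = AutoModelForCausalLM.from_pretrained(model_id,
                                                 torch_dtype=torch_dtype)
    tokenizer = AutoTokenizer.from_pretrained(model_id)
    return model, tokenizer
```

This avoids materializing a full FP32 copy of the model before conversion, which matters more as model size grows.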

frankfliu requested a review from zachgk as a code owner October 16, 2022 20:50

lanking520 reviewed Oct 16, 2022

[python] Adds built-in DeepSpeed handler

dbb1835

frankfliu force-pushed the gpt2 branch from ad0fd8c to dbb1835 Compare October 17, 2022 22:54

lanking520 approved these changes Oct 18, 2022

lanking520 merged commit 440c2fa into deepjavalibrary:master Oct 18, 2022

frankfliu deleted the gpt2 branch October 18, 2022 17:56
