Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

m1 32g内存使用mps进行推理非常慢 #835

Open
1 task done
sdhjl2000 opened this issue Apr 27, 2023 · 1 comment
Open
1 task done

m1 32g内存使用mps进行推理非常慢 #835

sdhjl2000 opened this issue Apr 27, 2023 · 1 comment

Comments

@sdhjl2000
Copy link

Is there an existing issue for this?

  • I have searched the existing issues

Current Behavior

内存吃满,运行其慢,比使用cpu(float)还慢,求教如何调整

image

Expected Behavior

No response

Steps To Reproduce

tokenizer = AutoTokenizer.from_pretrained("/Users/XX/Projects/6b/chatglm-6b", trust_remote_code=True) model = AutoModel.from_pretrained("/Users/XX/Projects/6b/chatglm-6b", trust_remote_code=True).half().to("mps")

Environment

m1  13.3.1 (22E261) 
pytorch  2.1.0.dev20230425

Anything else?

No response

@sdhjl2000
Copy link
Author

#462

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant