
[BUG] _raise_timeout_error when training chatglm2-6b #2713

Closed
wangshuai09 opened this issue Nov 22, 2023 · 0 comments · Fixed by #2715

Describe the bug
Loading /home/wangshuai/models/chatglm2-6b requires to execute some code in that repo, you can inspect the content of the repository at https://hf.co//home/wangshuai/models/chatglm2-6b. You can dismiss this prompt by passing trust_remote_code=True.
Do you accept? [y/N] True False None
Loading /home/wangshuai/models/chatglm2-6b requires to execute some code in that repo, you can inspect the content of the repository at https://hf.co//home/wangshuai/models/chatglm2-6b. You can dismiss this prompt by passing trust_remote_code=True.
Do you accept? [y/N]
Traceback (most recent call last):
  File "/root/miniconda3/envs/torch_npu/lib/python3.8/site-packages/transformers/dynamic_module_utils.py", line 597, in resolve_trust_remote_code
    answer = input(
  File "/root/miniconda3/envs/torch_npu/lib/python3.8/site-packages/transformers/dynamic_module_utils.py", line 577, in _raise_timeout_error
    raise ValueError(
ValueError: Loading this model requires you to execute the configuration file in that repo on your local machine. We asked if it was okay but did not get an answer. Make sure you have read the code there to avoid malicious use, then set the option trust_remote_code=True to remove this error.

To Reproduce
Model downloaded from Hugging Face: chatglm2-6b
The training script is:

torchrun --nproc_per_node=4 --master_port=20001 fastchat/train/train.py \
    --model_name_or_path /home/xxx/models/chatglm2-6b \
    --data_path /home/xxx/datasets/evol-instruct-chinese/evol-instruct-chinese-1024-subset.json \
    --fp16 True \
    --output_dir output_chatglm \
    --num_train_epochs 5 \
    --per_device_train_batch_size 8 \
    --per_device_eval_batch_size 1 \
    --gradient_accumulation_steps 1 \
    --evaluation_strategy "no" \
    --save_strategy "epoch" \
    --learning_rate 5e-5 \
    --weight_decay 0. \
    --lr_scheduler_type "cosine" \
    --logging_steps 1 \
    --fsdp "full_shard auto_wrap" \
    --model_max_length 512 \
    --gradient_checkpointing True \
    --lazy_preprocess True

Reason
trust_remote_code should be set to True to allow executing code from the Hub on the local machine.
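Why an explicit flag avoids the error can be sketched as follows. This is a hypothetical simplification of the decision in transformers' dynamic_module_utils.py, not its actual code: when the caller has already answered True or False, the loader never falls through to the interactive prompt that timed out in the traceback.

```python
def resolve_trust_remote_code_sketch(trust_remote_code, has_remote_code=True):
    # Hypothetical simplification: an explicit True/False answer skips the
    # interactive stdin prompt entirely.
    if not has_remote_code:
        # No custom code in the repo, nothing to trust.
        return False
    if trust_remote_code is None:
        # Without an explicit flag the loader prompts on stdin and, if no
        # answer arrives in time, raises the ValueError seen in the traceback.
        raise ValueError(
            "We asked if it was okay but did not get an answer. Set "
            "trust_remote_code=True after reviewing the repository code."
        )
    return trust_remote_code


# With the flag decided up front, loading proceeds without any prompt:
assert resolve_trust_remote_code_sketch(True) is True
```

In practice the fix amounts to passing trust_remote_code=True to the model and tokenizer loading calls in fastchat/train/train.py, after reviewing the model repository's code.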
