Can you share the dataset class of SST-5, SNLI, TREC datasets? #36

zimingyy · 2024-08-14T12:13:01Z

Hi, i am interested in your non-differentiable objectives experiments using MeZO, but i don't find the dataset class and prompt template of SST-5, SNLI, TREC datasets. Can you share the dataset class of SST-5, SNLI, TREC datasets? Thank you very much!!

zimingyy · 2024-08-14T12:19:36Z

Also, I tried to modify the code to support the zero-order optimization training of accuracy, an non-differentiable objective function. I use roberta-large model and SST2 dataset. I set the batchsize to 512 and the learning rate to 1e-6 and 5e-7. I tried to reproduce the results in your paper, but the training results were poor. Can you share this part of your code implementation?

gaotianyu1350 · 2024-08-14T12:39:22Z

Hi,

You can run the non-differentiable example by (large models, squad, also mentioned in README)

MODEL=facebook/opt-13b TASK=SQuAD MODE=prefix LR=1e-2 EPS=1e-1 bash mezo.sh --non_diff --evaluation_strategy no --save_strategy no --save_model

The implementation is here:

MeZO/large_models/trainer.py

Line 734 in 552cb1b

def zo_forward_nondiff(self, model, inputs):

zimingyy · 2024-08-14T12:49:12Z

thanks for your reply! Sure, i have already tried the OPT-13b model finetuning on Squad dataset using MeZO, and the result is quite good. I want to try more non-differentiable example, such as Classfication tasks (accuracy metric), Can you share this part of your code implementation? I really appreciate your help.

gaotianyu1350 · 2024-08-26T11:09:46Z

Hi Ziming,

I realized the feature is actually provided. It is implemented under the flag --optimize_acc in the medium sized model folder.

zimingyy · 2024-08-26T11:39:51Z

Yes, I have resolved my issue, and I am very grateful for your enthusiastic assistance!!!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Can you share the dataset class of SST-5, SNLI, TREC datasets? #36

Can you share the dataset class of SST-5, SNLI, TREC datasets? #36

zimingyy commented Aug 14, 2024

zimingyy commented Aug 14, 2024

gaotianyu1350 commented Aug 14, 2024

zimingyy commented Aug 14, 2024

gaotianyu1350 commented Aug 26, 2024

zimingyy commented Aug 26, 2024

Can you share the dataset class of SST-5, SNLI, TREC datasets? #36

Can you share the dataset class of SST-5, SNLI, TREC datasets? #36

Comments

zimingyy commented Aug 14, 2024

zimingyy commented Aug 14, 2024

gaotianyu1350 commented Aug 14, 2024

zimingyy commented Aug 14, 2024

gaotianyu1350 commented Aug 26, 2024

zimingyy commented Aug 26, 2024