
The link to the model weights is now unavailable. #9

Open
BeachWang opened this issue Feb 5, 2024 · 4 comments

Comments


BeachWang commented Feb 5, 2024

Hi,

I tried to reproduce your work with the released code. The loss in pretraining and finetuning meets expectations, but I get ridiculous answers in evaluation. For example:

Q:
Is there a snowboard in the image?
Answer the question using a single word or phrase.

A:
50

So could you upload the model weights to help me debug?
Thank you very much!

@clarencerat

Yeah, the model weights are not available. Authors of the repo, please release them; I am excited to see them.


BeachWang commented Feb 6, 2024

I found that the link to Hugging Face is available. But I tested the weights on MME and only got a score of 1110.96.

@BeachWang (Author)

> Hi,
>
> I tried to reproduce your work with the released code. The loss in pretraining and finetuning meets expectations, but I get ridiculous answers in evaluation. For example:
>
> Q: Is there a snowboard in the image? Answer the question using a single word or phrase.
>
> A: 50
>
> So could you upload the model weights to help me debug? Thank you very much!

The bug may be in `preprocess_v0` in `train.py`. It should be

```python
# instruction_len = len(tokenizer(parts[0]).input_ids)
instruction_len = len(tokenizer(parts[0]).input_ids) - 1
```

since the trailing space in `parts[0]` is tokenized as its own token when `parts[0]` is tokenized alone, but merges with the following content in the target.
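To make the off-by-one concrete, here is a toy illustration (not the repo's real tokenizer) of the behavior described above: SentencePiece-style tokenizers attach a word-start marker `▁` to each word, so a trailing space tokenized in isolation becomes a lone `▁` token, while in the full conversation that same space merges into the next word. All names here (`sp_like_tokenize`, `mask_instruction` inputs) are illustrative assumptions, not the repo's API.

```python
# Toy model of SentencePiece-style tokenization, to show why the -1 is
# needed when computing instruction_len for label masking.

IGNORE_INDEX = -100  # positions excluded from the loss, as in common LLaVA-style code

def sp_like_tokenize(text):
    # Each word becomes "▁word"; a dangling trailing space becomes a
    # lone "▁" token. This mimics (crudely) SentencePiece behavior.
    tokens = ["▁" + w for w in text.split(" ") if w]
    if text.endswith(" "):
        tokens.append("▁")
    return tokens

instruction = "USER: Is there a snowboard? ASSISTANT: "   # parts[0], ends with a space
conversation = instruction + "No"                         # target appended

input_ids = sp_like_tokenize(conversation)

# Tokenized alone, the trailing space adds one extra token, so the
# naive length overcounts the instruction boundary by one.
naive_len = len(sp_like_tokenize(instruction))  # 7 tokens
fixed_len = naive_len - 1                        # 6 tokens: the proposed fix

labels = list(input_ids)
labels[:fixed_len] = [IGNORE_INDEX] * fixed_len
# With the fix, only the answer token "▁No" remains supervised; with
# naive_len, the answer itself would be masked out and the model would
# never learn to produce it.
```

With `naive_len`, every position including the answer is masked, which is consistent with the degenerate answers ("50") reported above.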


yhcao6 commented Mar 22, 2024

> Hi,
> I tried to reproduce your work with the released code. The loss in pretraining and finetuning meets expectations, but I get ridiculous answers in evaluation. For example:
> Q: Is there a snowboard in the image? Answer the question using a single word or phrase.
> A: 50
> So could you upload the model weights to help me debug? Thank you very much!
>
> The bug may be in `preprocess_v0` in `train.py`. It should be
>
> ```python
> # instruction_len = len(tokenizer(parts[0]).input_ids)
> instruction_len = len(tokenizer(parts[0]).input_ids) - 1
> ```
>
> since the trailing space in `parts[0]` is tokenized as its own token when `parts[0]` is tokenized alone, but merges with the following content in the target.

I also got 1110.96 on MME. Have you fixed the performance issue?
