You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Thank you for making your work and code available to the community. We have encountered a few issues while reproducing the results from the paper using the provided repository. We would appreciate your guidance on the following points:
Test Performance Discrepancy:
When using the provided pre-trained weights and running the test script, we observed that the performance is consistently lower by one point compared to the results reported in the paper. Could you please provide insights into any additional steps or configurations that might be necessary to achieve the reported performance?
Batch Size:
In the paper, the batch size is mentioned as 8, while the default batch size in the provided code is set to 16. Could you please confirm which batch size was used for the results reported in the paper? Additionally, if there were any specific reasons for this discrepancy, we would appreciate an explanation.
Initialization of Weights:
It is not clear from the provided documentation whether the initial weights in the training process are randomly initialized or if they use pre-trained weights. Could you please clarify the weight initialization strategy used in your experiments?
Any assistance or clarification on these points would be highly appreciated. Thank you for your time and support.
Best regards,
Amordia
The text was updated successfully, but these errors were encountered:
Sorry but I don't know the reason for the test performance discrepancy. Maybe it is becasue the gpu difference? We test the model on RTX 3090.
We have trained the model with different batch sizes and found that the size of the batch does not significantly affect performance. If GPU allows, we recommend using a larger batch size for faster training.
We used the checkpoints pre-trained on ImageNet by MAE in this url which is same as CounTR.
Dear Authors,
Thank you for making your work and code available to the community. We have encountered a few issues while reproducing the results from the paper using the provided repository. We would appreciate your guidance on the following points:
Test Performance Discrepancy:
Batch Size:
Initialization of Weights:
Any assistance or clarification on these points would be highly appreciated. Thank you for your time and support.
Best regards,
Amordia
The text was updated successfully, but these errors were encountered: