-
Notifications
You must be signed in to change notification settings - Fork 65
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Evaluation results #1
Comments
I am also interested at this topic. Thanks. |
I run the model on a very small dataset(51 classes with 20 video clips per class) and the result is very strange. Always output the same prediction. I wonder if it can be better if I load the pre-trained weights. I would appreciate anyone who can give me some tips. Thanks, |
I am facing with the same situation with you. I still don't have any ideat about that now. Waiting for the reply for authors. |
I wonder whether the problem results from the code or from my too-small dataset. |
I tried it with a nearly 2000 Videos. Run different Epochs. But still the accuracy is not more than 21.09%. Strange thing is that it's same for most of the runs. No change in figures. |
Your dataset is too small. You can try run ViViT with ViT’s weight loaded for both temporal and spatial part. |
@DylanTao94 Can you share how I can do that. |
Sry mate, my code is not allowed to share. You can follow the steps in ViViT paper. |
yes this model works fine i've tested it on a dataset of 50k videos |
@seandatasci i think I might be doing something wrong with the code. Can you help me out here. My code is here |
i have the same problems with you, and i wonder you have resolved the problems whether or not, the acc or auc results is lower than 50%, the dataset size is also 2000, Thank u |
Inspired from the implementation of the ViViT by the author, we have reimplement the TimeSformer and ViViT, and release the pretrain-model weights on Kinetics600 can be found here |
The model isn't learning. Trained on 2 classes of UCF101 dataset. Adam optimizer, CrossEntropyLoss |
Hi,
Thanks for your work making a Pytorch version of the paper - much appreciated!
How does this implementation compare to results in the original paper. Specifically on the Moments in Time dataset.
Thanks,
Ed
The text was updated successfully, but these errors were encountered: