-
Notifications
You must be signed in to change notification settings - Fork 7k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Cannot reproduce performance of Keypoint R-CNN with ResNet-50 #1606
Comments
Thanks a lot for opening this issue! I'm having a look at it |
Some follow-up information: both |
Ok, I think I found the issue. I have mistakenly took the wrong model checkpoint when uploading the model for keypoint rcnn. I took the checkpoint for epoch 29, instead of the one for epoch 45... Because the number of images is smaller in the person keypoint subset of COCO, the number of epochs should be adapted so that we have the same number of iterations. I'll upload the correct weights soon and let you know, thanks a lot for opening the issue! |
@yoshitomo-matsubara should be fixed in #1609 |
Hi @fmassa One more quick quick question about case b of mine above before you close this issue: It would be very appreciated if you can provide the hyperparameters (as a document or comments like this) to train each of the models so that we can have a better idea as we train them by ourselves e.g., using different datasets, models, etc |
@yoshitomo-matsubara I'll open a PR with the training hyperparameters for training. Faster R-CNN
Mask R-CNN
Keypoint R-CNN
Also, if you could send a PR with those schedules it would be great! |
I tried two approaches for reproducing the performance of Keypoint R-CNN with ResNet-50, box AP = 54.6, keypoint AP = 65.0:
a) use pretrained Keypoint R-CNN with train.py
b) train Keypoint R-CNN by myself with train.py
But either didn't reproduce the performance. As for a), my guess is I need to set some parameters besides pretrained flag.
Could you please help me reproduce the performance, hopefully for both a) and b)? More details about my results are given as follows.
Environment
Details
a) use pretrained Keypoint R-CNN with train.py
command:
pipenv run python train.py --data-path ./coco2017/ --dataset coco_kp --model keypointrcnn_resnet50_fpn --test-only --pretrained
log
b) use pretrained Keypoint R-CNN with train.py
command:
pipenv run python -m torch.distributed.launch --nproc_per_node=3 --use_env python train.py --data-path ./coco2017/ --dataset coco_kp --model keypointrcnn_resnet50_fpn --world-size 3 --lr 0.0075
Learning rate
lr
is set by following a suggestion intrain.py
box AP = 50.6 (Average Precision (AP) @[ IoU=0.50:0.95 | area= all | maxDets=100 ])
keypoint AP = 61.1 (Average Precision (AP) @[ IoU=0.50:0.95 | area= all | maxDets= 20 ])
Thank you!
The text was updated successfully, but these errors were encountered: