Is the mask decoder weight inherited from the teacher models' decoder? #18

Vickeyhw · 2024-02-03T15:45:41Z

If so, in the full-stage knowledge distillation, the image encoder is randomly initialized, is the mask decoder finetuned at a smaller learning rate than the light weight image encoder? Is this consistent with your implementation?

shuh15 · 2024-02-04T09:44:44Z

Yes, the weights for mask decoder are inherited from the teacher, and we use a smaller learning rate for mask decoder compared to image encoder.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Is the mask decoder weight inherited from the teacher models' decoder? #18

Is the mask decoder weight inherited from the teacher models' decoder? #18

Vickeyhw commented Feb 3, 2024

shuh15 commented Feb 4, 2024

Is the mask decoder weight inherited from the teacher models' decoder? #18

Is the mask decoder weight inherited from the teacher models' decoder? #18

Comments

Vickeyhw commented Feb 3, 2024

shuh15 commented Feb 4, 2024