
auto_lr_find does not work if there is a BackboneFinetuning callback #14674

Open
ejm714 opened this issue Sep 12, 2022 · 2 comments
Labels: bug (Something isn't working), callback: finetuning, help wanted (Open to be worked on), tuner

Comments


ejm714 commented Sep 12, 2022

🐛 Bug

auto_lr_find does not properly restore the model for training if there is a BackboneFinetuning callback.

To Reproduce

Specify a BackboneFinetuning callback, set auto_lr_find to True, and then run tune and fit.

from pytorch_lightning import Trainer
from pytorch_lightning.callbacks import BackboneFinetuning

trainer = Trainer(
    auto_lr_find=True,
    callbacks=[BackboneFinetuning()],
)
trainer.tune(model, train_dataloaders=train_data, val_dataloaders=val_data)
trainer.fit(model, train_dataloaders=train_data, val_dataloaders=val_data)

which yields the following error:

/usr/local/lib/python3.7/dist-packages/pytorch_lightning/callbacks/finetuning.py in on_fit_start(self, trainer, pl_module)
    103             for opt_idx, optimizer in enumerate(trainer.optimizers):
    104                 param_groups = self._apply_mapping_to_param_groups(
--> 105                     self._internal_optimizer_metadata[opt_idx], named_parameters
    106                 )
    107                 optimizer.param_groups = param_groups

KeyError: 0

See notebook example: https://colab.research.google.com/drive/1ajrSRge90RM8Rlcwk0HyEosLLpOpyvg-

Expected behavior

After auto_lr_find runs, the model should be restored to its initial state and training should proceed with the learning rate that was found.

Environment

See bottom cell of colab notebook.

Additional context

I think the culprit is that BackboneFinetuning.on_fit_start now calls BaseFinetuning.on_fit_start, which then assumes the model is being restored from a checkpoint.
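To illustrate the failure mode described above, here is a minimal pure-Python sketch (the class and method names are hypothetical stand-ins, not the actual pytorch_lightning internals): the restore path indexes saved per-optimizer metadata, but after lr_find no metadata was ever saved, so looking up optimizer 0 raises KeyError: 0.

```python
# Hypothetical sketch of why the restore path raises KeyError: 0.
# The callback's on_fit_start assumes a restart means "resume from checkpoint"
# and tries to read saved param-group metadata that was never populated.

class FinetuningCallbackSketch:
    def __init__(self):
        # Only populated when restoring from a real checkpoint.
        self._internal_optimizer_metadata = {}

    def on_fit_start(self, restarting):
        if restarting:
            # Mirrors the failing line: metadata for optimizer 0 was never saved.
            return self._internal_optimizer_metadata[0]
        return None


cb = FinetuningCallbackSketch()
try:
    # After tuner.tune(), the trainer looks "restarted" even though no
    # checkpoint metadata exists, reproducing the reported failure.
    cb.on_fit_start(restarting=True)
except KeyError as exc:
    print(f"KeyError: {exc}")  # KeyError: 0
```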

It looks like the bug was introduced in this PR: 07635d0#diff-ac96be7ba54bac4d7dc79ee012a211498fb97689e37026fe8a1b06a359079224R410

The fix will need to both support the finetuning callbacks when training is resumed from a checkpoint and support using auto_lr_find when a BackboneFinetuning callback is attached to the trainer.
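One possible shape for such a fix is to guard the restore branch on whether metadata for that optimizer was actually saved, falling back to normal setup otherwise. This is a hypothetical sketch with illustrative names, not the actual Lightning implementation:

```python
# Hypothetical sketch of a guarded on_fit_start: only take the
# checkpoint-restore branch when saved param-group metadata exists for
# that optimizer; otherwise do a fresh setup (e.g. right after lr_find).

def on_fit_start_sketch(internal_metadata, optimizers, restarting):
    actions = []
    for opt_idx, _optimizer in enumerate(optimizers):
        if restarting and opt_idx in internal_metadata:
            # Genuine checkpoint resume: saved metadata exists, restore it.
            actions.append((opt_idx, "restore"))
        else:
            # No saved metadata (fresh fit or post-tuner run): set up normally.
            actions.append((opt_idx, "setup"))
    return actions


# After lr_find the metadata dict is empty, so fit sets up instead of crashing:
print(on_fit_start_sketch({}, ["opt0"], restarting=True))  # [(0, 'setup')]
```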

cc @akihironitta @Borda @rohitgr7

@ejm714 ejm714 added the needs triage Waiting to be triaged by maintainers label Sep 12, 2022
@ejm714 ejm714 changed the title auto_lr_find does not work is there is a BackboneFinetuning callback auto_lr_find does not work if there is a BackboneFinetuning callback Sep 12, 2022
@carmocca carmocca added this to the pl:1.7.x milestone Sep 15, 2022
@carmocca carmocca added bug Something isn't working tuner callback: finetuning and removed needs triage Waiting to be triaged by maintainers labels Sep 15, 2022
@carmocca carmocca modified the milestones: pl:1.7.x, v1.8.x Oct 13, 2022
@Borda Borda modified the milestones: v1.8.x, v1.9 Jan 6, 2023
@Borda Borda modified the milestones: v1.9, v1.9.x Jan 16, 2023
@awaelchli awaelchli added the help wanted Open to be worked on label Dec 31, 2023
@awaelchli awaelchli removed this from the v1.9.x milestone Dec 31, 2023
@granthamtaylor

I came across the same issue 1.5 years later.

I cannot use BaseFinetuning and LearningRateFinder at the same time; doing so raises KeyError: 0.

@patrontheo

Same for me.
@Borda @awaelchli Any plans to fix this?


6 participants