Save state on train end #1168

gesen2egee · 2024-03-10T15:40:39Z

Based on the current implementation, the state is saved as frequently as it is with LoRA.

However, given the significant file size of the state, even with the use of save_last_n_XX to manage and delete old states, it remains cumbersome, especially for those who need to save a large number of step versions (approximately 100-200 steps) to pick up the optimal training outcomes.

This minor modification introduces
--save_state_on_train_end allowing the preservation of only the final state at the end of training. This facilitates the continuation of training for under-trained models.

It has been observed that, compared to resuming with weights, resuming from the state provides a more stable and similar descent curve to the original training process.

kohya-ss · 2024-03-20T09:01:58Z

Thank you! This makes a big sense.

gesen2egee added 2 commits March 10, 2024 23:33

save state on train end

095b803

Update train_network.py

d282c45

kohya-ss changed the base branch from main to dev March 20, 2024 08:49

kohya-ss merged commit bf6cd4b into kohya-ss:dev Mar 20, 2024
1 check passed

bmaltais mentioned this pull request Apr 7, 2024

v23.1.0 bmaltais/kohya_ss#2219

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Save state on train end #1168

Save state on train end #1168

gesen2egee commented Mar 10, 2024 •

edited

Loading

kohya-ss commented Mar 20, 2024

Save state on train end #1168

Save state on train end #1168

Conversation

gesen2egee commented Mar 10, 2024 • edited Loading

kohya-ss commented Mar 20, 2024

gesen2egee commented Mar 10, 2024 •

edited

Loading