[LoRA] Support LTX Video #10228

a-r-r-o-w · 2024-12-16T02:13:51Z

~~Finetuning script on-the-way soon! WIP~~

Fine-tuning script: a-r-r-o-w/finetrainers#123

To run training, use:

#!/bin/bash

# export TORCH_LOGS="+dynamo,recompiles,graph_breaks"
# export TORCHDYNAMO_VERBOSE=1
export WANDB_MODE="offline"
export NCCL_P2P_DISABLE=1
export TORCH_NCCL_ENABLE_MONITORING=0

GPU_IDS="2,3"

DATA_ROOT="/raid/aryan/video-dataset-disney"
CAPTION_COLUMN="prompts.txt"
VIDEO_COLUMN="videos.txt"

# Model arguments
model_cmd="--model_name ltx_video \
  --pretrained_model_name_or_path Lightricks/LTX-Video"

# Dataset arguments
dataset_cmd="--data_root $DATA_ROOT \
  --video_column $VIDEO_COLUMN \
  --caption_column $CAPTION_COLUMN \
  --id_token afkx \
  --video_resolution_buckets 49x480x768 \
  --caption_dropout_p 0.05"

# Dataloader arguments
dataloader_cmd="--dataloader_num_workers 0"

# Diffusion arguments
diffusion_cmd="--flow_resolution_shifting"

# Training arguments
training_cmd="--training_type lora \
  --seed 42 \
  --mixed_precision bf16 \
  --batch_size 1 \
  --train_steps 2000 \
  --rank 128 \
  --lora_alpha 64 \
  --target_modules to_q to_k to_v to_out.0 \
  --gradient_accumulation_steps 1 \
  --gradient_checkpointing \
  --checkpointing_steps 500 \
  --checkpointing_limit 2 \
  --enable_slicing \
  --enable_tiling"

# Optimizer arguments
optimizer_cmd="--optimizer adamw \
  --lr 1e-4 \
  --scale_lr \
  --lr_scheduler cosine_with_restarts \
  --lr_warmup_steps 100 \
  --lr_num_cycles 1 \
  --beta1 0.9 \
  --beta2 0.95 \
  --weight_decay 1e-4 \
  --epsilon 1e-8 \
  --max_grad_norm 1.0"

# Validation arguments
validation_cmd="--validation_prompts \"afkx A black and white animated scene unfolds with an anthropomorphic goat surrounded by musical notes and symbols, suggesting a playful environment. Mickey Mouse appears, leaning forward in curiosity as the goat remains still. The goat then engages with Mickey, who bends down to converse or react. The dynamics shift as Mickey grabs the goat, potentially in surprise or playfulness, amidst a minimalistic background. The scene captures the evolving relationship between the two characters in a whimsical, animated setting, emphasizing their interactions and emotions.@@@49x480x768:::A woman with long brown hair and light skin smiles at another woman with long blonde hair. The woman with brown hair wears a black jacket and has a small, barely noticeable mole on her right cheek. The camera angle is a close-up, focused on the woman with brown hair's face. The lighting is warm and natural, likely from the setting sun, casting a soft glow on the scene. The scene appears to be real-life footage@@@49x480x768\" \
  --num_validation_videos 1 \
  --validation_steps 100"

# Miscellaneous arguments
miscellaneous_cmd="--tracker_name finetrainers-ltxv \
  --output_dir /raid/aryan/ltx-video \
  --nccl_timeout 1800 \
  --report_to wandb"

cmd="accelerate launch --config_file accelerate_configs/uncompiled_2.yaml --gpu_ids $GPU_IDS train.py \
  $model_cmd \
  $dataset_cmd \
  $dataloader_cmd \
  $diffusion_cmd \
  $training_cmd \
  $optimizer_cmd \
  $validation_cmd \
  $miscellaneous_cmd"

echo "Running command: $cmd"
eval $cmd
echo -ne "-------------------- Finished executing script --------------------\n\n"

HuggingFaceDocBuilderDev · 2024-12-16T02:20:38Z

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint. The docs are available until 30 days after the last update.

src/diffusers/loaders/lora_pipeline.py

src/diffusers/models/transformers/transformer_ltx.py

sayakpaul

Okay for me to merge once we have added a test suite test_lora_layers_ltx_video.py.

a-r-r-o-w · 2024-12-17T05:20:15Z

@sayakpaul So, either CogVideoX or Mochi is okay for copying since they are same. I've added the tests as well. Could you give it a quick look too?

sayakpaul

Awesome. LGTM!

tests/lora/test_lora_layers_ltx_video.py

* add lora support for ltx * add tests * fix copied from comments * update --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

add lora support for ltx

7d96d94

a-r-r-o-w added lora roadmap Add to current release roadmap labels Dec 16, 2024

a-r-r-o-w mentioned this pull request Dec 16, 2024

LTX Video a-r-r-o-w/finetrainers#123

Merged

Merge branch 'main' into ltx-lora-support

598596a

a-r-r-o-w marked this pull request as ready for review December 17, 2024 02:07

Merge branch 'main' into ltx-lora-support

307b797

a-r-r-o-w requested a review from sayakpaul December 17, 2024 04:55

sayakpaul reviewed Dec 17, 2024

View reviewed changes

src/diffusers/loaders/lora_pipeline.py Show resolved Hide resolved

sayakpaul reviewed Dec 17, 2024

View reviewed changes

src/diffusers/models/transformers/transformer_ltx.py Show resolved Hide resolved

sayakpaul approved these changes Dec 17, 2024

View reviewed changes

a-r-r-o-w added 2 commits December 17, 2024 06:11

add tests

6809572

fix copied from comments

7c6e385

sayakpaul approved these changes Dec 17, 2024

View reviewed changes

tests/lora/test_lora_layers_ltx_video.py Outdated Show resolved Hide resolved

tests/lora/test_lora_layers_ltx_video.py Show resolved Hide resolved

tests/lora/test_lora_layers_ltx_video.py Show resolved Hide resolved

a-r-r-o-w and others added 2 commits December 17, 2024 06:38

update

5b0e7aa

Merge branch 'main' into ltx-lora-support

3583908

a-r-r-o-w merged commit ac86393 into main Dec 17, 2024
14 of 15 checks passed

a-r-r-o-w deleted the ltx-lora-support branch December 17, 2024 06:35

sayakpaul added a commit that referenced this pull request Dec 23, 2024

[LoRA] Support LTX Video (#10228)

2e1bd6c

* add lora support for ltx * add tests * fix copied from comments * update --------- Co-authored-by: Sayak Paul <spsayakpaul@gmail.com>

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[LoRA] Support LTX Video #10228

[LoRA] Support LTX Video #10228

a-r-r-o-w commented Dec 16, 2024 •

edited

Loading

HuggingFaceDocBuilderDev commented Dec 16, 2024

sayakpaul left a comment

a-r-r-o-w commented Dec 17, 2024

sayakpaul left a comment

[LoRA] Support LTX Video #10228

[LoRA] Support LTX Video #10228

Conversation

a-r-r-o-w commented Dec 16, 2024 • edited Loading

HuggingFaceDocBuilderDev commented Dec 16, 2024

sayakpaul left a comment

Choose a reason for hiding this comment

a-r-r-o-w commented Dec 17, 2024

sayakpaul left a comment

Choose a reason for hiding this comment

a-r-r-o-w commented Dec 16, 2024 •

edited

Loading