Skip to content

NUS-HPC-AI-Lab/Enhance-A-Video

Repository files navigation

Enhance-A-Video

Blog | Twitter | Reddit

This repository is the official implementation of Enhance-A-Video: Better Generated Video for Free.

🎥 Demo

demo.-.Compressed.with.FlexClip.mp4

The video has been heavily compressed to GitHub's policy. For more demos, please visit our blog.

🔥🔥🔥News

🎉 Method

method

We design an Enhance Block as a parallel branch. This branch computes the average of non-diagonal elements of temporal attention maps as cross-frame intensity (CFI). An enhanced temperature parameter multiplies the CFI to enhance the temporal attention output.

🛠️ Dependencies and Installation

Install the dependencies:

conda create -n enhanceAvideo python=3.10
conda activate enhanceAvideo
pip install -r requirements.txt

📜 Requirements

The following table shows the requirements for running HunyuanVideo/CogVideoX model (batch size = 1) to generate videos:

Model Setting
(height/width/frame)
Denoising step GPU Peak Memory
HunyuanVideo 720px1280px129f 50 60GB
CogVideoX-2B 480px720px49f 50 10GB

🧱 Inference

Generate videos:

python cogvideox.py
python hunyuanvideo.py

🔗 BibTeX

@misc{luo2024Enhance-A-Video,
      title={Enhance-A-Video: Better Generated Video for Free},
      author={Yang Luo and Xuanlei Zhao and Mengzhao Chen and Kaipeng Zhang and Wenqi Shao and Kai Wang and Zhangyang Wang and Yang You},
      year={2024},
}