A video recognition system that automatically describes the contents of a video in English
- input: a video from a fixed camera
- output: a text file containing English descriptions of the moving objects in the video
- used OpenCV background subtraction to detect moving objects in the video
- applied a convolutional neural network (VGG) to label the detected objects