Nuclio Automatic/SemiAutomatic AI Tool Functions not running on GPU #2489

machinsk · 2020-11-25T04:33:18Z

Not sure if this is an elephant in the room with Intel not wanting support for Nvidia GPUs, but the models CVAT uses with the Nuclio serverless functions do not run with GPU acceleration. The documentation is not great on this topic, but I've pieced together everything I could find.

I'm deploying CVAT to an AWS EC2 g4dn.xlarge instance (2nd Generation Intel Xeon, 4 vCPUs, 16 GB RAM, Nvidia Tesla T4 GPU with 16 GB VRAM) running Ubuntu 20.04. I can get all the functions from serverless/deploy.sh to appear under Models, and most of them run (the ones that don't are a different issue), but every one that does run uses the CPU.

I've tried running with and without nvidia-docker (which the installation instructions do not mention). I've tried the suggestion here. I have the Nvidia gaming drivers installed for the G4 instances. I have Docker version 19.03.13, Nvidia drivers 445.48 and CUDA version 11.0.

We even tried changing cvat/requirements/base.txt to use tensorflow-gpu instead of tensorflow.
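Since each serverless function runs in its own Docker container (see the reply below), a package change in base.txt may not reach the function's container at all. A quick way to find out what TensorFlow actually sees is to run a check like the following inside the function's container. This is only a sketch: `tensorflow_gpus` is a hypothetical helper name, and it assumes a TensorFlow version that provides `tf.config.list_physical_devices` or its older `tf.config.experimental` equivalent.

```python
def tensorflow_gpus():
    """Return the names of GPUs TensorFlow can see, or None if TensorFlow is not installed.

    Hypothetical helper for illustration; run it in the same environment
    (for example, the function's container) that runs the model.
    """
    try:
        import tensorflow as tf
    except ImportError:
        return None
    # Newer TensorFlow releases expose this directly under tf.config.
    if hasattr(tf.config, "list_physical_devices"):
        devices = tf.config.list_physical_devices("GPU")
    else:
        # Older releases keep it under tf.config.experimental.
        devices = tf.config.experimental.list_physical_devices("GPU")
    return [device.name for device in devices]
```

An empty list means TensorFlow is installed but sees no GPU, which points at the container's drivers, CUDA libraries, or runtime rather than at the model code.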

To test, we create a task with a 180-frame video. On the /task page in the UI, we select the ellipsis beside Action, choose Automatic annotation, and pick a model (mostly Faster RCNN via Tensorflow). Then we run gpustat in the terminal to check usage on the Tesla T4 GPU, which has only read zero so far. htop, however, shows over 100% CPU utilization on a nuclio task while the progress bar moves in the UI.
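The same check can be scripted so that a short GPU spike is not missed between manual looks. The sketch below polls `nvidia-smi` (rather than gpustat) once per interval and records utilization and memory use. The function names are hypothetical, and it assumes `nvidia-smi` is on the PATH and reports numeric values for both fields.

```python
import shutil
import subprocess
import time

# Standard nvidia-smi query: one CSV line per GPU, e.g. "0, 3".
NVIDIA_SMI_QUERY = [
    "nvidia-smi",
    "--query-gpu=utilization.gpu,memory.used",
    "--format=csv,noheader,nounits",
]


def parse_samples(text):
    """Parse 'util, mem' lines into (util_percent, mem_mib) tuples, one per GPU."""
    samples = []
    for line in text.strip().splitlines():
        util, mem = (field.strip() for field in line.split(","))
        samples.append((int(util), int(mem)))
    return samples


def sample_gpus():
    """Return one reading per GPU, or None if nvidia-smi is not installed."""
    if shutil.which("nvidia-smi") is None:
        return None
    result = subprocess.run(
        NVIDIA_SMI_QUERY, capture_output=True, text=True, check=True
    )
    return parse_samples(result.stdout)


def watch(seconds=30, interval=1.0):
    """Collect readings for roughly `seconds`; stops early if nvidia-smi is missing."""
    history = []
    for _ in range(int(seconds / interval)):
        sample = sample_gpus()
        if sample is None:
            break
        history.append(sample)
        time.sleep(interval)
    return history


def gpu_was_used(history, min_util=1):
    """True if any GPU reached at least `min_util` percent in any reading."""
    return any(util >= min_util for sample in history for util, _mem in sample)
```

Starting the automatic annotation and then calling `gpu_was_used(watch(60))` on the host gives a yes/no answer for that minute.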

Our next step was going to be trying a P3 instance, as the documentation suggests (someone please add a link to this in the installation guide). The Nuclio installation instructions badly need improvement too.

Thanks for the help.

nmanovic · 2020-11-25T11:47:19Z

@machinsk, a serverless function with a DL model inside is just a Docker container with some specific bindings to the Nuclio framework and Python code to run the model. If the function itself can run on a GPU, it will run on the GPU. For now, none of the functions in the CVAT repository are optimized to run on a GPU. If somebody can contribute and improve them, we will be more than happy to accept the PR.
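A rough first check of whether a given container can reach a GPU at all is to look for the device nodes and environment variables that the NVIDIA container runtime normally provides. The sketch below is an illustration under that assumption, not part of CVAT or Nuclio; the helper names are hypothetical, and a positive result only shows the GPU is exposed, not that the model code uses it.

```python
import glob
import os


def nvidia_device_nodes(dev_dir="/dev"):
    """List per-GPU device nodes (such as /dev/nvidia0) visible to this process."""
    # Matches nvidia0, nvidia1, ... but not nvidiactl or nvidia-uvm.
    pattern = os.path.join(dev_dir, "nvidia[0-9]*")
    return sorted(glob.glob(pattern))


def gpu_visibility_report(dev_dir="/dev", environ=None):
    """Summarize GPU exposure: device nodes plus the usual visibility variables."""
    environ = os.environ if environ is None else environ
    return {
        "device_nodes": nvidia_device_nodes(dev_dir),
        # Set by the NVIDIA container runtime when GPUs are passed through.
        "NVIDIA_VISIBLE_DEVICES": environ.get("NVIDIA_VISIBLE_DEVICES"),
        # Read by CUDA to restrict which GPUs a process may use.
        "CUDA_VISIBLE_DEVICES": environ.get("CUDA_VISIBLE_DEVICES"),
    }
```

Running `gpu_visibility_report()` inside the function's container and getting an empty `device_nodes` list suggests the container was started without GPU access, in which case no change to the model code would help.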

jahaniam · 2020-12-08T07:01:22Z

@nmanovic I agree that many steps are missing from the documentation for semi-automatic annotation; I had to dig into the code in both cvat and nuclio. After some digging, I was finally able to run the TensorFlow Faster RCNN model on the GPU. I will open a PR soon.

nmanovic · 2021-07-06T09:09:20Z

It looks like the issue was resolved by @jahaniam. I will close it.

nmanovic self-assigned this Nov 25, 2020

jahaniam mentioned this issue Dec 9, 2020

Auto annotation GPU support #2546

Merged

jahaniam mentioned this issue Jan 25, 2021

GPU-Suport: Mask-RCNN + Minor GPU fixes #2714

Merged

nmanovic closed this as completed Jul 6, 2021
