Computer Pointer Controller

Control the computer's mouse pointer with the user's gaze. A deep learning gaze estimation model estimates the direction of the user's eyes, and the mouse pointer position changes accordingly. The gaze estimation model depends on the outputs of three other models: face detection, head pose estimation, and facial landmarks detection. The application therefore integrates all of these models into a single pipeline.
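The dependency between the models can be sketched as a small per-frame function. This is only an illustration, not the code in src/main.py: the function name and the four stage callables are hypothetical, and the stand-in stages below exist only so the sketch runs without OpenVINO. It assumes the gaze stage takes the two eye crops plus the head pose, which is how the description above reads.

```python
def estimate_gaze_for_frame(frame, detect_face, estimate_head_pose,
                            detect_eyes, estimate_gaze):
    """Run one frame through the four stages; return a gaze vector or None."""
    face = detect_face(frame)
    if face is None:
        # No face found: the downstream models have nothing to work on.
        return None
    head_pose = estimate_head_pose(face)       # uses the face crop
    left_eye, right_eye = detect_eyes(face)    # uses the face crop
    # The gaze stage consumes the outputs of the previous stages.
    return estimate_gaze(left_eye, right_eye, head_pose)


# Hypothetical stand-in stages (NOT the real models).
def fake_detect_face(frame):
    return frame.get("face")


def fake_head_pose(face):
    return (0.0, 0.0, 0.0)  # yaw, pitch, roll placeholders


def fake_detect_eyes(face):
    return ("left-eye-crop", "right-eye-crop")


def fake_gaze(left_eye, right_eye, head_pose):
    return (0.1, -0.2, -0.9)  # placeholder gaze vector


gaze = estimate_gaze_for_frame({"face": "face-crop"}, fake_detect_face,
                               fake_head_pose, fake_detect_eyes, fake_gaze)
print(gaze)
```

Passing the stages in as callables keeps the control flow visible; the real project wraps each model in its own class instead.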

Project Set Up and Installation

Step 1. Download and install the OpenVINO Toolkit 2020.1 with all its prerequisites by following the installation guide.

Step 2. Create a virtual environment by running virtualenv venv in the command prompt.

Step 3. Install all the dependencies using pip install -r requirements.txt.

Step 4. Initialize the OpenVINO environment on your local setup with the commands below:

cd C:\Program Files (x86)\IntelSWTools\openvino\bin\
setupvars.bat

Step 5. Download the models using the commands below:

Face Detection Model

python /opt/intel/openvino/deployment_tools/tools/model_downloader/downloader.py --name "face-detection-adas-binary-0001"

Gaze Estimation Model

python /opt/intel/openvino/deployment_tools/tools/model_downloader/downloader.py --name "gaze-estimation-adas-0002"

Facial Landmarks Detection Model

python /opt/intel/openvino/deployment_tools/tools/model_downloader/downloader.py --name "landmarks-regression-retail-0009"

Head Pose Estimation Model

python /opt/intel/openvino/deployment_tools/tools/model_downloader/downloader.py --name "head-pose-estimation-adas-0001"
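If you prefer to fetch all four models in one go, the commands above can be generated from a list of model names. The sketch below is a convenience wrapper, not part of the project: the helper names are hypothetical, the downloader path is copied from the commands above and may differ on your installation, and nothing is executed unless you pass run=True.

```python
import subprocess
import sys

# Path copied from the commands above; adjust it to your OpenVINO install.
DOWNLOADER = "/opt/intel/openvino/deployment_tools/tools/model_downloader/downloader.py"

MODEL_NAMES = (
    "face-detection-adas-binary-0001",
    "gaze-estimation-adas-0002",
    "landmarks-regression-retail-0009",
    "head-pose-estimation-adas-0001",
)


def build_download_commands(downloader=DOWNLOADER, names=MODEL_NAMES):
    """Return one argument list per model, mirroring the commands above."""
    return [[sys.executable, downloader, "--name", name] for name in names]


def download_all(run=False):
    """Print each command; execute it only when run=True."""
    commands = build_download_commands()
    for command in commands:
        print(" ".join(command))
        if run:
            subprocess.run(command, check=True)
    return commands


download_all(run=False)  # dry run: only prints the four commands
```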

Demo

Use the following command to run the app:

python src/main.py -mf models/intel/face-detection-adas-binary-0001/FP32-INT1/face-detection-adas-binary-0001.xml -ml models/intel/landmarks-regression-retail-0009/FP32/landmarks-regression-retail-0009.xml -mh models/intel/head-pose-estimation-adas-0001/FP32/head-pose-estimation-adas-0001.xml -mg models/intel/gaze-estimation-adas-0002/FP32/gaze-estimation-adas-0002.xml -i bin/demo.mp4 -f ff fl fh fg

Output

Documentation

Command Line Argument Information

mf : Path to the xml file of the face detection model
ml : Path to the xml file of the landmark regression model
mh : Path to the xml file of the head pose estimation model
mg : Path to the xml file of the gaze estimation model
i : Path to the input video file, or cam to use the webcam
f (Optional): To see a preview video in a separate window, specify one or more of the flags ff, fl, fh, fg
pt (Optional): Confidence threshold for face detection, default=0.6
d (Optional): Device to run inference on; can be CPU, GPU, FPGA, or MYRIAD, default=CPU
o : Path to the output folder where the results are stored
b : Turns on benchmarking mode (the benchmark commands pass it as a bare -b)
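The arguments above could be declared with argparse roughly as follows. This is a sketch under assumptions rather than the parser in src/main.py: treating -b as a store_true switch, letting -f take one or more values, making -o optional, and the help strings are all guesses based on the list above and on the example commands.

```python
import argparse


def build_parser():
    """Hypothetical parser mirroring the documented options."""
    parser = argparse.ArgumentParser(description="Computer Pointer Controller")
    parser.add_argument("-mf", required=True, help="xml file of the face detection model")
    parser.add_argument("-ml", required=True, help="xml file of the landmark regression model")
    parser.add_argument("-mh", required=True, help="xml file of the head pose estimation model")
    parser.add_argument("-mg", required=True, help="xml file of the gaze estimation model")
    parser.add_argument("-i", required=True, help="input video file, or cam for webcam")
    # Assumption: one or more preview flags may follow -f.
    parser.add_argument("-f", nargs="+", default=[], choices=["ff", "fl", "fh", "fg"],
                        help="preview flags")
    parser.add_argument("-pt", type=float, default=0.6,
                        help="face detection confidence threshold")
    parser.add_argument("-d", default="CPU", help="inference device")
    # Assumption: -o is optional because the demo command omits it.
    parser.add_argument("-o", default=None, help="output folder for results")
    # Assumption: -b is a bare switch, as in the benchmark commands.
    parser.add_argument("-b", action="store_true", help="benchmarking mode")
    return parser


args = build_parser().parse_args([
    "-mf", "face.xml", "-ml", "landmarks.xml", "-mh", "head_pose.xml",
    "-mg", "gaze.xml", "-i", "bin/demo.mp4", "-b", "-f", "ff", "fg",
])
print(args.f, args.pt, args.d, args.b)
```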

Project Structure

models: This folder contains the models in IR format, downloaded from the OpenVINO Model Zoo
src: This folder contains the model files, the pipeline file (main.py), and utilities
- face_detection_model.py
- gaze_estimation_model.py
- landmark_detection_model.py
- head_pose_estimation_model.py
- main.py runs the complete pipeline of the project. It holds an object of each of the other classes in the folder
- mouse_controller.py is a utility that moves the mouse cursor based on the coordinates returned by the predict method of the gaze_estimation_model class
- input_feeder.py is a utility that loads a local video or the webcam feed
bin: This folder contains the demo.mp4 file, which can be used to test the models
results: This folder stores the output video and the benchmark results
requirements.txt: The project dependencies
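To make the role of mouse_controller.py more concrete, here is a minimal sketch of how gaze coordinates could be turned into a new pointer position. It is not the code in mouse_controller.py: the function name, the scale parameter, the flipped y axis (assuming gaze y points up while screen y points down), and the clamping to the screen are all illustrative assumptions, and the sketch only computes a position instead of moving the real cursor.

```python
def next_pointer_position(current, gaze_xy, screen_size, scale=100):
    """Return the pointer position after applying a gaze offset.

    current     -- (x, y) pointer position in pixels
    gaze_xy     -- (x, y) gaze coordinates from the gaze model
    screen_size -- (width, height) of the screen in pixels
    scale       -- hypothetical pixels moved per unit of gaze
    """
    x, y = current
    gx, gy = gaze_xy
    width, height = screen_size
    new_x = x + gx * scale
    new_y = y - gy * scale  # assumption: positive gaze y means "look up"
    # Keep the pointer inside the screen.
    new_x = min(max(new_x, 0), width - 1)
    new_y = min(max(new_y, 0), height - 1)
    return (round(new_x), round(new_y))


print(next_pointer_position((100, 100), (0.5, 0.5), (1920, 1080)))
```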

Benchmarks

I measured inference time, model loading time, and frames per second (FPS) at FP16, FP32, and FP16-INT8 precision for all the models except the face detection model, which was only available in FP32-INT1 precision. Use the commands below to get the results for each precision.

FP32

python src/main.py -mf models/intel/face-detection-adas-binary-0001/FP32-INT1/face-detection-adas-binary-0001.xml -ml models/intel/landmarks-regression-retail-0009/FP32/landmarks-regression-retail-0009.xml -mh models/intel/head-pose-estimation-adas-0001/FP32/head-pose-estimation-adas-0001.xml -mg models/intel/gaze-estimation-adas-0002/FP32/gaze-estimation-adas-0002.xml -i bin/demo.mp4 -o results/FP32/ -b -f ff fl fh fg

FP16

python src/main.py -mf models/intel/face-detection-adas-binary-0001/FP32-INT1/face-detection-adas-binary-0001.xml -ml models/intel/landmarks-regression-retail-0009/FP16/landmarks-regression-retail-0009.xml -mh models/intel/head-pose-estimation-adas-0001/FP16/head-pose-estimation-adas-0001.xml -mg models/intel/gaze-estimation-adas-0002/FP16/gaze-estimation-adas-0002.xml -i bin/demo.mp4 -o results/FP16/ -b -f ff fl fh fg

FP16-INT8

python src/main.py -mf models/intel/face-detection-adas-binary-0001/FP32-INT1/face-detection-adas-binary-0001.xml -ml models/intel/landmarks-regression-retail-0009/FP16-INT8/landmarks-regression-retail-0009.xml -mh models/intel/head-pose-estimation-adas-0001/FP16-INT8/head-pose-estimation-adas-0001.xml -mg models/intel/gaze-estimation-adas-0002/FP16-INT8/gaze-estimation-adas-0002.xml -i bin/demo.mp4 -o results/FP16-INT8/ -b -f ff fl fh fg
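The three metrics reported in this section can be collected with a small timing harness like the one below. It is a sketch of the general approach, not the benchmarking code in src/main.py: load_models and process_frame are hypothetical callables standing in for the real model loading and per-frame pipeline, and computing FPS as frames divided by total inference time is an assumption.

```python
import time


def benchmark(load_models, process_frame, frames):
    """Measure model load time, total inference time, and FPS."""
    start = time.perf_counter()
    models = load_models()
    load_time = time.perf_counter() - start

    frame_count = 0
    start = time.perf_counter()
    for frame in frames:
        process_frame(models, frame)
        frame_count += 1
    inference_time = time.perf_counter() - start

    # Guard against a zero duration on very fast dummy workloads.
    fps = frame_count / inference_time if inference_time > 0 else float("inf")
    return {
        "model_load_time_s": load_time,
        "inference_time_s": inference_time,
        "frames": frame_count,
        "fps": fps,
    }


# Dummy workload so the sketch runs without OpenVINO or a video file.
stats = benchmark(
    load_models=lambda: {"loaded": True},
    process_frame=lambda models, frame: sum(range(1000)),
    frames=range(5),
)
print(stats)
```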

Results

Metrics	FP16	FP16-INT8	FP32
Inference time	13.9s	13.4s	13.4s
Model load time	0.45s	0.67s	0.42s
FPS	4.24	4.40	4.40

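FP32 gives the best or joint-best result on every metric: it ties with FP16-INT8 on inference time (13.4s) and FPS (4.40), and it has the lowest model load time (0.42s). The differences between the precisions are small.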
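The comparison can be checked directly from the numbers in the table. The snippet below uses only the values reported above; the dictionary layout and helper name are illustrative. It also multiplies FPS by inference time for each precision, which comes out at about 59 frames in every case. That is consistent with FPS having been computed as frames divided by inference time, although this README does not state how FPS was computed.

```python
# Values copied from the results table above.
results = {
    "FP16": {"inference_time_s": 13.9, "model_load_time_s": 0.45, "fps": 4.24},
    "FP16-INT8": {"inference_time_s": 13.4, "model_load_time_s": 0.67, "fps": 4.40},
    "FP32": {"inference_time_s": 13.4, "model_load_time_s": 0.42, "fps": 4.40},
}

# Times should be low, FPS should be high.
LOWER_IS_BETTER = {"inference_time_s": True, "model_load_time_s": True, "fps": False}


def best_precisions(metric):
    """Return every precision that achieves the best value for a metric."""
    values = {name: row[metric] for name, row in results.items()}
    pick = min if LOWER_IS_BETTER[metric] else max
    target = pick(values.values())
    return sorted(name for name, value in values.items() if value == target)


for metric in LOWER_IS_BETTER:
    print(metric, "->", best_precisions(metric))

# Frame count implied by FPS x inference time, per precision.
implied_frames = {name: round(row["fps"] * row["inference_time_s"])
                  for name, row in results.items()}
print(implied_frames)
```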
