Convert the OpenFace model from Lua torch to Pytorch #462

Merged · 6 commits · Oct 4, 2024

Conversation

@T0ny8576 (Contributor) commented Sep 20, 2024

What does this PR do?

This PR rewrites the OpenFace model in PyTorch and provides scripts to convert the trained model weights nn4.small2.v1.t7 to a PyTorch state_dict. It also provides examples of comparing images and training classifiers with the new model.

Summary of Changes:

  • openface/openfacenet.py: Add the PyTorch model definition (a minimal loading sketch follows this list)
  • openface/align_dlib.py: Support dlib's CNN face detector; make the upsampling count a tunable parameter
  • models/get-models.sh: Download dlib's mmod_human_face_detector model and the converted PyTorch OpenFace model weights
  • batch-represent/batch_represent.py: Generate representations for an image dataset and store the data and labels in .csv files
  • demos/compare_new.py, demos/classifier_new.py: Add new examples of using the PyTorch model
  • conversion/test_luatorch.lua, conversion/convert_to_pytorch.py: Add scripts to convert the trained Lua Torch weights to a PyTorch state_dict (not needed to use the PyTorch model)
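
For orientation, here is a minimal, hedged sketch of how the converted weights would be loaded and used for inference. The class name OpenFaceNet and the weight file path are illustrative assumptions; the actual names are defined in openface/openfacenet.py and models/get-models.sh.

import torch
from openface.openfacenet import OpenFaceNet  # assumed class name, for illustration only

model = OpenFaceNet()
state_dict = torch.load("models/openface/nn4.small2.v1.pt", map_location="cpu")  # assumed path
model.load_state_dict(state_dict)
model.eval()

# The original OpenFace pipeline feeds 96x96 aligned RGB faces and
# produces 128-dimensional embeddings.
face = torch.randn(1, 3, 96, 96)
with torch.no_grad():
    embedding = model(face)  # expected shape: (1, 128)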

Where should the reviewer start?

  • Build a Docker image from the Dockerfile in the project root directory
sudo docker build . -t newface
  • Run a Docker container with GPU enabled
sudo docker run --rm -it --gpus all newface

How should this PR be tested?

  • Run the comparison demo
python3 demos/compare_new.py images/examples/{lennon*,clapton*}
  • Generate representations in batches for an image dataset, e.g. a raw image directory data/mydataset/raw/
python3 batch-represent/batch_represent.py -i data/mydataset/raw/ -o data/mydataset/feats/ --align_out data/mydataset/aligned/
  • Train a new SVM classifier on the generated representations (a rough sketch of this step follows this list)
python3 demos/classifier_new.py train data/mydataset/feats/
  • Run the new classifier
python3 demos/classifier_new.py infer data/mydataset/feats/classifier.pkl data/mydataset/test/*
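
Conceptually, the classifier step consumes the representations written by batch_represent.py. The following is a hedged sketch of that idea using scikit-learn; the file names reps.csv and labels.csv and the exact CSV layout are assumptions and may differ from what classifier_new.py actually reads.

import pickle
import numpy as np
from sklearn.preprocessing import LabelEncoder
from sklearn.svm import SVC

# 128-D embeddings, one row per aligned image (assumed file name and layout).
reps = np.loadtxt("data/mydataset/feats/reps.csv", delimiter=",")
# One class label per row, aligned with reps.csv (assumed file name and layout).
labels = np.loadtxt("data/mydataset/feats/labels.csv", delimiter=",", dtype=str, usecols=0)

encoder = LabelEncoder().fit(labels)
clf = SVC(C=1.0, kernel="linear", probability=True)
clf.fit(reps, encoder.transform(labels))

with open("data/mydataset/feats/classifier.pkl", "wb") as f:
    pickle.dump((encoder, clf), f)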

Note that only the new PyTorch model and the new examples are expected to work; backward compatibility with the previous version has not been tested carefully yet. Model training and the web demo are not supported in this update.

Questions:

  • Do the docs need to be updated?

Yes

  • Does this PR add new (Python) dependencies?

Yes

@bamos (Collaborator) commented Sep 20, 2024

Amazing! It looks really good so far

@T0ny8576 (Contributor, Author) commented Sep 20, 2024 via email

@brmarkus

Please don't make an NVIDIA GPU a hard requirement (except perhaps for (re-)training)...

@jaharkes (Member)

The code changes still seem to support the CPU: if not args.cpu is checked in various places to skip moving data to the GPU. Not sure whether that path was actually tested, though.
The only other CUDA-related change seems to be switching the base image in the Dockerfile from a very old Ubuntu 14.04 image to a more recent nvidia/cuda container. I assume that container will still run on a machine without a GPU; you may have to force the --cpu flag.
An alternative approach could be to use torch.cuda.is_available() in more places so that the --cpu argument isn't really necessary.
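
A minimal sketch of that alternative (the argument name and the dummy model are only illustrative):

import argparse
import torch

parser = argparse.ArgumentParser()
parser.add_argument("--cpu", action="store_true", help="force CPU even when CUDA is available")
args = parser.parse_args()

# Pick the device automatically; --cpu only overrides an available GPU,
# and machines without CUDA fall back to the CPU without any flag.
device = torch.device("cuda" if torch.cuda.is_available() and not args.cpu else "cpu")

# Any model and input batch would then be moved with .to(device).
model = torch.nn.Linear(128, 10).to(device)
batch = torch.randn(4, 128, device=device)
print(device, model(batch).shape)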

@brmarkus

Sounds great!
Using such a --cpu flag might need to be documented in prominent places, ideally as part of this change request.

@T0ny8576 marked this pull request as ready for review on September 26, 2024
@teiszler merged commit 0514752 into cmusatyalab:master on Oct 4, 2024
@crsimx commented Oct 19, 2024

Hello all, @T0ny8576
I noticed something strange with the latest master, or maybe I'm misunderstanding it.

From Docker, everything works perfectly with

python3 batch-represent/batch_represent.py -i data/mydataset/raw/ -o data/mydataset/feats/ --align_out data/mydataset/aligned/

and a python process of type "C" (compute) shows up in nvidia-smi on Ubuntu while it runs.

But if I install it manually and run it locally, it never uses the GPU, no GPU process is created, and it is very slow. Does that mean it is running on the CPU only, even though I never pass the --cpu flag?

How can I enable GPU/CUDA without Docker?
Thanks in advance!

@brmarkus

Can you first describe exactly what you are using and doing?

The Dockerfile is now based on a fairly powerful nvidia/cuda base image; see "https://github.com/cmusatyalab/openface/blob/master/Dockerfile":

FROM nvidia/cuda:11.8.0-cudnn8-devel-ubuntu22.04

If you only follow the installation steps from inside the Dockerfile, you miss the central setup provided by the base container (the CUDA toolkit and cuDNN).

I'm not sure I traced it correctly to "https://gitlab.com/nvidia/container-images/cuda/blob/master/doc/supported-tags.md#cuda-1180", but the link "https://gitlab.com/nvidia/container-images/cuda/blob/master/dist/11.8.0/ubuntu22.04/devel/cudnn8/Dockerfile" seems to be outdated for that GitLab repo; maybe it was already migrated a couple of years ago.
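
As a quick check outside Docker (a hedged sketch, assuming a local PyTorch install): if the first line prints False, the local PyTorch build is CPU-only or cannot see the NVIDIA driver, and the scripts will fall back to the CPU.

import torch

print(torch.cuda.is_available())   # False -> CPU-only build or driver not visible
print(torch.version.cuda)          # CUDA version PyTorch was built against (None for CPU-only builds)
if torch.cuda.is_available():
    print(torch.cuda.get_device_name(0))  # name of the local GPU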
