KeyPosS: Plug-and-Play Facial Landmark Detection through GPS-Inspired True-Range Multilateration (ACM MM 2023)

KeyPosS is a facial landmark detection method inspired by GPS positioning. It addresses the quantization error and computational overhead of heatmap-based methods, as well as the accuracy limitations of direct coordinate regression, offering an approach that is both efficient and accurate.

KeyPosS uses a fully convolutional network to predict distance maps between each point of interest (POI) on a face and multiple anchor points. These anchor-to-POI distances are then decoded into precise landmark coordinates via true-range multilateration, much as a GPS receiver fixes its position from ranges to satellites.
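To make the decoding step concrete, here is a minimal sketch of true-range multilateration in 2D. It is illustrative only, not the paper's exact implementation: the function name `multilaterate` and the toy anchors are our own. It linearizes the range equations ||p - a_i||² = d_i² by subtracting the first equation, then solves the resulting linear system in least squares.

```python
import numpy as np

def multilaterate(anchors, dists):
    """Recover a 2D point from its distances to known anchor points.

    Subtracting the range equation of anchors[0] from each remaining
    equation cancels the quadratic term ||p||^2, leaving a linear
    system A p = b that is solved in least squares.
    """
    a0, d0 = anchors[0], dists[0]
    A = 2.0 * (anchors[1:] - a0)                        # shape (n-1, 2)
    b = (np.sum(anchors[1:] ** 2, axis=1) - np.sum(a0 ** 2)
         - dists[1:] ** 2 + d0 ** 2)                    # shape (n-1,)
    p, *_ = np.linalg.lstsq(A, b, rcond=None)
    return p

# Toy example: four anchors at the corners of a square.
anchors = np.array([[0.0, 0.0], [4.0, 0.0], [0.0, 4.0], [4.0, 4.0]])
target = np.array([1.2, 2.8])
dists = np.linalg.norm(anchors - target, axis=1)
print(multilaterate(anchors, dists))  # ≈ [1.2, 2.8]
```

With noise-free distances the system is consistent and the least-squares solution recovers the point exactly; with predicted (noisy) distance maps, the same solve gives the best-fit landmark position.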


Figure 1: A comparison of four decoding methods. Our KeyPosS excels with minimal overhead.

Figure 2: The KeyPosS pipeline, encompassing the Distance Encoding Model, Station Anchor Sampling Strategy, and True-range Multilateration. It is suitable for any distance encoding-based approach.

Key Features

  • GPS-inspired: Applies proven concepts from GPS technology to facial analysis, enabling more precise localization.

  • True-Range Multilateration: Decodes predicted distances into landmark coordinates through multilateration with anchoring points.

  • Versatile: Can be built upon any distance encoding-based model for enhanced performance.

  • Efficient: Avoids computational burdens of heatmap-based methods.

For more details, please see our ACM MM 2023 paper.

Performance Overview


Table 1: A performance comparison with State-of-the-Art methods. Results are presented in NME (%), with top results in bold.

Quick Start Guide

Get started with the KeyPosS facial landmark detection system in a few simple steps:

1. Installation:

  • Environment Setup: Begin by setting up the necessary environment. For this, refer to the instructions provided by mmpose.

  • Datasets: Our experiments utilize the COCO, WFLW, 300W, COFW, and AFLW datasets.

2. Training:

  • Pre-trained Models: We leverage ImageNet models from mmpose as our starting point.

  • Training Command: To start the training process, execute the following command:

    CUDA_VISIBLE_DEVICES=0,1,2,3 sh tools/dist_train.sh \
        configs/face/2d_kpt_sview_rgb_img/topdown_heatmap/coco_wholebody_face/hrnetv2_w18_coco_wholebody_face_256x256_dark.py \
        4 \
        --work-dir exp/exp889

3. Evaluation:

Step 1: Obtain the Models

  • Download: Retrieve the pre-trained and trained models for each dataset and heatmap resolution from Google Drive.

Step 2: Model Setup

  • Placement: After downloading, move the "exp" directory to the root of the codebase.

Step 3: Resolution Configuration

  • Supported Resolutions: The models in the "exp" directory cover five heatmap resolutions: 64, 32, 16, 8, and 4.

  • Configuration: Before running a test script, set the "data_cfg/heatmap_size" field in the configuration file to your chosen resolution.
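As a sketch, the relevant fragment of an mmpose-style config looks like the following; exact field names can vary between mmpose versions, so treat this as illustrative rather than a verbatim excerpt of the repo's config:

```python
# Illustrative fragment of an mmpose-style config file.
# Only heatmap_size needs to change between evaluation runs.
data_cfg = dict(
    image_size=[256, 256],   # network input resolution
    heatmap_size=[16, 16],   # set both entries to 64, 32, 16, 8, or 4
)
```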

Step 4: Test Execution

  • Script Selection: Based on your chosen resolution, run the appropriate test script:

    • run_test_64.sh
    • run_test_32.sh
    • run_test_16.sh
    • run_test_8.sh
    • run_test_4.sh

    These scripts evaluate the model's accuracy across the WFLW, COCO, 300W, AFLW, and COFW face datasets.

Step 5: Evaluation Command

  • Command Execution: To run the evaluation (shown here for WFLW), execute the following command:

    CUDA_VISIBLE_DEVICES=0,1,2,3 sh tools/dist_test.sh \
        configs/face/2d_kpt_sview_rgb_img/topdown_heatmap/wflw/hrnetv2_w18_wflw_256x256_dark.py \
        exp/exp_v1.3.0/best_NME_epoch_60.pth \
        4 

Acknowledgment

Our work is primarily based on mmpose. We express our gratitude to the authors for their invaluable contributions.

Citation

If you find this work useful, please cite our paper:

@inproceedings{bao2023keyposs,
  title={KeyPosS: Plug-and-Play Facial Landmark Detection through GPS-Inspired True-Range Multilateration},
  author={Bao, Xu and Cheng, Zhi-Qi and He, Jun-Yan and Xiang, Wangmeng and Li, Chenyang and Sun, Jingdong and Liu, Hanbing and Liu, Wei and Luo, Bin and Geng, Yifeng and others},
  booktitle={Proceedings of the 31st ACM International Conference on Multimedia},
  pages={5746--5755},
  year={2023}
}

License

This repository is licensed under the Apache 2.0 license. For more details, please refer to the LICENSE file.
