
This repo is the official implementation of "Generating Human Motion in 3D Scenes from Text Descriptions".

arxiv | project page

*(pipeline overview figure)*

## Installation

```shell
conda create -n most python=3.9
conda activate most
# install pytorch
conda install pytorch==1.13.0 torchvision==0.14.0 torchaudio==0.13.0 pytorch-cuda=11.6 -c pytorch -c nvidia
# install pytorch3d
pip install pytorch3d-0.7.2-cp39-cp39-linux_x86_64.whl
# install other requirements (filtering out option lines, comments, and blank lines)
cat requirements.txt | sed -e '/^\s*-.*$/d' -e '/^\s*#.*$/d' -e '/^\s*$/d' | awk '{split($0, a, "#"); if (length(a) > 1) print a[1]; else print $0;}' | awk '{split($0, a, "@"); if (length(a) > 1) print a[2]; else print $0;}' | xargs -n 1 pip install
# install the MoST lib
pip install -e . --no-build-isolation --no-deps
```
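The long pipe above filters `requirements.txt` before installing: it drops option lines (starting with `-`), comment lines, and blank lines, keeps only the part before any inline `#`, and, for `name @ url` entries, installs the URL itself. A rough stdlib Python equivalent of that filtering logic (illustration only, not part of the repo):

```python
def filter_requirements(text: str) -> list[str]:
    """Mimic the sed/awk pipeline: return one pip-installable spec per line."""
    specs = []
    for line in text.splitlines():
        stripped = line.strip()
        # drop option lines ("-e .", "--find-links ..."), comments, and blanks
        if not stripped or stripped.startswith(("-", "#")):
            continue
        # keep only the part before an inline comment
        stripped = stripped.split("#", 1)[0].strip()
        # "name @ https://..." entries: install the URL itself
        if "@" in stripped:
            stripped = stripped.split("@", 1)[1].strip()
        if stripped:
            specs.append(stripped)
    return specs
```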

**NOTE:**

1. pytorch3d download link.
2. If you want to run stage 1, please uncomment `shapely`, `tenacity`, `openai`, and `scikit-learn` in `requirements.txt`.
## Data preparation

### ScanNet dataset

1. Download ScanNet v2 from link. We only need the files ending with `*_vh_clean_2.ply`, `*_vh_clean.aggregation.json`, and `*_vh_clean_2*segs.json`.
2. Link it to `data/`:
```shell
mkdir data
ln -s /path/to/scannet data/ScanNet
```
3. Preprocess by running:
```shell
python tools/preprocess_scannet.py
```

Files will be saved in `data/scannet_preprocess`.
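The `*_vh_clean_2.ply` meshes begin with an ASCII header (even when the payload is binary) that declares the element counts. A small stdlib sketch that reads the vertex count from such a header, for sanity-checking downloads (illustration only, not the repo's preprocessing code):

```python
def ply_vertex_count(path: str) -> int:
    """Parse a PLY header and return the declared vertex count."""
    with open(path, "rb") as f:
        # every PLY file starts with the magic line "ply"
        if f.readline().strip() != b"ply":
            raise ValueError("not a PLY file")
        for raw in f:
            line = raw.strip()
            # "element vertex N" declares the number of vertices
            if line.startswith(b"element vertex"):
                return int(line.split()[-1])
            if line == b"end_header":
                break
    raise ValueError("no vertex element found")
```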

### HUMANISE dataset

1. Download the HUMANISE dataset from link.
2. Link it to `data/`:
```shell
mkdir data
ln -s /path/to/humanise data/HUMANISE
```

### SMPLX models

1. Download the SMPLX models from link.
2. Put the `smplx` folder under the `data/smpl_models` folder:
```shell
mkdir data/smpl_models
mv smplx data/smpl_models/
```
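After this step, the layout should look roughly like the following (the exact file names inside `smplx/` depend on the SMPL-X release you downloaded):

```
data/
└── smpl_models/
    └── smplx/
        ├── SMPLX_NEUTRAL.npz
        └── ...
```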

## Pretrained models

1. The pretrained weights are shared at link. Download and unzip them, then put the `most_release` folder under the `out` folder:
```shell
mv most_release out/release
```

## Stage 1: locating the target object

### Object bounding box detection

Here we use the ground-truth object detection results for ScanNet scenes (provided in the HUMANISE dataset). If you want to test on a new scene, please follow GroupFree3D to obtain the object bounding boxes.
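Whether the boxes come from the HUMANISE ground truth or from GroupFree3D, each detection reduces to a center and a size. A minimal pure-Python sketch of fitting an axis-aligned box to an object's points (illustration only; the repo's actual box format may differ):

```python
def aabb(points):
    """Return (center, size) of the axis-aligned bounding box of 3D points."""
    xs, ys, zs = zip(*points)
    lo = (min(xs), min(ys), min(zs))
    hi = (max(xs), max(ys), max(zs))
    center = tuple((l + h) / 2 for l, h in zip(lo, hi))
    size = tuple(h - l for l, h in zip(lo, hi))
    return center, size
```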

### Inferring the target object

```shell
python tools/locate_target.py -c configs/locate/locate_chatgpt.yaml
```

We use the Azure OpenAI service; please refer to this link and this link.
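Conceptually, this stage asks an LLM which detected object the text description refers to. A hypothetical sketch of how such a query could be composed (the helper name and prompt wording are assumptions, not the repo's actual prompt):

```python
def build_locate_prompt(description, object_labels):
    """Compose a query asking the LLM which scene object a text refers to.

    Hypothetical helper for illustration; the real prompt lives in the repo's
    stage-1 code and configs/locate/locate_chatgpt.yaml.
    """
    labels = ", ".join(sorted(set(object_labels)))
    return (
        f"Objects detected in the scene: {labels}.\n"
        f"Text description: {description}\n"
        "Reply with the single object label the description refers to."
    )

# The actual request goes through the Azure OpenAI endpoint configured for the
# repo; with the `openai` client it would resemble (env var names are assumptions):
#   from openai import AzureOpenAI
#   client = AzureOpenAI(
#       azure_endpoint=os.environ["AZURE_OPENAI_ENDPOINT"],
#       api_key=os.environ["AZURE_OPENAI_API_KEY"],
#       api_version="2024-02-01",
#   )
```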

## Stage 2: generating human motions

### Testing

#### Generating results

```shell
python tools/generate_results.py -c configs/test/generate.yaml
```

The results will be saved in `out/test`.

#### Evaluation

```shell
python tools/evaluate_results.py -c configs/test/evaluate.yaml
```

## Citation

```bibtex
@inproceedings{cen2024text_scene_motion,
  title={Generating Human Motion in 3D Scenes from Text Descriptions},
  author={Cen, Zhi and Pi, Huaijin and Peng, Sida and Shen, Zehong and Yang, Minghui and Shuai, Zhu and Bao, Hujun and Zhou, Xiaowei},
  booktitle={CVPR},
  year={2024}
}
```