Hybrid Transformer Based Feature Fusion for Self-Supervised Monocular Depth Estimation

Advances in Image Manipulation, European Conference on Computer Vision Workshops (ECCVW) 2022

Authors: Snehal Singh Tomar, Maitreya Suin, and A.N. Rajagopalan

Paper: ECCVW 2022 arXiv preprint

Setup

Our setup for this project entailed the following:

CUDA 10.0, cuDNN 7.5.0, Python 3.6, Pytorch 0.4.1, Torchvision 0.2.1, OpenCV 3.3.1, and Ubuntu 20.04.

Upon cloning the repository, please place the KITTI Dataset in "kitti_data/" before running any experiments.

Training

Please run:

python train_w_lrl_hrl.py --model_name <desired_model_name> --png

The "--png" option may be omitted if the KITTI Dataset has been downloaded in ".jpg" file format. The model files will be saved at "model_under_trg/desired_model_name/" by default.

Inference

To generate predicted depth maps, run:

python export_gt_depth.py --data_path kitti_data --split eigen

To evaluate a particular model, run:

python evaluate_depth.py --load_weights_folder <path_to_model_weights> --eval_mono

Bibtex

If you use this code, please cite our paper:

@inproceedings{tomar2022hybrid,
  title={Hybrid Transformer Based Feature Fusion for Self-Supervised Monocular Depth Estimation},
  author={Tomar, Snehal Singh and Suin, Maitreya and Rajagopalan, A.N.},
  booktitle={Advances in Image Manipulation, European Conference on Computer Vision Workshops (ECCVW) 2022},
  year={2022}
}

License

This code is for non-commercial use only. Please refer to our License file for more.

Acknowledgement

This implementation borrows heavily from Monodepth2, and draws inspiration from the DIFFNet.

Name		Name	Last commit message	Last commit date
Latest commit History 26 Commits
__pycache__		__pycache__
assets		assets
datasets		datasets
networks		networks
splits		splits
LICENSE		LICENSE
README.md		README.md
evaluate_depth_vit2Light_resnet.py		evaluate_depth_vit2Light_resnet.py
evaluate_pose.py		evaluate_pose.py
export_gt_depth.py		export_gt_depth.py
kitti_utils.py		kitti_utils.py
layers.py		layers.py
options.py		options.py
pose.pth		pose.pth
pose_encoder.pth		pose_encoder.pth
test_simple_vit2light_resnet.py		test_simple_vit2light_resnet.py
train_w_lrl_hrl.py		train_w_lrl_hrl.py
trainer_enc_lrl_hrl.py		trainer_enc_lrl_hrl.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Hybrid Transformer Based Feature Fusion for Self-Supervised Monocular Depth Estimation

Setup

Training

Inference

Bibtex

License

Acknowledgement

About

Releases

Packages

Languages

License

snehalstomar/Hybrid-Transformer-based-Self-Supervised-Monocular-Depth-Estimation

Folders and files

Latest commit

History

Repository files navigation

Hybrid Transformer Based Feature Fusion for Self-Supervised Monocular Depth Estimation

Setup

Training

Inference

Bibtex

License

Acknowledgement

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages