BirdSAT: Cross-View Contrastive Masked Autoencoders for Bird Species Classification and Mapping

Srikumar Sastry, Subash Khanal, Aayush Dhakal, Di Huang, Nathan Jaocbs

🦢 Dataset Released: Cross-View iNAT Birds 2021

This cross-view birds species dataset consists of paired ground-level bird images and satellite images, along with meta-information associated with the iNaturalist-2021 dataset.

Satellite images along with meta-information - Link

iNaturalist Images - Link

Computer Vision Tasks

Fine-Grained image classification
Satellite-to-bird image retrieval
Bird-to-satellite image retrieval
Geolocalization of Bird Species

An example of task 3 is shown below:

👨‍💻 Getting Started

Setting up

Clone this repository:

git clone https://github.com/mvrl/BirdSAT.git

Clone the Remote-Sensing-RVSA repository inside BirdSAT:

cd BirdSAT
git clone https://github.com/ViTAE-Transformer/Remote-Sensing-RVSA.git

Append the code for CVMMAE present in utils_model/CVMMAE.py to the file present in Remote-Sensing-RVSA/MAEPretrain_SceneClassification/models_mae_vitae.py
Download pretrained satellite image encoder from - Link and place inside folder pretrained_models. You might get an error while loading this model. You need to set the option kernel=3 in the file Remote-Sensing-RVSA/MAEPretrain_SceneClassification/models_mae_vitae.py in the class MaskedAutoencoderViTAE.
Download all datasets, unzip them and place inside folder data.

Installing Required Packages

There are two options to setup your environment to be able to run all the functions in the repository:

Using Dockerfile provided in the repository to create a docker image with all required packages:
```
docker build -t <your-docker-hub-id>/birdsat .
```

Creating conda Environment with all required packages:

conda create -n birdsat python=3.10 && \
conda activate birdsat && \
pip install requirements.txt

Additionally, we have hosted a pre-built docker image on docker hub with tag srikumar26/birdsat:latest for use.

🔥 Training Models

Setup all the parameters of interest inside config.py before launching the training script.
Run pre-training by calling:
```
python pretrain.py
```
Run fine-tuning by calling:
```
python finetune.py
```

❄️ Pretrained Models

Download pretrained models from the given links below:

Model Type	Download Url
CVE-MAE	Link
CVE-MAE-Meta	Link
CVM-MAE	Link
CVM-MAE-Meta	Link

📑 Citation

@inproceedings{sastry2024birdsat,
  title={BirdSAT: Cross-View Contrastive Masked Autoencoders for Bird Species Classification and Mapping},
  author={Srikumar, Sastry and Subash, Khanal and Aayush, Dhakal and Huang, Di and Nathan, Jacobs},
  booktitle={Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision},
  year={2024}
}

🔍 Additional Links

Check out our lab website for other interesting works on geospatial understanding and mapping;

Multi-Modal Vision Research Lab (MVRL) - Link
Related Works from MVRL - Link

Name		Name	Last commit message	Last commit date
Latest commit History 41 Commits
.github/workflows		.github/workflows
Remote-Sensing-RVSA @ c73cd03		Remote-Sensing-RVSA @ c73cd03
data		data
imgs		imgs
pretrained_models		pretrained_models
utils_model		utils_model
.gitignore		.gitignore
.gitmodules		.gitmodules
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
__init__.py		__init__.py
config.py		config.py
datasets.py		datasets.py
finetune.py		finetune.py
models.py		models.py
pretrain.py		pretrain.py
requirements.txt		requirements.txt
retrieval_eval.py		retrieval_eval.py
utils.py		utils.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

BirdSAT: Cross-View Contrastive Masked Autoencoders for Bird Species Classification and Mapping

Srikumar Sastry, Subash Khanal, Aayush Dhakal, Di Huang, Nathan Jaocbs

🦢 Dataset Released: Cross-View iNAT Birds 2021

Computer Vision Tasks

👨‍💻 Getting Started

Setting up

Installing Required Packages

🔥 Training Models

❄️ Pretrained Models

📑 Citation

🔍 Additional Links

About

Releases

Packages

Languages

License

mvrl/BirdSAT

Folders and files

Latest commit

History

Repository files navigation

BirdSAT: Cross-View Contrastive Masked Autoencoders for Bird Species Classification and Mapping

Srikumar Sastry, Subash Khanal, Aayush Dhakal, Di Huang, Nathan Jaocbs

🦢 Dataset Released: Cross-View iNAT Birds 2021

Computer Vision Tasks

👨‍💻 Getting Started

Setting up

Installing Required Packages

🔥 Training Models

❄️ Pretrained Models

📑 Citation

🔍 Additional Links

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages