Vision
The largest collection of PyTorch image encoders / backbones, including train, eval, inference, and export scripts, and pretrained weights -- ResNet, ResNeXt, EfficientNet, NFNet, Vision Transformer (V…
This is an official implementation for "Swin Transformer: Hierarchical Vision Transformer using Shifted Windows".
Video Autoencoder: self-supervised disentanglement of 3D structure and motion (ICCV 2021). Website: https://zlai0.github.io/VideoAutoencoder/
PyTorch implementation of MAE: https://arxiv.org/abs/2111.06377
PyTorch implementation of Contrastive Learning methods
CLIP (Contrastive Language-Image Pretraining): predict the most relevant text snippet given an image
PyTorch implementation of U-Net, R2U-Net, Attention U-Net, and Attention R2U-Net.
A data generation pipeline for creating semi-realistic synthetic multi-object videos with rich annotations such as instance segmentation masks, depth maps, and optical flow.
PyTorch implementation of MoCo: https://arxiv.org/abs/1911.05722
PyTorch implementation of MoCo v3: https://arxiv.org/abs/2104.02057
PyTorch implementation of SimSiam: https://arxiv.org/abs/2011.10566
[ICCV '21] "Unsupervised Point Cloud Pre-training via Occlusion Completion"
Datasets, Transforms and Models specific to Computer Vision
Code release for NeRF (Neural Radiance Fields)
A PyTorch implementation of NeRF (Neural Radiance Fields) that reproduces the results.
Code release for Local Light Field Fusion at SIGGRAPH 2019
PyTorch code for the ICLR 2020 paper "Learning to Explore using Active Neural SLAM"
[ECCV 2022] RC-MVSNet: Unsupervised Multi-View Stereo with Neural Rendering
A curated list of awesome 3D generation papers
PyTorch implementation of SwAV: https://arxiv.org/abs/2006.09882
Open-source code for the paper "Understanding Contrastive Representation Learning through Alignment and Uniformity on the Hypersphere" (ICML 2020)