Amodal completion with transformers

Description

An experiment to see if we can train ViT to output amodally completed shape. Amodal completion is a perceptual phenomenon where shapes occluded by other shapes appear to us as complete. For instance, if a disk occludes part of a rectangle, we still perceive that rectangle as a rectangle rather than some odd shape that has a small portion missing.

Here we hypothesize that training a ViT to output full shapes that are behind an occluder (these are our targets that a model is learning to predict) is a sufficient signal to learn amodal completion. Caveat -– only rectangles and discs used, so our results may not generalize to more complex scenes.

This ViT implementation is based on PyTorch Lightning Tutorial 11.

How to run

Install dependencies

# clone project   
git clone https://github.com/qbilius/amodal

# install project   
cd amodal 
pip install -e .   
pip install -r requirements.txt

Run training locally: python amodal/train.py.
Observe results with tensorboard: tensorboard --logdir=output.
Visualize loss with python amodal/visualization.py plot_loss --version <version number>.
Visualize amodal completion results with python amodal/visualization.py plot_results --version <version number>.

Details

Architecture:
- Image embedding into a 64-dimensional space
- Positional encoding, sampled from a normal distribution
- 4 transformer layers with 128-dimensional hidden layers
- A final fully-connected prediction layer that de-embeds outputs back into an image space
Optimizer: SDG with learning rate = .1 and momentum .9
Training: 150 training epochs on a dataset of 50k examples (~2 hours)

Results

Checkpoint - Parameters - Log

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
amodal		amodal
results		results
.flake8		.flake8
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Amodal completion with transformers

Description

How to run

Details

Results

License

About

Releases 1

Packages

Languages

License

qbilius/amodal

Folders and files

Latest commit

History

Repository files navigation

Amodal completion with transformers

Description

How to run

Details

Results

License

About

Resources

License

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages