- debug to use GPU
- try to apply pytorch lightening
- Define an argument to download up to n images
- validate annotations and mark them not to validate it again
- Read cited papers at https://github.com/lucidrains/DALLE2-pytorch
- train the model and debug it
- deploy it as an ONNX model for web browsers
pip install -U pip
poetry config repositories.pytorch https://download.pytorch.org/whl/cpu
poetry install
poe download_annotations # download the annotations source
poe import_annotations # import annotations. this is recommended to finish this completely
poe download_images # download images as much as you want
poe update_splits # update the trainset index and the testset index to take account of the recently downloaded images
poe start_servers
You may want to access your local servers from a remote host. Then you can use alternative addresses provided by ngrok.
poe start_ngrok
In Colab, you may run commands as follow.
!pip install --quiet mlflow
!MLFLOW_TRACKING_URI="https://your_tracking_server_url" \
DATASETS_URL="https://your_datasets_server_url" \
mlflow run https://github.com/hotohoto/minidalle2.git -e train_clip -v main -P n-epochs=1 --env-manager=local
You can also run the other tasks than train_clip
. They are defined in MLproject.
poe test
- https://github.com/lucidrains/DALLE2-pytorch
- https://redcaps.xyz/
- https://github.com/redcaps-dataset/redcaps-downloader
- https://python-poetry.org/docs/repositories/
- https://github.com/hotohoto/python-example-project
- https://medium.com/analytics-vidhya/build-a-machine-learning-laboratory-from-anywhere-google-colab-ngrok-ca7590777bd8