Skip to content
This repository has been archived by the owner on Dec 21, 2021. It is now read-only.

Crawling@Home - Help us build a billion-scale image-caption dataset by filtering Common Crawl with OpenAI CLIP

License

Notifications You must be signed in to change notification settings

Wikidepia/crawlingathome-worker

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Crawling@Home Worker

Deprecated! Please use ARKseal/crawlingathome-worker

Discord Chat Docker Image

Help us build a billion-scale image-caption dataset by filtering Common Crawl with OpenAI CLIP

Setup

Prebuilt Docker Image

To use worker docker images run the following commands, where $NICKNAME is your nickname that will be showed in leaderboard.

docker run --detach \
  --name watchtower \
  --restart=on-failure \
  --volume /var/run/docker.sock:/var/run/docker.sock \
  containrrr/watchtower --label-enable --cleanup --interval 1800 && \
docker run --detach -it \
  --name crawlingathome-worker \
  --label=com.centurylinklabs.watchtower.enable=true \
  --restart=on-failure \
  --shm-size 1G \
  -e NAME=$NICKNAME \
  wikidepia/crawlingathome-worker:latest-cpu

Build Docker Image

You can build docker image yourself with the following command

docker build --no-cache --build-arg GIT_COMMIT=$(git rev-parse HEAD) .

Contribute

You are more than welcome to contribute to this development, and make it more sane :)

About

Crawling@Home - Help us build a billion-scale image-caption dataset by filtering Common Crawl with OpenAI CLIP

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Contributors 4

  •  
  •  
  •  
  •