PDF table data extraction and analysis repository
Curtin Institute for Computation (CIC)
- Foad Farivar (foad.farivar@curtin.edu.au)
- Daniel Marrable (D.Marrable@curtin.edu.au)
- Carlo Martinotti (carlo.martinotti@curtin.edu.au)
School of Management and Marketing
- Ramon Wenzel (ramon.wenzel@curtin.edu.au)
For the sake of brevity, this document assumes the user is running Linux. Windows users can get a similar terminal interface by using WSL.

The main way to use the app is via a Docker container. You need to install Docker on your local system to be able to build and run the Docker image.

The following folders should be present in your [developer@workfor.com.au] Google Drive:
- a. Log in to the Google Cloud console
- b. At the top left corner click [select a project] and select dataingest
- c. From the top left Navigation bar, select [APIs and Services]
- d. Click Credentials from the left side menu
- e. Download the OAuth 2.0 Client IDs as a JSON file and rename it to "client_secret.json"
- f. Copy this file to the same directory as the Dockerfile
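Before building, it can help to confirm that the credentials file really sits next to the Dockerfile. The following is a minimal sketch and not part of the repository: the function name `check_client_secret` is hypothetical, and it only checks that both files exist and that the secret file is non-empty (it does not validate the JSON content).

```shell
#!/bin/sh
# Hypothetical helper (not part of the repository): verify that
# client_secret.json sits in the same directory as the Dockerfile.
check_client_secret() {
    dir="${1:-.}"   # directory to check; defaults to the current directory
    if [ ! -f "$dir/Dockerfile" ]; then
        echo "No Dockerfile found in $dir" >&2
        return 1
    fi
    # -s is true only when the file exists and is not empty
    if [ ! -s "$dir/client_secret.json" ]; then
        echo "client_secret.json is missing or empty in $dir" >&2
        return 1
    fi
    echo "client_secret.json found next to the Dockerfile in $dir"
}
```

For example, `check_client_secret . && docker build -t pdea .` would start the build only when the check passes.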
- a. In the main directory, run:
docker build -t pdea[:tag] .
- a. If the image was successfully built in the previous step, run:
docker run -d --name dataingest -e PORT=8080 -p 8080:8080 pdea[:tag]
- b. You can check the logs of the running docker container by running:
docker logs -f --details dataingest
You should see something like this...
- a. By default, the application is served at http://localhost:8080/
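The container may take a moment to start serving requests. One way to wait for it is to poll the URL, as in the sketch below. This is an illustration only, not part of the repository: the function name `wait_for_app`, the default of 10 attempts, and the 2-second delay are all assumptions, and it relies on `curl` being installed.

```shell
#!/bin/sh
# Hypothetical helper (not part of the repository): poll a URL until it
# responds, or give up after a number of attempts.
wait_for_app() {
    url="${1:-http://localhost:8080/}"
    attempts="${2:-10}"   # how many times to try (assumed default)
    delay="${3:-2}"       # seconds to wait after a failed try (assumed default)
    i=1
    while [ "$i" -le "$attempts" ]; do
        # -s: silent, -o /dev/null: discard the body. Without -f, curl
        # exits 0 as soon as the server returns any HTTP response.
        if curl -s -o /dev/null "$url"; then
            echo "Application is responding at $url"
            return 0
        fi
        i=$((i + 1))
        sleep "$delay"
    done
    echo "No response from $url after $attempts attempts" >&2
    return 1
}
```

Called with no arguments, `wait_for_app` polls http://localhost:8080/ using the assumed defaults. A nonzero exit status suggests checking the container logs.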
- a. Run
docker stop dataingest
- b. Run
docker rm dataingest
Note 1: If at any point you get a credentials error, go to http://localhost:8080/authorize and follow the steps.
Note 2: If you change the code, you need to rebuild the Docker image and run the new image.
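The rebuild cycle from Note 2 can be collected into one helper. The sketch below simply chains the commands already listed in this document; the function names `run` and `redeploy` and the `DRY_RUN` variable are hypothetical and not part of the repository. Setting `DRY_RUN=1` prints the commands instead of executing them.

```shell
#!/bin/sh
# Hypothetical helpers (not part of the repository).

# Execute a command, or only print it when DRY_RUN=1.
run() {
    if [ "${DRY_RUN:-0}" = "1" ]; then
        echo "$*"
    else
        "$@"
    fi
}

# Stop and remove the old container, rebuild the image, start a new container.
redeploy() {
    image="${1:-pdea}"   # image name, optionally with a tag, e.g. pdea:dev
    # Failures of stop/rm (e.g. no container exists yet) do not abort the cycle.
    run docker stop dataingest
    run docker rm dataingest
    # Do not start a container if the build fails.
    run docker build -t "$image" . || return 1
    run docker run -d --name dataingest -e PORT=8080 -p 8080:8080 "$image"
}
```

Running `DRY_RUN=1 redeploy pdea:dev` from the directory containing the Dockerfile prints the four commands without touching Docker, which is a cheap way to review them first.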