- clone the repo: git clone https://github.com/benjlis/foiarchive-pdfloader.git
- cd foiarchive-pdfloader
- create a virtual environment: python3 -m venv env
- activate the envionment: . env/bin/activate
- install the requirements: pip install -r requirements.txt
- define required environmental variables and store in .env
- run it in the background with nohup: nohup python -u pdf2pgs3.py >> load.log 2>&1&
-
Notifications
You must be signed in to change notification settings - Fork 0
Downloads PDFs and stores the text in the FOIArchive database and a copy in an s3 bucket
License
history-lab/foiarchive-pdfloader
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
Repository files navigation
About
Downloads PDFs and stores the text in the FOIArchive database and a copy in an s3 bucket
Resources
License
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published