Skip to content

Latest commit

 

History

History
28 lines (17 loc) · 1.18 KB

README.md

File metadata and controls

28 lines (17 loc) · 1.18 KB

Hindawi downloader

Download all books from Hindawi organization's website in PDF format.

Pre-requisites

Installation

  • Open terminal inside the project's folder
  • $ poetry install

Usage

  • $ python dl.py all-books download all the books (2147) sequentially. Beware, a lot of Gigabytes :))
  • $ python dl.py starting-from <line_number> download all books starting from line number inside the books_data.csv file, useful when the above command is interrupted
  • $ python dl.py book <book_id> download a specific book using its id

Notes

  • You'll find the downloaded books inside the كتب مؤسسة هنداوي folder (will be automatically created when you run the command)
  • get_books_data.py is used to get the all books data upfront in form of (id, title, pdf_url)
  • books_data.csv is already downloaded for you! You don't need to use the get_books_data.py script unless you want to remove the books_data.csv file and re-generate it again.

Thank you 😄