This job scraper is for the #jobs
channel in our Slack Community.
Prerequisites:
- 🐍 Python 3.10 or higher
- 📦 Poetry as the package manager
- 👩🏽💻 VS Code is the recommended IDE, but you can use others like PyCharm or Sublime
- ⚙️
.env
file with SLACK and OPENAI keys (ask maintainer or use your own values)
-
Clone the repo and open it in your IDE
-
Open the Intergrated Terminal and install the packages and dependencies
poetry install
-
Create a
.env
file with the appropriate ENVIRONMENT_VARIABLES (ask a maintainer)# Run these tests to check if your system is setup poetry run pytest tests/test_setup.py
- main.ipynb is the main program that does the scraping and posting to Slack
/notebooks
contains notebooks with runnable code!/jobs
contains the core modules for this "product"/tests
contains the tests!- Config files and such at the Workspace Root
- Linter and Formatter: Ruff
- Testing: pytest
- Project Config: pyproject.tom
Currently, this repo scrapes from Indeed
and Google Jobs
.
💡 I used this article about the "best job sites in 2024" to start
The main steps are:
- Use Pylenium to scrape jobs from Indeed and parse data to a standard format
- Use AI to filter out irrelevant jobs
- Post relevant jobs to the
#jobs
channel in the QAP Slack Community