This job recommendation engine combines data scraping, machine learning, and a Streamlit application for deployment. The model is trained on a custom dataset built by combining the Stack Overflow Developer Survey (2018) with a Kaggle job-postings dataset. The data is preprocessed with custom components, and a collaborative-filtering recommendation system is then applied to generate personalized recommendations. The system uses historical data to build a similarity matrix (cosine similarity metric). Recommendations are based solely on the skills listed in the job posting and the skills the candidate has.
Stack Overflow Survey dataset: Link
Kaggle dataset for job postings: Link
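As a rough sketch of the skills-based cosine similarity idea described above (the toy data, skill names, and IDs below are illustrative, not the repository's actual code):

```python
# Minimal sketch of skills-based cosine similarity between users and jobs.
# Rows are users/jobs, columns are skills (1 = has / requires the skill).
import pandas as pd
from sklearn.metrics.pairwise import cosine_similarity

user_skills = pd.DataFrame(
    [[1, 1, 0, 0], [0, 1, 1, 0], [1, 0, 0, 1]],
    index=["user_1", "user_2", "user_3"],
    columns=["python", "sql", "java", "aws"],
)
job_skills = pd.DataFrame(
    [[1, 1, 0, 0], [0, 0, 1, 1]],
    index=["job_1", "job_2"],
    columns=["python", "sql", "java", "aws"],
)

# Cosine similarity between every user and every job.
similarity = cosine_similarity(user_skills.values, job_skills.values)
scores = pd.DataFrame(similarity, index=user_skills.index, columns=job_skills.index)

# Top recommendation per user: the job with the highest similarity score.
print(scores.idxmax(axis=1))
```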
Initially, an exploratory data analysis (EDA) is done.
In order to build a custom dataset, feature extraction is done. Certain columns from the Stack Overflow dataset are used, such as the skills section and the technologies worked on; details can be found in feature_extraction_user_a.ipynb and feature_extraction_user_b.ipynb.
Next, the job-openings dataset taken from Kaggle is preprocessed, and a random users-company dataset is built; it can be found as colabdata.csv.
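As an illustration of the kind of column selection this involves, the sketch below pulls a few skill-related columns from the 2018 survey and joins them into one skill string per user; the column names and file paths are assumptions based on the public survey schema, not necessarily what the notebooks use:

```python
# Illustrative sketch: extract skill-related columns from the Stack Overflow
# 2018 survey and collapse them into a single semicolon-separated skill string
# per user. Column names and file paths are assumptions, not the exact set
# used in the feature-extraction notebooks.
import pandas as pd

survey = pd.read_csv("data/collaborative filtering/survey_results_public.csv")

skill_columns = ["LanguageWorkedWith", "DatabaseWorkedWith",
                 "FrameworkWorkedWith", "PlatformWorkedWith"]

user_skills = (
    survey[skill_columns]
    .fillna("")
    .apply(lambda row: ";".join(v for v in row if v), axis=1)
    .rename("skills")
)
user_skills.to_csv("Features/userskills.csv", index_label="user_id")
```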
Detailed features extracted from the dataset can be found in the Features folder.
- Download the Stack Overflow dataset and the job-postings dataset from the links above and place them in the /data/collaborative filtering folder.
- Run collaborative filtering.ipynb to check the output of the collaborative filtering recommendations.
In order to run the collaborative filtering model:
- Create a Python virtual environment, activate it, and install the packages from the requirements.txt file:
python -m venv venv
venv\Scripts\activate
pip install -r requirements.txt
- Download the two datasets mentioned above and place them in the data folder, then run collaborative filtering.ipynb.
- The model depends on all files in the data folder. The CSV files in the data folder contain the user skills (userskills) and colabdata.
- The recommendations.csv file contains the top 10 recommendations for a random sample (the first 200 users) of the Stack Overflow dataset.
- Inferences.ipynb: contains the code used to make inferences about the dataset.
To run the files, download the Stack Overflow dataset and the job-postings dataset from the given links and place them in the /data/collaborative filtering folder.
Run collaborative filtering.ipynb to check the output of the collaborative filtering recommendations based on skills.
To ease the process, the output for the first 200 users is stored in recommendations.csv; you can use that directly for inference.
The similarity matrix is also exported as similarity_matrix.pkl. (Note: the matrix for the whole dataset is exported.)
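A minimal inference sketch using these exported artifacts is shown below; the file names come from the repository, but the layout of the pickled matrix (assumed here to be a 2-D user-by-job array) is an assumption:

```python
# Quick inference sketch using the exported artifacts.
import pickle
import pandas as pd

# Precomputed top-10 recommendations for the first 200 users.
recommendations = pd.read_csv("recommendations.csv")
print(recommendations.head())

# Cosine similarity matrix for the whole dataset.
with open("similarity_matrix.pkl", "rb") as f:
    similarity_matrix = pickle.load(f)

# Assuming rows correspond to users and columns to jobs, the top-10 jobs for a
# given user index can be read off by sorting that user's row.
user_index = 0
top10 = pd.Series(similarity_matrix[user_index]).nlargest(10)
print(top10)
```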
You may individually run the End-to-End/collaborative filtering.ipynb file, or run it using the Streamlit application:
- First, clone the repository:
git clone https://github.com/farvath/job-recommendation-engine.git
- Update the paths to the pre-processed data stored in pickle files (e.g., similarity_matrix.pkl), and make sure these files are present in the same directory as your application (app.py).
- In your terminal, navigate to your project directory (Frontend/app.py) and run the following command:
streamlit run app.py
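For orientation, a minimal Streamlit front end along these lines might look as follows; this is a sketch, not the repository's actual app.py, and the input widget and matrix layout are assumptions:

```python
# Minimal Streamlit sketch (not the actual app.py): load the exported
# similarity matrix and show the most similar jobs for a chosen user.
import pickle

import pandas as pd
import streamlit as st

st.title("Job Recommendation Engine")

# Load the exported cosine similarity matrix (assumed to be user x job).
with open("similarity_matrix.pkl", "rb") as f:
    similarity_matrix = pickle.load(f)

user_index = st.number_input("User index", min_value=0,
                             max_value=len(similarity_matrix) - 1, value=0)

if st.button("Recommend"):
    scores = pd.Series(similarity_matrix[int(user_index)])
    top10 = scores.nlargest(10)
    st.write("Top 10 job indices by similarity:")
    st.dataframe(top10.rename("similarity"))
```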
- The majority of the candidates have registered themselves and have their own user_id.
- Out of the 5000 candidates, it is assumed that around 2000 have already been hired.
- Every candidate has rated all of the recommended jobs they received before.
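A rough sketch of how a synthetic users-company ratings table under these assumptions could be generated is shown below; the column names, rating scale, and number of jobs per user are illustrative, not the exact procedure used to build colabdata.csv:

```python
# Sketch of generating a synthetic users-company ratings table: 5000
# candidates, ~2000 already hired, each candidate rating the jobs previously
# recommended to them. Column names and the 1-5 rating scale are assumptions.
import numpy as np
import pandas as pd

rng = np.random.default_rng(42)

n_users, n_hired, n_jobs = 5000, 2000, 300
hired_users = set(rng.choice(n_users, size=n_hired, replace=False))

rows = []
for user_id in range(n_users):
    # Each candidate rates a handful of previously recommended jobs.
    for job_id in rng.choice(n_jobs, size=5, replace=False):
        rows.append({
            "user_id": user_id,
            "job_id": int(job_id),
            "rating": int(rng.integers(1, 6)),  # 1-5 rating (assumed scale)
            "hired": user_id in hired_users,
        })

colabdata = pd.DataFrame(rows)
colabdata.to_csv("colabdata.csv", index=False)
```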