Skip to content

This project has 2 part, one is scrape indeed by beautifulSoap and selenium, then save csv file, 2nd part is data visulization, infuture it will have GUI by PyQt5

Notifications You must be signed in to change notification settings

kkwangsir/Scraping

Repository files navigation

Scraping

This project has 2 part, one is scraping indeed.ca by beautifulSoap and selenium, then save csv file, 2nd part is data visulization, in future, it will have GUI by PyQt5

The scrape processing

I used the selenium and webdriver to read the real web pages, and iterate innerhtml the of id:result to collect the few tags of data, and when indeed show promotion page I added a script to find close button Click the pic or link below to watch on youtube

Watch the video

https://youtu.be/irgcfEXtVkk


Data Analysis

Through the spider, we got two csv files, which are job advertisements for Saskatchewan python and Java programmers. First I cleaned up the duplicates, and then sorted the employers who posted the job ads, and then got the top ten companies.

0

Finally, I analyzed the job description and took the keywords for the programmer's level. Through the previous data, I know that there are 77 positions. Through the keyword chart, I know that the Saskatchewan IT company has a strong demand for senior programmers, and the demand for entry level programmers is not high. I think that most entry programmers work in companies that come through the school's coop or intern , so public information sites can hardly see a large number of entry programmers' job advertisements.

0

About

This project has 2 part, one is scrape indeed by beautifulSoap and selenium, then save csv file, 2nd part is data visulization, infuture it will have GUI by PyQt5

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages