Skip to content

This repository contains data analysis and information about Cyclistic Bike Share project from Google Data Analytics Certificate.

Notifications You must be signed in to change notification settings

hoang-nguyen1/Google-Data-Analytics-Capstone-Project

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 

Repository files navigation

Google Data Analytics Capstone Project


Background

In 2016, Cyclistic launched a successful bike-share offering. Since then, the program has grown to a fleet of 5,824 bicycles that are geotracked and locked into a network of 692 stations across Chicago. The bikes can be unlocked from one station and returned to any other station in the system anytime. Until now, Cyclistic’s marketing strategy relied on building general awareness and appealing to broad consumer segments. One approach that helped make these things possible was the flexibility of its pricing plans: single-ride passes, full-day passes, and annual memberships. Customers who purchase single-ride or full-day passes are referred to as casual riders. Customers who purchase annual memberships are Cyclistic members.


1. Ask

Business Question:

How do annual members and casual riders use Cyclistic bikes differently?

Key Stakeholders:

  • Lily Moreno
  • Cyclistic marketing analytics team
  • Cyclistic executive team

Business Task:

Use the data to explore and gain insights on how annual members and casual riders use bikes differently. From trends and insights, assist the team to implement marketing strategies aimed at converting casual riders into annual members.


2. Prepare

The data include information about Cyclistic's historical trip and customers' patterns in bike usage. It is obtained between April 2022 and March 2023 (12 months).

Link: Data Source


3. Process

Due to large amount of data and flexbility of R language, I will process the data with R Studio to work with the data more efficiently. This will be applied to data cleaning, analysis, and visualization.

Data Preparation

  • Libraries: tidyverse, lubridate, ggplot2
  • Data are stored in CSV format and imported separately by months
  • Checked data columns for name consistencies
  • 12 months of data are merged into a single dataframe

Date Cleaning

  • Created additional columns (date, month, day, year, and day of week) for data analysis
  • Removed start_lat, start_lng, end_lat, and end_lng columns from the data
  • Got rid of missing values using na.omit()
  • Created a ride length column in minutes with difftime()
  • Converted the ride length column into numeric
  • Removed and filtered the data so that it does not include values with negative or zero ride lengths
  • Ordered the day of week column

4. Analyze

The analyze phase will explore the question "How do annual members and casual riders use Cyclistic bikes differently?" by performing descriptive statistical analysis and aggregating the data with group by to extract trends from bike users.

Descriptive analysis

  • Calculated the mean, median, min, and max on ride length column
  • For each member types, I explored the average, min, and max ride length

Data aggregation

  • Explored the members type by looking at the count between annual vs. casual members
  • Count number of each bikes type
  • Group by members and bikes type to find to total count
  • Count number of rides of weekday based on members type
  • Calculated the average ride length by bike and member type
  • Calculated the average of ride length for each member types based on week day

5. Share

After analyzing the data, I visualize the data with pie chart and bar graphs to discern the differences in bike patterns visually between member and casual riders. Data visualization is a powerful tool to draw insights quickly

Visualization #1: Percentage Distribution of Annual and Casual Riders

Annual vs  Casual Pie Chart
  • Overall, there are more member than casual riders (60% vs. 40%).

Visualization #2: Total Number of Rides by Bike and Member Types

Screenshot 2023-05-15 at 12 30 56 PM
  • Between these three bikes (classic bike, docked bike, and electric bike), classic bike is the most popular type of bike among member and casual riders. Furthermore, members have higher number of rides for classic and electric bikes than casual riders (due to larger proportion of members). Unlike casual riders, they do not use docked bikes.

Visualization #3: Total Number of Rides by Member Types and Day of Week

Screenshot 2023-05-15 at 12 46 26 PM
  • During weekdays, there are high amount of bike usage among members compared to casual riders. However, casual riders increase their bike usage in the weekend.

Visualization #4: Total Number of Rides by Member Types and Month

Screenshot 2023-05-15 at 12 55 03 PM
  • For both member and casual riders, bike usage is lowest in the beginning and at the end of the year (January-April and November-December). These members prefer to use bikes throughout the middle of the year, which indicate they have higher preferences in bike usage during the summer and fall.

Visualization #5: Average Ride Duration Spent by Member Types

Screenshot 2023-05-15 at 1 03 15 PM
  • On average, casual riders spend roughly two times more than members in riding bikes (indicated in minutes).

Visualization #6: Average Ride Duration Spent by Member and Bike Types

Screenshot 2023-05-15 at 1 08 14 PM
  • Members spend significantly less minutes in all the bikes compared to casual riders and they do not use docked bikes at all. In addition, casual riders spend the most time on docked bikes in relative to other bikes.

6. Act

Conclusion

After analyzing and visualizing the data, some major findings that I found were larger percentage of members compared to casual riders, higher preferences for classic bike for riders, popularity in bike usage during summer and fall, and higher average ride duration for casual riders. Additionally, docked bike is the least popular bike due its long riding time as indicated by average spent time in casual riders.

Recommendations

  • Due to high popularity in bike usage throughout the summer and early fall, it would be a good idea to launch a incentive campaign to maintain or increase bike usage
  • Casual riders spend more time in riding bike, therefore providing some kind of discounts or promotions for electric bike would help them save some time
  • Launch marketing campaign that target the benefits of converting into annual membership for casual riders through advertisements, emails, and brochures

About

This repository contains data analysis and information about Cyclistic Bike Share project from Google Data Analytics Certificate.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages