Analyze bike ride data to differentiate casual riders from premium members and develop strategies to convert casual riders, enhancing profitability. This project includes data cleaning, exploratory data analysis, and insightful visualizations.
- Introduction
- Data Collection
- Data Cleaning and Processing
- Exploratory Data Analysis
- Insights and Visualizations
- Conclusion
- References
This project aims to analyze bike rides data to determine the difference between casual riders and premium members. The goal is to increase profits by converting casual riders into premium members.
The dataset consists of multiple CSV files for each month. The data includes the following attributes:
- Ride ID
- Ride Type (casual or member)
- Start Time
- End Time
- Start Station
- End Station
- Bike Type
Data cleaning and processing were performed using SQL and Excel. The steps involved:
- Removing duplicates
- Handling missing values
- Converting data types
- Creating new features
Exploratory Data Analysis (EDA) was conducted to understand the data better. Here are some key visualizations:
Figure 1: Number of Rides by Month
Figure 2: Average Ride Duration by User Type
Based on the EDA, several insights were derived. Some key findings include:
-
Casual riders tend to have longer ride durations.
-
Weekends see a higher number of casual riders compared to weekdays.
Figure 3: Ride Duration Distribution
Detailed visualizations and reports can be found in the visualizations and docs/reports directory.
The analysis provides actionable insights that can help increase the conversion rate of casual riders to premium members.