Churn Prediction Analysis and Model Development

This project involves a comprehensive analysis of customer churn data and the development of a predictive model to identify customers who are at risk of churning. The project utilizes various data analysis techniques, statistical tests, and machine learning models.

Project Overview

The goal of this project is to understand the factors influencing customer churn and develop a predictive model that can help identify customers who are at risk of leaving. The project includes:

Data exploration and visualization.
Statistical analysis to determine relationships between variables.
Development and tuning of a machine learning model using XGBoost.
Saving the trained model for future predictions.

Dataset

The dataset used in this project contains customer information such as age, gender, geographical location, balance, credit score, and whether the customer has churned or not.

Exploratory Data Analysis (EDA)

Customer Demographics

Age Distribution: The distribution of customers across different age groups was visualized using a histogram.
Gender Distribution: The gender distribution of customers was analyzed using a pie chart.

Churn Analysis

Churn Percentage: The percentage of customers who have churned was calculated.
Reasons for Churn: A chi-square test was performed to determine if there is a significant relationship between categorical variables (e.g., Geography, Gender, NumOfProducts) and customer churn.
Churn Patterns: Churn patterns were identified by analyzing the distribution of churned customers across different segments (e.g., Geography, Gender, Age).

Financial Analysis

Average Account Balance: The average balance of customers was calculated.
Financial Characteristics: The financial characteristics of churned vs. non-churned customers were compared using histograms.

Predictive Modeling

Feature Engineering

Geography and Gender Encoding: The Geography column was converted to boolean values using one-hot encoding, and the Gender column was mapped to binary values.

Model Development

Initial XGBoost Model: An XGBoost model was trained to compute feature importances, identifying the top 5 most significant predictors of customer churn.

Hyperparameter Tuning

Grid Search: Hyperparameter tuning was performed using GridSearchCV to find the best model configuration.

Model Evaluation

Classification Report: The final model was evaluated using a classification report, providing metrics such as precision, recall, and F1-score.

Requirements

To run this project, you need the following Python libraries:

pandas
matplotlib
scipy
xgboost
scikit-learn
joblib

How to Run

Clone the Repository: Clone this repository to your local machine.
Install Dependencies: Install the required Python libraries.
Run the Script: Execute the Jupyter notebook or Python script to perform the analysis and train the model.
View Results: The trained model will be saved as model.pkl, and you can use it for future predictions.

Results

The top 5 most significant predictors of customer churn were identified.
A tuned XGBoost model was developed, achieving high accuracy in predicting at-risk customers.

Contributing

If you would like to contribute to this project, please fork the repository and submit a pull request.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
.gitattributes		.gitattributes
Churn Modelling.ipynb		Churn Modelling.ipynb
P3- Churn-Modelling Data.xlsx		P3- Churn-Modelling Data.xlsx
README.md		README.md
model.pkl		model.pkl

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Churn Prediction Analysis and Model Development

Table of Contents

Project Overview

Dataset

Exploratory Data Analysis (EDA)

Customer Demographics

Churn Analysis

Financial Analysis

Predictive Modeling

Feature Engineering

Model Development

Hyperparameter Tuning

Model Evaluation

Requirements

How to Run

Results

Contributing

About

Releases

Packages

Languages

mishra-krishna/Customer-Churn-Analysis-and-Prediction

Folders and files

Latest commit

History

Repository files navigation

Churn Prediction Analysis and Model Development

Table of Contents

Project Overview

Dataset

Exploratory Data Analysis (EDA)

Customer Demographics

Churn Analysis

Financial Analysis

Predictive Modeling

Feature Engineering

Model Development

Hyperparameter Tuning

Model Evaluation

Requirements

How to Run

Results

Contributing

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages