Skip to content

Built a model to determine the risk associated with extending credit to a borrower. Performed Univariate and Bivariate exploration using various methods such as pair-plot and heatmap to detect outliers and to monitor the behaviour and correlation of the features. Imputed the missing values using KNN Imputer and implemented SMOTE to address the i…

Notifications You must be signed in to change notification settings

AyushTyagi1610/Credit-Risk-Modelling

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

11 Commits
 
 
 
 
 
 

Repository files navigation

Credit Risk Analysis

Introduction

Credit risk modelling refers to the use of financial models to estimate losses a firm might suffer in the event of a borrower’s default. Financial institutions used credit risk analysis models to determine the probability of default of a potential borrower. The models provide information on the level of a borrower’s credit risk at any particular time. If the lender fails to detect the credit risk in advance, it exposes them to the risk of default and loss of funds. Lenders rely on the validation provided by credit risk analysis models to make key lending decisions on whether or not to extend credit to the borrower and the credit to be charged. The probability of default, sometimes abbreviated as POD, is the likelihood that a borrower will default on their loan obligations.

Source

The dataset for this credit risk modelling project has been taken from kaggle.

Features

The dataset classifies the credit risk of borrower with respect to various features such as Age , Income , Employment length , Home ownership and other attributes of loan such as Interest Rate , Purpose etc.

Project Overview

In this project, different learning algorithms have been used to find the best algorithm for credit risk prediction. The project also handles the imbalanced data and imputing the missing values.

Methods used:

  1. K-Nearest Neighbours
  2. Logistic Regression
  3. Random Forest
  4. Decision Trees

Conclusion

After fitting and testing the different models on the data, Random Forest Regressor performed better than all other algorithms that we used and were able to predict the credit risk on our test set with an accuracy of 93%.

Summary

About

Built a model to determine the risk associated with extending credit to a borrower. Performed Univariate and Bivariate exploration using various methods such as pair-plot and heatmap to detect outliers and to monitor the behaviour and correlation of the features. Imputed the missing values using KNN Imputer and implemented SMOTE to address the i…

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published