Bank-Customer-Segmentation

Business Case & Problem Statement

If there is enough data about customers, data science can be applied for market segmentation. In this case study, the bank has extensive data on their customers for the past 6 months. As a marketing team at the bank we will divide customers into distinctive groups to launch a targetted ad marketing campaign. Process of grouping customers based on their interactions with the business and having some common characteristics. In most cases, interactions depend on their purchase behaviors.

Data

Explanation of each independent features

CUSTID: Identification of Credit Card holder
BALANCE: Balance amount left in customer's account to make purchases
BALANCE_FREQUENCY: How frequently the Balance is updated, score between 0 and 1 (1 = frequently updated, 0 = not frequently updated)
PURCHASES: Amount of purchases made from account
ONEOFFPURCHASES: Maximum purchase amount done in one-go
INSTALLMENTS_PURCHASES: Amount of purchase done in installment
CASH_ADVANCE: Cash in advance given by the user
PURCHASES_FREQUENCY: How frequently the Purchases are being made, score between 0 and 1 (1 = frequently purchased, 0 = not frequently purchased)
ONEOFF_PURCHASES_FREQUENCY: How frequently Purchases are happening in one-go (1 = frequently purchased, 0 = not frequently purchased)
PURCHASES_INSTALLMENTS_FREQUENCY: How frequently purchases in installments are being done (1 = frequently done, 0 = not frequently done)
CASH_ADVANCE_FREQUENCY: How frequently the cash in advance being paid
CASH_ADVANCE_TRX: Number of Transactions made with "Cash in Advance"
PURCHASES_TRX: Number of purchase transactions made
CREDIT_LIMIT: Limit of Credit Card for user
PAYMENTS: Amount of Payment done by user
MINIMUM_PAYMENTS: Minimum amount of payments made by user
PRC_FULL_PAYMENT: Percent of full payment paid by user
TENURE: Tenure of credit card service for user

Step by Step Methodology:

Import Libraries and Dataset
Data Clean Up
Exploratory Data Analysis & Feature Selection
Scale the Data Using ‘StandardScaler’
Generation of Clusters using K-means
Generation of Clusters using Hierarchical Agglomerative
Inspection of Clusters using Principal Component Analysis
Generation of Clusters using AutoEncoders

Conclusion

Optimal number of cluster is found by applying K-means Algorithm with elbow-method, and applied K-Means k=8 number of clusters. I have also tried autoencoders to cluster customers using encoded data with 10 features and divided customers into four distinctive groups.

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
Bank_Customer_Segmentation.ipynb		Bank_Customer_Segmentation.ipynb
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Bank-Customer-Segmentation

Business Case & Problem Statement

Data

Step by Step Methodology:

Conclusion

About

Releases

Packages

Languages

hakanco/Bank-Customer-Segmentation

Folders and files

Latest commit

History

Repository files navigation

Bank-Customer-Segmentation

Business Case & Problem Statement

Data

Step by Step Methodology:

Conclusion

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages