Building Predictive Models for Customer Churn in Telecom

Experfy Editorial Team Experfy Editorial Team
Experfy Content Manager
August 8, 2017 FinTech

Need Predictive Analytics Training? Browse courses developed by industry thought leaders and Experfy in Harvard Innovation Lab.

Problem Description

Consumers today go through a complex decision making process before subscribing to any one of the numerous Telecom service options — Voice (Prepaid, Post-Paid), Data (DSL, 3G, 4G), Voice+Data, etc. Since the services provided by the Telecom vendors are not highly differentiated and number portability is commonplace, customer loyalty becomes an issue. Hence, it is becoming increasingly important for telecommunications companies to proactively identify customers that have a tendency to unsubscribe and take preventive measures to retain such customers.

The aim of this blog post is to introduce a predictive model to identify the set of customers who have a high probability of unsubscribing from the service now or in the near future using Personal Details, Demographic Information, Pricing and the Plans they have subscribed to. A secondary objective is to identify the features of the Independent Variables (also known as “X” or “Predictors”) which cause a great impact on the Dependent Variable (“Y” or “Outcome”) that causes a customer to unsubscribe.

Data Description

On the input data:

6 months of data with 3 million transactions

Predictors / Independent Variables (IV) considered:

  • Customer Demographics (Age, Gender, Marital Status, Location, etc.)
  • Call Statistics (Length of calls like Local, National & International, etc.)
  • Billing Information (what the customer paid for)
  • Voice and Data Product (Broadband services, Special Data Tariffs, etc.)
  • Complaints and Disputes (customer satisfaction issues and the remedial steps taken)
  • Credit History

On the output:

  • Target / Response considered for the model:
  • The value ‘1’ indicates UNSUBSCRIBED customers
  • The value ‘0’ indicates ACTIVE customers

Note: For the sake of brevity, I am ignoring the steps taken to clean, transform, and impute the data.

In any Predictive Model work, the data set has to be partitioned appropriately so as to avoid overfitting/underfitting issues amongst other things.

Prediction Accuracy & Model Selection:

Models built on TRAINING data set are validated using the VALIDATION data set. It is common to build multiple models including ensembles and compare their performance. The model that eventually gets deployed is the one that benefits the business the most, while keeping the error rate within acceptable limits.

Here are the 2 common error types in Churn Prediction:

Type I Error — False Negative: Failing to identify a customer who has a high propensity to unsubscribe.

From a business perspective, this is the least desirable error as the customer is very likely to quit/cancel/abandon the business, thus adversely affecting its revenue.

Type II Error — False Positive: Classifying a good, satisfied customer as one likely to Churn.

Any Predictive Algorithm going into Production will have to be the one with the least Type I error.

In our case, we used multiple algorithms on a Test data set of 300K transactions to predict Churn. Shown below are the results from the top 2 performing algorithms:

Algorithm 1: Decision Tree

Overall Error of Decision Tree Model = (1535+20685) / (31754+268246) = .074 = 7.4%

Prediction Accuracy for DT = 100–7.4 = 92.6%

Algorithm 2: Neural Networks

Overall Error of Neural Network Model = (1633+18771) / (31754+268246) = .068 = 6.8%

Prediction Accuracy for NN = 100–6.8 = 93.2%

Though the overall error rate of Neural Network was less than that of the Decision Tree algorithm, the Decision Tree model was chosen for deployment due the higher Type I error rate.

The model was chosen based on not only Prediction Accuracy, but also the impact of Type I Error.

Technology

R was used to build, validate, and test the models with the 3 miilion transaction data set.

It was re-implemented in Spark/MLLib/Scikit-Learn/HDFS to deal with larger datasets.

Results

Model predicts the likelihood of Customer Churn with high accuracy.

Key variables that were impacting Customer Churn or causing significant impact on the “Y” were:

  • Age (ages 21–40)
  • Salary (lower salaries)
  • Data Usage (those who used more data services)
  • Experfy Insights

    Top articles, research, podcasts, webinars and more delivered to you monthly.

  • Experfy Editorial Team

    Experfy Content Manager
    Tags
    Artificial IntelligenceBig Data & TechnologyDecision Tree algorithmPredictive ModelsTelecom serviceTelecom vendorstelecommunications companies
    © 2021, Experfy Inc. All rights reserved.
    Leave a Comment
    Next Post
    How to Become a Data Scientist (Part 1/3)

    How to Become a Data Scientist (Part 1/3)

    Leave a Reply Cancel reply

    Your email address will not be published. Required fields are marked *

    More in FinTech
    FinTech
    7+ Entrepreneurial Reasons to Invest in Cryptocurrency Development

    Startups and entrepreneurs worldwide are starting to realize the financial potential of cryptocurrency development and blockchain technology. Many investors and companies are looking to get in on the ground floor by investing in cryptocurrency development. Initially, many entrepreneurs have avoided the cryptocurrency space because of the risks, but there are many better reasons that you

    9 MINUTES READ Continue Reading »
    FinTech
    Is Your Money Safe In Crypto Exchanges? Top 3 Crypto Exchanges To Ensure Safety

    We all know that cryptocurrency is still in its nascent stage, and investing in crypto exchanges at this point could be risky for the investors, especially the newbies. Security always remains a major concern when you choose to buy digital currency like cryptocurrency. The growing market for cryptocurrencies has a significant demand for high-quality and

    5 MINUTES READ Continue Reading »
    FinTech
    Future Of Cryptocurrency – Will It Lead The Finance Market in Upcoming Years?

    Future Of Cryptocurrency – Will It Lead The Finance Market in Upcoming Years? Cryptocurrency has gained a lot of buzz in recent times, although there is still a lot to learn about this evolving technology. The technology is said to have powerful applications in a variety of fields, from healthcare to the media. Did you

    7 MINUTES READ Continue Reading »

    About Us

    Incubated in Harvard Innovation Lab, Experfy specializes in pipelining and deploying the world's best AI and engineering talent at breakneck speed, with exceptional focus on quality and compliance. Enterprises and governments also leverage our award-winning SaaS platform to build their own customized future of work solutions such as talent clouds.

    Join Us At

    Contact Us

    1700 West Park Drive, Suite 190
    Westborough, MA 01581

    Email: [email protected]

    Toll Free: (844) EXPERFY or
    (844) 397-3739

    © 2025, Experfy Inc. All rights reserved.