Print Shootout Parekh Kunal

  • November 2019
  • PDF

This document was uploaded by user and they confirmed that they have the permission to share it. If you are author or own the copyright of this book, please report to us by using this DMCA report form. Report DMCA


Overview

Download & View Print Shootout Parekh Kunal as PDF for free.

More details

  • Words: 707
  • Pages: 10
DATA MINING SHOOTOUT 2007 Presented by: ANKIT SAKLECHA KUNAL PAREKH ROHIT JAISWAL VINAY KANOJIA Copyright © 2007, SAS Institute Inc. All rights reserved.

Business Problem ƒ M.K Nurich offered a variety of magazines and periodicals to its customers ƒ Large customer base with diverse interests ƒ Blanket Marketing approach not feasible ƒ Adopt a more targeted marketing approach ƒ Attract more potential high value customers Copyright © 2007, SAS Institute Inc. All rights reserved.

Copyright © 2007, SAS Institute Inc. All rights reserved.

1

Business Understanding ƒ Target new prospects by predicting customer’s lifetime revenue (over 5 years) ƒ Build a model to rank customers in terms of expected revenue ƒ Find out the predicted revenue of those customers that were not solicited earlier ƒ Use data of current customers to expand into the untapped market

Copyright © 2007, SAS Institute Inc. All rights reserved.

Data Understanding ƒ Modeling dataset – 10,669 observations ƒ 177 modeling variables ƒ 1 continuous target variable ƒ OBS_ID – Unique Identifier ƒ Scoring dataset – 7,054 observations ƒ Another scoring dataset – unsolicited customers who had become members

Copyright © 2007, SAS Institute Inc. All rights reserved.

Copyright © 2007, SAS Institute Inc. All rights reserved.

2

Data Preparation ƒ Explored the distributions and measurement levels of different variables ƒ Partitioned the data into 70:30 for training and validation respectively ƒ Used several variable selection techniques like Variable Selection, DM Regression, DM Neural etc. ƒ Added variables one-by-one and ran the model and observed the change in the RMSE value Copyright © 2007, SAS Institute Inc. All rights reserved.

Data Preparation ƒ Rejected 8 variables initially that had about 74% missing values:

Copyright © 2007, SAS Institute Inc. All rights reserved.

Copyright © 2007, SAS Institute Inc. All rights reserved.

3

Data Preparation Variables Selected (22) :

Copyright © 2007, SAS Institute Inc. All rights reserved.

Data Preparation Transformation: ƒ Transformed the variables to normalize the distribution and reduce the skewness ƒ Max Normal Transformation produced the best results

Copyright © 2007, SAS Institute Inc. All rights reserved.

Copyright © 2007, SAS Institute Inc. All rights reserved.

4

Data Preparation Before Transformation:

Copyright © 2007, SAS Institute Inc. All rights reserved.

Data Preparation After Transformation:

Copyright © 2007, SAS Institute Inc. All rights reserved.

Copyright © 2007, SAS Institute Inc. All rights reserved.

5

Data Preparation Imputation: ƒ Imputed variables to replace missing values ƒ 22 variables had missing values

Copyright © 2007, SAS Institute Inc. All rights reserved.

Data Preparation

Copyright © 2007, SAS Institute Inc. All rights reserved.

Copyright © 2007, SAS Institute Inc. All rights reserved.

6

Modeling ƒ Ran different models like Neural network, Autoneural node, Regression, DM Neural, Dmine Regression etc. ƒ Experimented with the properties of the models ƒ Used Ensemble model to combine predictions from multiple models ƒ Found out the best model by doing a model comparison Copyright © 2007, SAS Institute Inc. All rights reserved.

Evaluation Results of Model Comparison:

ƒ The best model was Neural network with 8 neurons ƒ It had the lowest RMSE value of 158.94 Copyright © 2007, SAS Institute Inc. All rights reserved.

Copyright © 2007, SAS Institute Inc. All rights reserved.

7

Scoring ƒ Used our best model to score the datasets ƒ P_rev_all was the column that was exported along with the OBS_ID ƒ P_rev_all displayed the revenue generated for customers

Copyright © 2007, SAS Institute Inc. All rights reserved.

Scoring ƒ For the 1st Scoring dataset, the customer with an ID of 13656 would generate the maximum revenue of $936.76

Copyright © 2007, SAS Institute Inc. All rights reserved.

Copyright © 2007, SAS Institute Inc. All rights reserved.

8

Scoring ƒ For the 2nd Scoring dataset, the customer with an ID of 1200 would generate the maximum revenue of $323.74

Copyright © 2007, SAS Institute Inc. All rights reserved.

Conclusion ƒ Determine the predicted revenues based on customer’s lifetime value ƒ Target customers with highest predicted revenues ƒ Maximize the Profit and Net Revenue

Copyright © 2007, SAS Institute Inc. All rights reserved.

Copyright © 2007, SAS Institute Inc. All rights reserved.

9

THANK YOU

Copyright © 2007, SAS Institute Inc. All rights reserved.

Copyright © 2007, SAS Institute Inc. All rights reserved.

10

Related Documents

Print Shootout Parekh Kunal
November 2019 13
Kunal
November 2019 17
Kunal- Hindi
June 2020 15
Kunal Sharma
November 2019 19
Six Gun Shootout
May 2020 2