Data accuracy's impact on segmentation performance: Benchmarking RFM analysis,logistic regression,and decision trees |
| |
Authors: | Kristof Coussement Filip A.M. Van den Bossche Koen W. De Bock |
| |
Affiliation: | 1. IESEG School of Management, Université Catholique de Lille (LEM, UMR CNRS 8179), Expertise Center for Database Marketing (ECDM), Department of Marketing, 3 Rue de la Digue, F-59000, Lille, France;2. Hogeschool-Universiteit Brussel, Faculty of Economics and Management, Warmoesberg 26, B-1000 Brussels, Belgium;3. Katholieke Universiteit Leuven, Faculty of Business and Economics, Naamsestraat 69, B-3000 Leuven, Belgium |
| |
Abstract: | Companies greatly benefit from knowing how problems with data quality influence the performance of segmentation techniques and which techniques are more robust to these problems than others. This study investigates the influence of problems with data accuracy – an important dimension of data quality – on three prominent segmentation techniques for direct marketing: RFM (recency, frequency, and monetary value) analysis, logistic regression, and decision trees. For two real-life direct marketing data sets analyzed, the results demonstrate that (1) under optimal data accuracy, decision trees are preferred over RFM analysis and logistic regression; (2) the introduction of data accuracy problems deteriorates the performance of all three segmentation techniques; and (3) as data becomes less accurate, decision trees retain superior to logistic regression and RFM analysis. Overall, this study recommends the use of decision trees in the context of customer segmentation for direct marketing, even under the suspicion of data accuracy problems. |
| |
Keywords: | Customer segmentation Direct marketing Data quality Data accuracy RFM Decision trees |
本文献已被 ScienceDirect 等数据库收录! |
|