ISSN 1000-1239 CN 11-1777/TP

• 论文 •

### VOTCL及其在交叉销售问题上的应用研究

1. (山东大学计算机科学与技术学院 济南 250101) (zhouguangtong@gmail.com)
• 出版日期: 2010-09-15

### VOTCL and the Study of Its Application on Cross-Selling Problems

Zhou Guangtong, Yin Yilong, Guo Xinjian, and Dong Cailing

1. (School of Computer Science and Technology, Shandong University, Jinan 250101)
• Online: 2010-09-15

Abstract: Cross-selling is regarded as one of the most promising strategies to make profits. The authors first describe a typical cross-selling model, followed by analysis showing that class-imbalance and cost-sensitivity usually co-exist in the data sets collected from this domain. In fact, the central issue in real-world cross-selling applications focuses on the identification of potential cross-selling customers. However, the performance of customer prediction suffers from the problem that class-imbalance and cost-sensitivity are arising simultaneously. To address this problem, an effective method called VOTCL is proposed. In the first stage, VOTCL generates a number of balanced training data sets by combining under-sampling and over-sampling techniques; then a base learner is trained on each of the data set in the second stage; finally, VOTCL obtains the final decision-making model by using an optimal threshold based voting scheme. The effectiveness of VOTCL is validated on the cross-selling data set provided by PAKDD 2007 competition where an AUC value of 0.6037 is achieved by using the proposed method. The ensemble model also outperforms a single base learner, which to some extent shows the efficacy of the proposed optimal threshold based voting scheme.