References: [1] Khan, A. A., Jamwal, S., Sepehri, M. M. (2012). Applying Data Mining to Customer Churn Prediction in an Internet ServiceProvider, Int. J. Comput. Appl., 2010. [2] Adebiyi, S. O., Oyatoye, E. O., Amole, B. B. (2016). Relevant Drivers for Customers‘ Churn and Retention Decision in the Nigerian Mobile Telecommunication Industry, J. Compet., 2016. [3] Umayaparvathi, V., Iyakutti, K. (2016). A Survey on Customer Churn Prediction in Telecom Industry: Datasets, Methods and Metrics, Int. Res. J. Eng. Technol., p 2395–56, 2016. [4] Dalvi, P. K., Khandge, S. K., Deomore, A., Bankar, A., Kanade, V. A. (2016). Analysis of Customer Churn Prediction in Telecom Industry using Decision Trees and Logistic Regression, Symp. Colossal Data Anal. Netw., 2016. [5] Sonak, A., Patankar, R. A. (2015). A Survey on Methods to Handle Imbalance Dataset., 4, (11), p 338–343. [6] Bekkar, M., Djemaa, H. K., Alitouche, T. A. (2013). Evaluation measures for models assessment over imbalanced data sets, J. Inf. Eng. Appl., 2013. [7] Breiman, L. (2001). Random forests, Mach. Learn., 2001. [8] Esteves, G., and Mendes-Moreira, J. (2016). Churn perdiction in the telecom business, in 2016 11th International Conference on Digital Information Management, ICDIM 2016, 2016. [9] Wu, Z., Lin, W., Zhang, Z., Wen, A., Lin, L. (2017). An Ensemble Random Forest Algorithm for Insurance Big Data Analysis, in Proceedings - 2017 IEEE International Conference on Computational Science and Engineering and IEEE/IFIP International Conference on Embedded and Ubiquitous Computing, CSE and EUC 2017. [10] Khalilia, M., Chakraborty, S., Popescu, M. (2011). Predicting disease risks from highly imbalanced data using random forest,BMC Med. Inform. Decis. Mak., 2011. [11] Effendy, V., Baizal, Z. K. a. (2014). Handling imbalanced data in customer churn prediction using combined sampling and weighted random forest, 2014 2nd Int. Conf. Inf. Commun. Technol., 2014. [12] Dwiyanti, E., Adiwijaya, Ardiyanti, A. (2017). Handling imbalanced data in churn prediction using RUSBoost and feature selection (Case study: PT. Telekomunikasi Indonesia regional 7), in Advances in Intelligent Systems and Computing, 2017. [13] Kobyli Dski, A., Przepiórkowski, A. (2008). Definition extraction with balanced random forests, in Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 2008. [14] Singh, S., Gupta. (2014). Comparative study ID3, cart and C4 . 5 Decision tree algorithm: a survey, Int. J. Adv. Inf. Sci. Technol., 2014. [15] Chen, C., Liaw, A., Breiman, L. (2004). Using random forest to learn imbalanced data, Univ. California, Berkeley, 2004. [16] Ghosh, S., Kumar, S. (2013). Comparative Analysis of K-Means and Fuzzy C-Means Algorithms, Int. J. Adv. Comput. Sci. Appl.. [17] Oyelade, O. J., Oladipupo, O. O., Obagbuwa, I. C. (2010). Application of k Means Clustering algorithm for prediction of Students Academic Performance, Int. J. Comput. Sci. Inf. Secur., 2010. [18] Weng, C. G., Poon, J. (2008). A new evaluation measure for imbalanced datasets, Conf. Res. Pract. Inf. Technol. Ser., 2008. [19] Fawcett, T. (2006). An introduction to ROC analysis, Pattern Recognit. Lett., 2006. [20] Kotsiantis, S. B., Kanellopoulos, D., Pintelas, P. E. (2006). Data preprocessing for supervised learning, Int. J. Comput. Sci., 2006. |