ANALISIS SIMULASI PREDIKSI CUSTOMER CHURN E-COMMERCE MENGGUNAKAN ALGORITMA RANDOM FOREST BERBASIS DATA SINTETIS

Authors

  • Rayhan Bagoes Santoso Universitas Amikom Yogyakarta
  • Bety Wulan Sari Universitas Amikom Yogyakarta

DOI:

https://doi.org/10.35794/jmbi.v13i1.67545

Abstract

Tingkat customer churn yang tinggi menjadi tantangan kritis bagi industri e-commerce di Indonesia dengan potensi kerugian mencapai miliaran rupiah per tahun. Penelitian ini bertujuan untuk mengimplementasikan sistem prediksi churn pelanggan sebagai proof-of-conceptmenggunakan algoritma machine learning. Mengingat keterbatasan akses data privat e-commerce, penelitian ini menggunakan pendekatan metodologis dengan dataset sintetis yang terdiri dari 1000 data pelanggan  dan 9 fitur utama meliputi tenure, monthly spending, total transactions, support tickets, dan last purchase days. Tiga algoritma machine learning diimplementasikan yaitu Logistic Regression, Decision Tree, dan Random Forest untuk melakukan klasifikasi prediksi churn. Hasil penelitian menunjukkan bahwa Random Forest memberikan performa yang paling stabil dengan akurasi sebesar 87.5%, precision 86%, recall 87%, dan F1-score 86%. Penurunan performa dibandingkan model deterministik menunjukkan bahwa model diuji pada kondisi data yang lebih realistis dan tidak mengalami overfitting terhadap aturan generatif.

References

Ahmad, A. K., Jafar, A., & Aljoumaa, K. (2019). Customer churn prediction in telecom using machine learning in big data platform. Journal of Big Data, 6(1), 1–24.

Breiman, L. (2001). Random forests. Machine Learning, 45(1), 5–32.

Chen, T., & Guestrin, C. (2016). XGBoost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (pp. 785–794).

De Caigny, A., Coussement, K., & De Bock, K. W. (2020). A new hybrid classification algorithm for customer churn prediction based on logistic regression and decision trees. European Journal of Operational Research, 269(2), 760–772.

García, S., Luengo, J., & Herrera, F. (2020). Data preprocessing in data mining. Springer.

Géron, A. (2022). Hands-on machine learning with Scikit-Learn, Keras, and TensorFlow (3rd ed.). O’Reilly Media.

Kumar, R., & Sharma, P. (2022). A comprehensive study on customer churn prediction in e-commerce using ML techniques. International Journal of Information Technology, 14(5), 2567–2580.

Lalwani, P., Mishra, M. K., Chadha, J. S., & Sethi, P. (2022). Customer churn prediction system: A machine learning approach. Computing, 104(2), 271–294.

Lazarov, S., & Capota, M. (2023). Churn prediction in e-commerce using machine learning and ensemble methods. IEEE Access, 11, 45789–45801.

Óskarsdóttir, M., Bravo, C., Verbeke, W., Sarraute, C., Baesens, B., & Vanthienen, J. (2021). Social network analytics for churn prediction in telco: Model building, evaluation and network architecture. Expert Systems with Applications, 184, Article 115508.

Provost, F., & Fawcett, T. (2023). Data science for business: What you need to know about data mining and data-analytic thinking. O’Reilly Media.

Saghir, M., Bibi, Z., Bashir, S., & Khan, F. H. (2019). Churn prediction using neural network based individual and ensemble models.

Striuk, V., & Ternov, O. (2021). Customer churn prediction for e-commerce using machine learning algorithms.

Verbraken, W., Bravo, C., Weber, R., & Baesens, B. (2014). Development and application of consumer credit scoring models using profit-based classification measures. European Journal of Operational Research, 238(2), 505–513.

Zhang, Y., & Qi, Y. (2020). Customer churn prediction in e-commerce based on deep learning.

Downloads

Published

2026-03-30

How to Cite

Rayhan Bagoes Santoso, & Bety Wulan Sari. (2026). ANALISIS SIMULASI PREDIKSI CUSTOMER CHURN E-COMMERCE MENGGUNAKAN ALGORITMA RANDOM FOREST BERBASIS DATA SINTETIS. JMBI UNSRAT (Jurnal Ilmiah Manajemen Bisnis Dan Inovasi Universitas Sam Ratulangi)., 13(1), 317–331. https://doi.org/10.35794/jmbi.v13i1.67545