PERFORMANCE COMPARISON OF C4.5 AND K-NEAREST NEIGHBOR ALGORITHMS FOR MARKETPLACE SALES POTENTIAL ANALYSIS
DOI:
https://doi.org/10.69916/jkbti.v5i2.505Keywords:
C4.5, K-Nearest Neighbor, Marketplace, Classification, sales predictionAbstract
The rapid growth of digital marketplace platforms such as Shopee, Tokopedia, and Bukalapak has transformed online business competition and increased the importance of data-driven sales analysis. Marketplace data, including product price, ratings, reviews, sales volume, views, and seller location, contain valuable information that can be utilized to predict product market potential. However, the large volume, heterogeneous characteristics, and dynamic nature of marketplace data make manual analysis inefficient. Therefore, this study aims to analyze and compare the performance of the C4.5 and K-Nearest Neighbor (KNN) algorithms in classifying marketplace sales potential. The dataset used in this research was collected through data scraping from Shopee, Tokopedia, and Bukalapak using the BigSeller application in March 2022, consisting of 21,750 product records with numerical and categorical attributes. Data preprocessing was conducted using Orange Data Mining, including data cleaning, missing value handling, normalization, feature transformation, and dataset partitioning. The classification process categorized products into three market potential levels: low, medium, and high. Model performance was evaluated using a confusion matrix based on accuracy, precision, recall, and F1-score metrics. The experimental results demonstrate that the C4.5 algorithm outperformed KNN, achieving an accuracy of 0.86, while KNN obtained an accuracy of 0.70. Moreover, C4.5 showed higher precision, recall, and F1-score values, indicating better classification consistency and stability. The findings suggest that C4.5 is more effective for marketplace sales potential classification due to its ability to identify influential attributes and manage heterogeneous marketplace datasets. This study contributes to marketplace sales prediction and supports data-driven decision-making in e-commerce environments.
Downloads
References
Sya’roni, “Analisis Potensi Marketplace Terhadap Penjualan Menggunakan K-Nearest Neighbors,” JURNAL SISTEM DAN INFORMATIKA (JSI), vol. 19, no. 2, pp. 1–9, Oct. 2025, doi: https://doi.org/10.30864/jsi.v19i2.724.
M. Utami and V. Ayumi, “Prediksi Penjualan Produk Terlaris Menggunakan Algoritma K-Nearest Neighboard (KNN),” JCOSIS (Journal Computer Science and Information Syetem, vol. 1, no. 2, pp. 43–47, 2024, doi: https://doi.org/10.61567.
F. U. Aulya and K. Kusnawi, “Evaluating Classification Models for Predicting Product Success in Indonesian E-Commerce,” Jurnal Teknik Informatika (Jutif), vol. 6, no. 4, pp. 2723–2739, Aug. 2025, doi: 10.52436/1.jutif.2025.6.4.5071.
L. Nadhia Ningsih, R. Septiani, A. Pramadjaya, and S. Nuralisah, “Implementing the C4.5 Algorithm for Customer Satisfaction Classification,” MALCOM: Indonesian Journal of Machine Learning and Computer Science, vol. 5, no. 4, pp. 1470–1480, Sep. 2025, doi: 10.57152/malcom.v5i4.2211.
N. Ubaedilah, Puji Isyanto, and Asep Darojatul Romli, “Analisis Faktor-Faktor Yang Mempengaruhi Keputusan Pembelian Impulsif Pada Pengguna Tiktok Shop,” Journal of Trends Economics and Accounting Research, vol. 4, no. 1, pp. 46–56, Sep. 2023, doi: 10.47065/jtear.v4i1.875.
M. Yasir Alghifari et al., “COMPARISON OF SVM AND NAIVE BAYES ALGORITHMS IN SENTIMENT ANALYSIS OF USER REVIEWS ON BUKALAPAK,” JURNAL INOVTEK POLBENG -SERI INFORMATIKA, vol. 10, no. 3, pp. 1623–1633, Nov. 2025, doi: https://doi.org/10.35314/dqhpkb12.
E. Purwanto, B. P. Cipto Utomo, H. Permatasari, and F. Mohd, “Comparative Analysis of Classification Models for Sales Prediction in E-commerce: Decision Tree, Random Forest, SVM, Naive Bayes, and KNN,” Jurnal Teknik Informatika (Jutif), vol. 6, no. 6, pp. 5899–5915, Jan. 2026, doi: 10.52436/1.jutif.2025.6.6.5224.
A. Sonita and A. Lestari, “Implementasi Metode K-Nearest Neighbor Untuk Prediksi,” JSAI: Journal Scientific and Applied Informatics, vol. 7, no. 3, pp. 544–552, Nov. 2024, doi: 10.36085.
N. F. Octavia and Berlilana, “Penerapan Algoritma K-Nearest Neighbor untuk Analisis Sentimen Ulasan Produk Elektronik pada Platform E-Commerce,” Jurnal Algoritma, vol. 22, no. 2, pp. 2110–2121, Nov. 2025, doi: 10.33364/algoritma/v.22-2.3083.
M. Redinal Muktar, M. Rifqi Faidhil Syam, A. Kunda, M. Syahlan Natsir, and J. K. Sistem Informasi Universitas Dipa Makassar Jln Perintis Kemerdekaan, “Analisis Penerapan Data Mining dalam Klasifikasi Penjualan Pakaian Pada Toko Online Shopee Menggunakan Algoritma C4.5,” JURNAL DIPANEGARA KOMPUTER SISTEM INFORMASI, vol. 17, no. 1, pp. 1–8, Jun. 2023, doi: https://doi.org/10.36774/dipakomsi.v17i1.1388.
J. Nangi and R. Rinaldi Hadistio, “PENERAPAN DATA MINING UNTUK MEMPREDIKSI PENJUALAN MENGGUNAKAN METODE KNEAREST NEIGHBOR (STUDI KASUS: THRIFTING SECOND 3),” ANIMATOR, vol. 3, no. 3, pp. 1–5, Dec. 2025, Accessed: Mar. 03, 2026. [Online]. Available: https://animator.uho.ac.id/index.php/journal/article/view/1284/56
M. Danny, A. Muhidin, and A. Jamal, “Application of the K-Nearest Neighbor Machine Learning Algorithm to Preduct Sales of Best-Selling Products,” Brilliance: Research of Artificial Intelligence, vol. 4, no. 1, pp. 255–264, Jun. 2024, doi: 10.47709/brilliance.v4i1.4063.
A. Verma, C. Nagar, S. Haryani, and S. Jain, “Prediction of E-Commerce Shoppers’ Purchasing Intention using Knn Algorithm,” in Atlantis Press, 2025, pp. 63–72. doi: 10.2991/978-94-6463-716-8_6.
Z. Fatah and M. Syafiq, “Prediksi Penjualan Sepeda Motor Menerapkan Metode K-Nearest Neighbor,” JAMASTIKA, vol. 5, no. 1, pp. 333–342, Apr. 2026, doi: https://doi.org/10.35473/jamastika.v5i1.4701.
Roni S and C. Crysdian, “Jurnal Teknologi dan Manajemen Informatika Studi Literature Analisis Potensi Pasar Marketplace terhadap Penjualan Article Info ABSTRACT,” Jurnal Teknologi dan Manajemen Informatika (JTMI), vol. 8, no. 2, pp. 134–142, 2022, [Online]. Available: http://http://jurnal.unmer.ac.id/index.php/jtmi
F. Wijaya, R. Rahmansyah, and N. A. Hasibuan, “Prediksi Penjualan Barang Menggunakan Algoritma K-Nearest Neighbor,” Jurnal Sistem Cerdas dan Rekayasa (JSCR), vol. 7, no. 2, pp. 2656–7504, Oct. 2025, doi: https://doi.org/10.61293/jscr.v7i2.856.
Downloads
Published
Scite Metrics
Altmetric
How to Cite
Issue
Section
License
Copyright (c) 2026 Sya'roni

This work is licensed under a Creative Commons Attribution 4.0 International License.











