Cluster Analysis of BPJS Kesehatan Claim Data in Madiun City to Identify High Claim Patterns and Fraud Indications

Muhammad Qolbi Shobri, Putri Balqis Al-Kubro, Gabriella Vindy Kawuri

Abstract


The increasing number of BPJS Kesehatan (Health Social Security) service claims in Madiun City poses significant challenges to financing efficiency and raises concerns about potential irregular or fraudulent claims. This study aims to identify high-claim patterns and detect indications of fraud using a data mining approach through the K-Means and Hierarchical Clustering methods. The research employed secondary data consisting of 309 hospital claim records from Madiun City in 2025. The primary variables were the number of claims and total claim costs, supported by additional variables such as age, gender, occupation, type of service, and disease diagnosis. Data analysis involved three main stages: preprocessing, clustering, and cluster quality evaluation using the Silhouette Score, Davies-Bouldin Index, and Calinski-Harabasz Index. Ths Study further compared the performance of both clustering methods, revealing that K-Means achieved superior validity scores across major evaluation metrics. The K-Means method produced the best performance, with a Silhouette Score of 0.617 and a Calinski-Harabasz Index of 419.581, reflecting well-separated and compact cluster structures. Three main clusters were identified-low, medium, and high. The high-claim cluster consisted of participants aged 55 years and above, with a claim frequencies of 2 to 7 claims and total claim costs exceeding IDR 20 million. This cluster was dominated by retirees, housewives, and private-sector employees utilizing inpatient services. Although categorized as a high-risk group, verification results revealed no signs of fraud but rather complex medical needs. These findings suggest that integrating clustering analysis into BPJS Kesehatan’s claim monitoring system can support early anomaly detection and enhance both financing efficiency and claim management integrity.

Keywords


BPJS Kesehatan (Health Social Security); K-Means; Hierarchical Clustering; High Claim; Potential Fraud

Full Text:

PDF

References


OJK, Statistik Jaminan Sosial Indonesia 2022. Jakarta: Direktorat Statistik dan Informasi IKNB, 2023.

Indonesia AIDS Coalition, Buku Panduan Jaminan Kesehatan Nasional (JKN) bagi Populasi Kunci, 2016. [Online]. Available: https://siha.kemkes.go.id/portal/files. [Accessed: 2025].

Jatimpos, “Sepanjang Tahun 2023 BPJS Kesehatan Kantor Cabang Madiun Keluarkan Biaya Kesehatan Rp1,5 Triliun,” Jatimpos.co, 2024. [Online]. Available: https://www.tempo.co/ekonomi. [Accessed: 2025].

M. Q. Shobri, R. A. Andyani, and M. Jeksen, “Regresi logistik Bayesian dan algoritma C4.5 dalam klasifikasi risiko penggunaan BPJS Kesehatan Kota Madiun,” Lebesgue: Jurnal Ilmiah Pendidikan Matematika, Matematika dan Statistika, vol. 5, no. 3, pp. 2064–2078, 2024. doi: 10.46306/lb.v5i3.814.

Tempo, “Dirut Sebut BPJS Kesehatan Alami Defisit Sekitar Rp20 Triliun Tahun Ini,” Tempo.co, Nov. 11, 2024. [Online]. Available: https://www.tempo.co/ekonomi. [Accessed: 2025].

V. Singgih, “BPJS Kesehatan Terancam Tekor Rp20 Triliun dan Gagal Bayar Klaim, Kenaikan Iuran Jadi ‘Keniscayaan’,” BBC News Indonesia, Nov. 15, 2024. [Online]. Available: https://www.bbc.com/indonesia. [Accessed: 2025].

CNN Indonesia, “KPK: Kerugian dari Fraud di Bidang Kesehatan Sekitar Rp20 Triliun,” CNN Indonesia, Sep. 20, 2024. [Online]. Available: https://www.cnnindonesia.com/nasional/20240920023933. [Accessed: 2025].

S. Tito, J. Julius, and K. N. Siregar, “Faktor pemicu dan penghambat fraud dalam program Jaminan Kesehatan Nasional dan strategi pencegahannya: Sebuah scoping review,” Jurnal Ekonomi Kesehatan Indonesia, vol. 9, no. 2, Art. 5, 2024, doi: 10.7454/eki.v9i2.1124.

J. Han, M. Kamber, and J. Pei, Data Mining: Concepts and Techniques, 3rd ed. USA: Morgan Kaufmann, 2012.

J. W. G. Putra, “Pengenalan konsep pembelajaran mesin dan deep learning,” Aug. 17, 2020. [Online]. Available: https://wiragotama.github.io/. [Accessed: 2025].

K. S. Pranata, A. A. S. Gunawan, and F. L. Gaol, “Development clustering system IDX company with K-means algorithm and DBSCAN based on fundamental indicator and ESG,” Procedia Computer Science, vol. 216, pp. 319–327, 2023, doi: 10.1016/j.procs.2022.12.142.

E. M. S. Rochman, Miswanto, and H. Suprajitno, “Comparison of clustering in tuberculosis using fuzzy C-means and K-means methods,” Communications in Mathematical Biology and Neuroscience, 2022, Art. ID 41, doi: 10.28919/cmbn/7335.

M. S. Hasibuan, A. H. Lubis, and M. N. Sari, “Perbandingan algoritma clustering DBSCAN dan K-Means dalam pengelompokan siswa terbaik,” INFOTECH: Jurnal Informatika & Teknologi, vol. 5, no. 2, pp. 301–309, 2024, doi: 10.37373/infotech.v5i2.1457.

I. Surairoh, A. C. Rani, K. Amalia, and D. Rolliawati, “Perbandingan hasil analisis clustering metode K-Means, DBSCAN, dan hierarchical pada data marketplace electronic phone,” Joins: Journal Information System, vol. 8, no. 1, pp. 95–105, 2023, doi: 10.33633/joins.v8i1.8016.

F. D. Wahyuningtyas, A. Arafat, A. Stiawan, and D. Rolliawati, “Komparasi algoritma hierarchical, K-Means, dan DBSCAN pada analisis data penjualan melalui Facebook,” EXPLORE: Jurnal Sistem Informasi dan Telematika, vol. 14, no. 1, pp. 7–16, 2023, doi: 10.36448/jsit.v14i1.2931.

A. Y. B. R. Thaifur, “Exploratory study of factors influencing fraud in the national health service in Buton Islands from a hexagon model perspective,” Healthcare in Low-resource Setting, vol. 13, no. 1, pp. 32–35, 2025, doi: 10.4081/hls.2024.12773.

N. Sariunita and R. A. Syakurah, “Analisis kejadian upcoding biaya pelayanan kesehatan di wilayah kerja BPJS Kesehatan Cabang Depok,” BIGES JUKES, vol. 14, no. 2, pp. 1–6, 2023, doi: 10.35907/bgjk.v14i2.220.

K. N. Aprilia and R. Nurhayati, “Analisis kompetensi auditor internal terhadap kemampuan mencegah dan mendeteksi fraud dalam program Jaminan Kesehatan Nasional (Studi Kasus di Rumah Sakit Bethesda Yogyakarta),” ABIS: Accounting and Business Information Systems Journal, vol. 9, no. 2, pp. 227–246, 2021, doi: 10.22146/abis.v9i2.65895.

R. A. Andyani, M. Q. Shobri, M. Baihaqi, P. B. Al-Kubro, and M. S. Adhantoro, “Aplikasi metode Ward dengan berbagai pengukuran jarak (studi kasus: klasifikasi tingkat perekonomian),” JIKM: Jurnal Ilmiah Kampus Mengajar, vol. 4, no. 2, pp. 177–190, 2024. doi: 10.56972/jikm.v4i2.208.

D. A. T. Devanta, “Optimization of the K-Means clustering algorithm using Davies–Bouldin index in iris data classification,” KLIK: Kajian Ilmiah Informatika dan Komputer, vol. 4, no. 1, pp. 545–552, 2023, doi: 10.30865/klik.v4i1.964.

L. A. Sari, A. R. Hakim, and A. Rusgiyono, “Penggunaan indeks Calinski–Harabasz pada clustering K-Medoids algorithm untuk penggolongan kabupaten/kota di Provinsi Jawa Tengah berdasarkan karakteristik penduduk miskin,” Jurnal Gaussian, vol. 14, no. 1, pp. 179–187, 2025, doi: 10.14710/j.gauss.14.1.179-187.




DOI: https://doi.org/10.37905/euler.v13i3.35013

Refbacks

  • There are currently no refbacks.


Copyright (c) 2025 Muhammad Qolbi Shobri, Putri Balqis Al-Kubro, Gabriella Vindy Kawuri

Creative Commons License
This work is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.


Euler : Jurnal Ilmiah Matematika, Sains dan Teknologi has been indexed by:


 EDITORIAL OFFICE OF EULER : JURNAL ILMIAH MATEMATIKA, SAINS, DAN TEKNOLOGI

 Department of Mathematics, Faculty of Mathematics and Natural Science, Universitas Negeri Gorontalo
Jl. Prof. Dr. Ing. B. J. Habibie, Tilongkabila, Kabupaten Bone Bolango 96554, Gorontalo, Indonesia
 Email: [email protected]
 +6287777-586462 (WhatsApp Only)
 Euler : Jurnal Ilmiah Matematika, Sains dan Teknologi (p-ISSN: 2087-9393 | e-ISSN:2776-3706) by Department of Mathematics Universitas Negeri Gorontalo is licensed under a Creative Commons Attribution-NonCommercial 4.0 International License.  Powered by Public Knowledge Project OJS.