Implementation of K-Means Clustering in Mapping Driver Performance and Consistency Characteristics in the 2026 Formula 1 New Regulation Era
DOI:
https://doi.org/10.21460/jutei.2026.101.472Keywords:
clustering, K-Means, Formula 1 2026, Sports Analytics, FastF1, Pre-season Testing, Machine LearningAbstract
The 2026 Formula 1 season introduces a radical regulatory transition, rendering historical performance data obsolete. This study addresses the "cold-start" problem in sports analytics by implementing the K-Means clustering algorithm to map competitive hierarchies during the 2026 Bahrain pre-season tests. The analysis is exclusively based on four key performance features: Fastest Lap, Average Lap Time, Standard Deviation (consistency), and Total Laps (reliability), extracted via the FastF1 API. A total of 3,624 telemetry data rows were processed and normalized using StandardScaler. The Elbow Method identified K=4 as the optimal cluster configuration. Although the Silhouette Coefficient of 0.350 reflects the inherent "noise" and "sandbagging" strategies of F1 testing, the model successfully differentiated four distinct performance tiers: Top-Tier Leaders, Stable Midfielders, Reliability-Focused Testers, and Technical Anomalies (Strugglers). The findings provide an objective, data-driven framework for interpreting competitive strength without relying on subjective media reports, proving that unsupervised learning can extract meaningful patterns from unlabelled telemetry data in highly volatile regulatory environments.
References
[1] FIA, “2026 Formula 1 Technical Regulations - Section C: Technical Regulations,” Issue 2, pp. 1–264, 2026.
[2] R. Joslin and G. Brinklow, “Open wheel competition car rear wing placement and underbody aerodynamic interactions : Aerodynamic design for a competitive advantage in motorsports,” 2025, doi: 10.1177/17543371251384445.
[3] M. Haghighat, H. Rastegari, and N. Nourafza, “A Review of Data Mining Techniques for Result Prediction in Sports,” vol. 2, no. 5, pp. 7–12, 2013.
[4] F. Hojaji, A. J. Toth, J. M. Joyce, and M. J. Campbell, “AI-enabled prediction of sim racing performance using telemetry data,” Comput. Hum. Behav. Reports, vol. 14, no. December 2023, p. 100414, 2024, doi: 10.1016/j.chbr.2024.100414.
[5] S. Zein and G. Gunawan, “Prediksi Hasil FIFA World Cup Qatar 2022 Menggunakan Machine Learning dengan Python,” J. Ris. Mat., pp. 153–162, 2022, doi: 10.29313/jrm.v2i2.1382.
[6] V. P. Saputra, U. Latifa, and Ibrahim, “Simulasi Detection Counter Pada Objek Kendaraan Motor Dan Mobil Menggunakan Metode Convolutional Neural Network Berbasis Python,” J. Ilm. Wahana Pendidik., vol. 9, no. 16, pp. 760–766, 2023, doi: 10.5281/zenodo.8265040.
[7] I. J. Informatika et al., “Penerapan metode K-Means Clustering untuk segmentasi performa pembalap F1 season 2024,” vol. 27, no. April, pp. 113–122, 2025, doi: 10.23969/infomatek.v27i1.24297.
[8] I. W. Angga, W. Kusuma, and R. L. Ellyana, “Penerapan Citra Terkompresi Pada Segmentasi Citra Menggunakan Algoritme K-MEANS,” pp. 65–74, doi: 10.21460/jutei.2018.21.65.
[9] B. E. Adiana, I. Soesanti, A. E. Permanasari, and J. G. No, “Analisis Segmentasi Pelanggan Menggunakan Kombinasi RFM Model dan Teknik Clustering,” no. 2, pp. 23–32, 2018, doi: 10.21460/jutei.2017.21.76.
[10] S. Raschka and V. Mirjalili, Python Machine Learning: Machine Learning and Deep Learning with Python, scikit-learn, and TensorFlow 2, 3rd ed. Birmingham: Packt Publishing, 2019.
[11] M. Gita Budiarti, N. Rahaningsih, and R. Danar Dana, “Analisis Cluster Data Daftar Kendaraan Bermotor Menggunakan Algoritma K-Means,” JATI (Jurnal Mhs. Tek. Inform.), vol. 7, no. 6, pp. 3286–3292, 2024, doi: 10.36040/jati.v7i6.8162.
[12] P. Putriana, N. Suarna, and W. Prihartono, “Analisis Clustering Prestasi Atlet Pada Berbagai Cabang Olahraga Menggunakan Algoritma K-Means,” JATI (Jurnal Mhs. Tek. Inform.), vol. 7, no. 6, pp. 3435–3442, 2024, doi: 10.36040/jati.v7i6.8211.
[13] J. MacQueen, “Some methods for classification and analysis of multivariate observations,” in Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, vol. 1, no. 14, 1967, pp. 281–297.
[14] Deti Karmanita and Billy Hendrik, “Penerapan Metode Clustering dengan Algoritma K-Means pada Pengelompokkan Peminatan Mata Kuliah,” J. Ilm. Dan Karya Mhs., vol. 1, no. 6, pp. 01–10, 2023, doi: 10.54066/jikma.v1i6.1028.
[15] M. Mustofa, “Penerapan Algoritma K-Means Clustering pada Karakter Permainan Multiplayer Online Battle Arena,” J. Inform., vol. 6, no. 2, pp. 246–254, 2019, doi: 10.31311/ji.v6i2.6096.
[16] T. M. Kodinariya, “Review on determining number of Cluster in K-Means Clustering,” vol. 1, no. 6, pp. 90–95, 2013.
[17] N. A. Maori and E. Evanita, “Metode Elbow dalam Optimasi Jumlah Cluster pada K-Means Clustering,” Simetris J. Tek. Mesin, Elektro dan Ilmu Komput., vol. 14, no. 2, pp. 277–288, 2023, doi: 10.24176/simet.v14i2.9630.
[18] N. A. Yolandari, L. E. Butarbutar, G. C. H. Rajagukguk, M. F. Zulfi, Arnita, and F. Ramadhani, “Analisis Perbandingan K-Means Dan Dbscan Dalam Pengelompokan Data Travel Review Ratings Menggunakan Evaluasi Silhouette Index Dan Davies-Bouldin Index,” J. Inform. dan Tek. Elektro Terap., vol. 13, no. 3, 2025, doi: 10.23960/jitet.v13i3.6884.
[19] I. Yati Beti and H. Juliansa, “KLIK: Kajian Ilmiah Informatika dan Komputer Penerapan Normalisasi Data Metode Decimal Scaling Dan Metode K-Means Dalam Mengelompokkan Kasus Demam Berdarah,” Media Online), vol. 4, no. 6, pp. 2928–2936, 2024, doi: 10.30865/klik.v4i6.1925.
[20] F. Pedregosa et al., “Scikit-learn: Machine Learning in Python,” Journal of Machine Learning Research, vol. 12, pp. 2825–2830, 2011.
[21] A. P. Margaretha, N. Ulinnuha, and P. K. Intan, “Clustering Data Kecelakaan Lalu Lintas melalui Algoritma K-Means dengan Seleksi Fitur Chi-Square,” INTEGER J. Inf. Technol., vol. 10, no. 2, pp. 215–224, 2025, doi: 10.31284/j.integer.0.v10i2.7529.
[22] A. P. Putra, J. Tshivana, and E. Rilvani, “Perbandingan Teoritis Dan Eksperimen Algoritma K-Means Dan K-Medoids Dalam Klasterisasi Data,” Kohesi: Jurnal Multidisiplin Saintek, vol. 10, no. 2, 2025.
[23] C. A. da S. Barreto, J. C. Xavier-Júnior, A. M. P. Canuto, and I. M. D. Da Silva, “A Machine Learning Approach Based on Automotive Engine Data Clustering for Driver Usage Profiling Classification,” pp. 174–185, 2019, doi: 10.5753/eniac.2018.4414.
[24] J. M. Guntur, “Algoritma K-Means untuk Meningkatkan Silhouette Score pada Pengelompokan Data Stok Bahan Manufaktur di PT. XYZ Kabupaten Majalengka,” Multinetics, vol. 11, no. 1, pp. 11–21, 2025, doi: 10.32722/multinetics.v11i1.7259.
[25] Z. Muttaqin, D. Fernando, and S. Sulastriani, “Implementasi Unsupervised Learning Pada Nilai Jasmani Kesamaptaan Sekolah Polisi Negara,” vol. 10, no. 1, 2023.
[26] B. Juliartha, M. Putra, D. Ariani, and F. Yuniarti, “Analisis hasil belajar mahasiswa dengan clustering menggunakan metode K-Means,” vol. 12, no. 2, pp. 49–58, 2020.
[27] M. Ester, H. Kriegel, X. Xu, and D.- Miinchen, “A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise,” 1996.
Downloads
Published
How to Cite
Issue
Section
License
Copyright (c) 2026 Elsa Anggraini

This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivatives 4.0 International License.
Authors who publish articles in JUTEI agree on the following rules:
1. The author grants non exclusive royalty free rights, and is willing to publish articles online and complete (full access). With such rights JUTEI reserves the right to save, transfers, manages in various forms, maintains and publishes articles while keeping the author's name as the copyright owner.
2. Each author contained in the article has contributed fully to the substance and intellectual, and is accountable to the public. If in the future there is a copyright infringement notification then this will be responsibility of the author, not JUTEI.

