Implementation of K-Means Clustering in Mapping Driver Performance and Consistency Characteristics in the 2026 Formula 1 New Regulation Era

Authors

  • (*) Elsa Anggraini,  Universitas Bina Sarana Informatika

(*) Corresponding Author

DOI:

https://doi.org/10.21460/jutei.2026.101.472

Keywords:

clustering, K-Means, Formula 1 2026, Sports Analytics, FastF1, Pre-season Testing, Machine Learning

Abstract

The 2026 Formula 1 season introduces a radical regulatory transition, rendering historical performance data obsolete. This study addresses the "cold-start" problem in sports analytics by implementing the K-Means clustering algorithm to map competitive hierarchies during the 2026 Bahrain pre-season tests. The analysis is exclusively based on four key performance features: Fastest Lap, Average Lap Time, Standard Deviation (consistency), and Total Laps (reliability), extracted via the FastF1 API. A total of 3,624 telemetry data rows were processed and normalized using StandardScaler. The Elbow Method identified K=4 as the optimal cluster configuration. Although the Silhouette Coefficient of 0.350 reflects the inherent "noise" and "sandbagging" strategies of F1 testing, the model successfully differentiated four distinct performance tiers: Top-Tier Leaders, Stable Midfielders, Reliability-Focused Testers, and Technical Anomalies (Strugglers). The findings provide an objective, data-driven framework for interpreting competitive strength without relying on subjective media reports, proving that unsupervised learning can extract meaningful patterns from unlabelled telemetry data in highly volatile regulatory environments.

References

[1] FIA, “2026 Formula 1 Technical Regulations - Section C: Technical Regulations,” Issue 2, pp. 1–264, 2026.

[2] R. Joslin and G. Brinklow, “Open wheel competition car rear wing placement and underbody aerodynamic interactions : Aerodynamic design for a competitive advantage in motorsports,” 2025, doi: 10.1177/17543371251384445.

[3] M. Haghighat, H. Rastegari, and N. Nourafza, “A Review of Data Mining Techniques for Result Prediction in Sports,” vol. 2, no. 5, pp. 7–12, 2013.

[4] F. Hojaji, A. J. Toth, J. M. Joyce, and M. J. Campbell, “AI-enabled prediction of sim racing performance using telemetry data,” Comput. Hum. Behav. Reports, vol. 14, no. December 2023, p. 100414, 2024, doi: 10.1016/j.chbr.2024.100414.

[5] S. Zein and G. Gunawan, “Prediksi Hasil FIFA World Cup Qatar 2022 Menggunakan Machine Learning dengan Python,” J. Ris. Mat., pp. 153–162, 2022, doi: 10.29313/jrm.v2i2.1382.

[6] V. P. Saputra, U. Latifa, and Ibrahim, “Simulasi Detection Counter Pada Objek Kendaraan Motor Dan Mobil Menggunakan Metode Convolutional Neural Network Berbasis Python,” J. Ilm. Wahana Pendidik., vol. 9, no. 16, pp. 760–766, 2023, doi: 10.5281/zenodo.8265040.

[7] I. J. Informatika et al., “Penerapan metode K-Means Clustering untuk segmentasi performa pembalap F1 season 2024,” vol. 27, no. April, pp. 113–122, 2025, doi: 10.23969/infomatek.v27i1.24297.

[8] I. W. Angga, W. Kusuma, and R. L. Ellyana, “Penerapan Citra Terkompresi Pada Segmentasi Citra Menggunakan Algoritme K-MEANS,” pp. 65–74, doi: 10.21460/jutei.2018.21.65.

[9] B. E. Adiana, I. Soesanti, A. E. Permanasari, and J. G. No, “Analisis Segmentasi Pelanggan Menggunakan Kombinasi RFM Model dan Teknik Clustering,” no. 2, pp. 23–32, 2018, doi: 10.21460/jutei.2017.21.76.

[10] S. Raschka and V. Mirjalili, Python Machine Learning: Machine Learning and Deep Learning with Python, scikit-learn, and TensorFlow 2, 3rd ed. Birmingham: Packt Publishing, 2019.

[11] M. Gita Budiarti, N. Rahaningsih, and R. Danar Dana, “Analisis Cluster Data Daftar Kendaraan Bermotor Menggunakan Algoritma K-Means,” JATI (Jurnal Mhs. Tek. Inform.), vol. 7, no. 6, pp. 3286–3292, 2024, doi: 10.36040/jati.v7i6.8162.

[12] P. Putriana, N. Suarna, and W. Prihartono, “Analisis Clustering Prestasi Atlet Pada Berbagai Cabang Olahraga Menggunakan Algoritma K-Means,” JATI (Jurnal Mhs. Tek. Inform.), vol. 7, no. 6, pp. 3435–3442, 2024, doi: 10.36040/jati.v7i6.8211.

[13] J. MacQueen, “Some methods for classification and analysis of multivariate observations,” in Proceedings of the Fifth Berkeley Symposium on Mathematical Statistics and Probability, vol. 1, no. 14, 1967, pp. 281–297.

[14] Deti Karmanita and Billy Hendrik, “Penerapan Metode Clustering dengan Algoritma K-Means pada Pengelompokkan Peminatan Mata Kuliah,” J. Ilm. Dan Karya Mhs., vol. 1, no. 6, pp. 01–10, 2023, doi: 10.54066/jikma.v1i6.1028.

[15] M. Mustofa, “Penerapan Algoritma K-Means Clustering pada Karakter Permainan Multiplayer Online Battle Arena,” J. Inform., vol. 6, no. 2, pp. 246–254, 2019, doi: 10.31311/ji.v6i2.6096.

[16] T. M. Kodinariya, “Review on determining number of Cluster in K-Means Clustering,” vol. 1, no. 6, pp. 90–95, 2013.

[17] N. A. Maori and E. Evanita, “Metode Elbow dalam Optimasi Jumlah Cluster pada K-Means Clustering,” Simetris J. Tek. Mesin, Elektro dan Ilmu Komput., vol. 14, no. 2, pp. 277–288, 2023, doi: 10.24176/simet.v14i2.9630.

[18] N. A. Yolandari, L. E. Butarbutar, G. C. H. Rajagukguk, M. F. Zulfi, Arnita, and F. Ramadhani, “Analisis Perbandingan K-Means Dan Dbscan Dalam Pengelompokan Data Travel Review Ratings Menggunakan Evaluasi Silhouette Index Dan Davies-Bouldin Index,” J. Inform. dan Tek. Elektro Terap., vol. 13, no. 3, 2025, doi: 10.23960/jitet.v13i3.6884.

[19] I. Yati Beti and H. Juliansa, “KLIK: Kajian Ilmiah Informatika dan Komputer Penerapan Normalisasi Data Metode Decimal Scaling Dan Metode K-Means Dalam Mengelompokkan Kasus Demam Berdarah,” Media Online), vol. 4, no. 6, pp. 2928–2936, 2024, doi: 10.30865/klik.v4i6.1925.

[20] F. Pedregosa et al., “Scikit-learn: Machine Learning in Python,” Journal of Machine Learning Research, vol. 12, pp. 2825–2830, 2011.

[21] A. P. Margaretha, N. Ulinnuha, and P. K. Intan, “Clustering Data Kecelakaan Lalu Lintas melalui Algoritma K-Means dengan Seleksi Fitur Chi-Square,” INTEGER J. Inf. Technol., vol. 10, no. 2, pp. 215–224, 2025, doi: 10.31284/j.integer.0.v10i2.7529.

[22] A. P. Putra, J. Tshivana, and E. Rilvani, “Perbandingan Teoritis Dan Eksperimen Algoritma K-Means Dan K-Medoids Dalam Klasterisasi Data,” Kohesi: Jurnal Multidisiplin Saintek, vol. 10, no. 2, 2025.

[23] C. A. da S. Barreto, J. C. Xavier-Júnior, A. M. P. Canuto, and I. M. D. Da Silva, “A Machine Learning Approach Based on Automotive Engine Data Clustering for Driver Usage Profiling Classification,” pp. 174–185, 2019, doi: 10.5753/eniac.2018.4414.

[24] J. M. Guntur, “Algoritma K-Means untuk Meningkatkan Silhouette Score pada Pengelompokan Data Stok Bahan Manufaktur di PT. XYZ Kabupaten Majalengka,” Multinetics, vol. 11, no. 1, pp. 11–21, 2025, doi: 10.32722/multinetics.v11i1.7259.

[25] Z. Muttaqin, D. Fernando, and S. Sulastriani, “Implementasi Unsupervised Learning Pada Nilai Jasmani Kesamaptaan Sekolah Polisi Negara,” vol. 10, no. 1, 2023.

[26] B. Juliartha, M. Putra, D. Ariani, and F. Yuniarti, “Analisis hasil belajar mahasiswa dengan clustering menggunakan metode K-Means,” vol. 12, no. 2, pp. 49–58, 2020.

[27] M. Ester, H. Kriegel, X. Xu, and D.- Miinchen, “A Density-Based Algorithm for Discovering Clusters in Large Spatial Databases with Noise,” 1996.

Downloads

Published

2026-04-30

How to Cite

[1]
Elsa Anggraini, “Implementation of K-Means Clustering in Mapping Driver Performance and Consistency Characteristics in the 2026 Formula 1 New Regulation Era”, JUTEI, vol. 10, no. 1, pp. 9–18, Apr. 2026.

Similar Articles

1 2 3 4 5 6 7 > >> 

You may also start an advanced similarity search for this article.