Abstract
Ranking sports teams generally relies on supervised techniques, requiring either prior knowledge or arbitrary metrics. In this paper, we offer a purely unsupervised technique. We apply this to operational decision-making, specifically, the controversial European Super League for associa-tion football, demonstrating how this approach can select dominant teams to form the new league. We first use random forest regression to select important variables predicting goal difference, which we use to calculate the Euclidian distances between teams. Creating a Laplacian eigenmap, we bisect the Fiedler vector to identify the natural clusters in five major European football leagues. Our results show how an unsupervised approach could identify four clusters based on five basic performance metrics: shots, shots on target, shots conceded, possession, and pass success. The top two clusters identify teams that dominate their respective leagues and are the best candidates to create the most competitive elite super league.
More Information
Identification Number: | https://doi.org/10.3390/math11030720 |
---|---|
Status: | Published |
Refereed: | Yes |
Publisher: | MDPI AG |
Additional Information: | Copyright: © 2023 by the authors. |
Depositing User (symplectic) | Deposited by Bento, Thalita |
Date Deposited: | 31 Jan 2023 14:16 |
Last Modified: | 19 Jul 2024 13:35 |
Item Type: | Article |