Soccer Team Vectors
Reference
- Müller, R., Langer, S., Ritz, F., Roch, C., Illium, S., and Linnhoff-Popien, C. 2020. Soccer Team Vectors. Machine Learning and Knowledge Discovery in Databases: International Workshops of ECML PKDD 2019, Würzburg, Germany, September 16–20, 2019, Proceedings, Part II, Springer International Publishing, 247–257.
This research introduces STEVE (Soccer Team Vectors), a novel methodology for learning meaningful, real-valued vector representations (embeddings) for professional soccer teams. The primary goal is to capture intrinsic team characteristics and relationships within a continuous vector space, such that teams with similar playing styles, strengths, or performance levels are positioned closely together.
Leveraging widely available public data from soccer matches (e.g., results, possibly performance statistics), STEVE employs machine learning techniques to generate these low-dimensional team vectors.
The utility of these learned representations is demonstrated through several downstream applications:
- Team Market Value Estimation: The vectors serve as effective features for predicting team market values, outperforming baseline models.
- Similarity Search: The vector space allows for efficient identification of teams similar to a given query team based on proximity.
- Team Ranking: The embeddings provide a basis for generating data-driven team rankings.
Across these application domains, STEVE demonstrated superior performance compared to competing approaches evaluated in the study. This work provides a valuable tool for quantitative analysis in sports analytics, enabling various machine learning tasks related to team comparison and prediction. For a comprehensive description of the methodology and results, please refer to the publication by [Müller et al. 2020].