Abstract
This paper presents a model that predicts the results (Home Win/Draw/Away Win) of the football matches played in the Turkish Super League, by using three machine learning classification methods, which are Support Vector Machines (SVMs), Bagging with REP Tree (BREP), and Random Forest (RF). The dataset used in this study includes the data of 70 features, which are composed of 69 input variables relating to statistical data of home and away teams, and a target variable in 1222 total football games in a 4-year period between 2009 and 2013. In connection, the most effective features of these input variables were determined as a reduced dataset by using some feature selection methods. The results showed that the match outcomes were predicted using the reduced dataset better than using the original dataset and the RF classifier produced the best result.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Stekler, H.O., Sendor, D., Verlander, R.: Issues in sports forecasting. Int. J. Forecast. 26, 606–621 (2010)
Delen, D., Cogdell, D., Kasap, N.: A comparative analysis of data mining methods in predicting NCAA bowl outcomes. Int. J. Forecast. 28, 543–552 (2012)
Baker, R.D., McHale, I.G.: Forecasting exact scores in National Football League games. Int. J. Forecast. 29, 122–130 (2013)
Hvattum, L.M., Arntzen, H.: Using ELO ratings for match result prediction in association football. Int. J. Forecast. 26, 460–470 (2010)
Constantinou, A.C., Fenton, N.E., Neil, M.: pi-footbal: A Bayesian network model for forecasting Association Football match outcomes. Knowl.-Based Syst. 36, 322–339 (2012)
Baio, G., Blangiardo, M.: Bayesian hierarchical model for the prediction of football results. J. Appl. Stat. 37(2), 253–264 (2010)
Joseph, A., Fenton, N., Neil, M.: Predicting football results using Bayesian nets and other machine learning techniques. Knowl.-Based Syst. 7, 544–553 (2006)
Tsakonas, A., Dounias, G., Shtovba, S., Vivdyuk, V.: Soft computing-based result prediction of football games. In: The First International Conference on Inductive Modelling (ICIM’2002). Lviv, Ukraine (2002)
Dixon, M., Coles, S.: Modelling association football scores and inefficienies in the football betting market. Appl. Stat. 46, 265–280 (1997)
Goddard, J.: Regression models for forecasting goals and match results in association football. Int. J. Forecast. 21, 331–340 (2005)
Han, J., Kamber, M.: Data mining: concepts and techniques. Morgan Kauffmann Publishers, San Francisco (2001)
Dash, M., Liu, H.: Consistency-based search in feature selection. Artif. Intell. 151, 155–176 (2003)
Machine Learning Group at University of Waikato. www.cs.waikato.ac.nz/ml/weka/ (2015). Accessed 09 June 2015
Hall, M.A.: Correlation-based feature selection for machine learning. Ph.D. thesis, Department of Computer Science, University of Waikato, Hamilton, New Zeland (1998)
Kohavi, R., John, G.: Wrappers for feature subset selection. Artif. Intell. 97, 273–324 (1997)
Hall, M.A., Holmes, G.: Benchmarking attribute selection techniques for discrete class data mining. IEEE Trans. Knowl. Data Eng. 15, 1437–1447 (2003)
El-Sebakhy, E.: Constrained estimation functional networks for statistical pattern recognition problems: theory and methodolgy. Int. J. Pattern Recogn. Artif. Intell. (2007)
Alpaydın, E.: Introduction to Machine Learning, 2nd ed. MIT Press (2010)
Dreiseitl, S., Ohno-Machado, L.: Logistic regression and artificial neural network classification models: a methodology review. J. Biomed. Inf. 35(5–6) (2002)
Hassan, S., Mihalcea, R., Banea, C.: Random walk term weighting for improved text classification. Int. Seman. Comput. 1(4) (2007)
Breiman, L.: Bagging predictors. Mach. Learn. 24(2), 123–140 (1996)
D’Haen, J., Van den Poel, D.: Temporary staffing services: a data mining perspective. In: IEEE 12th International Conference on Data Mining Workshops (2012)
Breiman, L.: Random forests. Mach. Learn. 45, 5–32 (2001)
Zeng, L., Watson, D.G., Johnston, B.F., Clark, R.L., Edrada-Ebel, R., Elseheri, W.: A chemometric study of chromatograms of tea extracts by correlation optimization warping in conjunction with PCA, support vector machines and random forest data modeling. Analytica Chimica Acta 642 (2009)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Tüfekci, P. (2016). Prediction of Football Match Results in Turkish Super League Games. In: Abraham, A., Wegrzyn-Wolska, K., Hassanien, A., Snasel, V., Alimi, A. (eds) Proceedings of the Second International Afro-European Conference for Industrial Advancement AECIA 2015. Advances in Intelligent Systems and Computing, vol 427. Springer, Cham. https://doi.org/10.1007/978-3-319-29504-6_48
Download citation
DOI: https://doi.org/10.1007/978-3-319-29504-6_48
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-29503-9
Online ISBN: 978-3-319-29504-6
eBook Packages: EngineeringEngineering (R0)