
Analysis of the Accuracy of Reported Track Conditions Utilizing Predictive Modeling
- 1 University of North Carolina at Chapel Hill, School of Art and Science, Chapel Hill 27514, US
- 2 The Wheatley School, Old Westbury 11568, US
- 3 New York University, College of Art and Science, NY 10012, US
* Author to whom correspondence should be addressed.
Abstract
Horse racing is a globally viewed sport in which the performance of horses may be influenced by numerous factors, with the condition and moisture in the racing surface, known as the "going," being one of the most influential. Accurate reporting of the going is essential for ensuring fair competition. This study aims to measure the accuracy of reported track conditions by utilizing a dataset from three racecourses—Catterick, Chester, and Newmarket. We first identified discrepancies suspected to be caused by rounding in reported distances in the data and corrected them by reverting them to officially sanctioned distances. We started with linear regression models to predict winning times, using key variables such as race distance, class of the race, and the reported going. Then we applied log transformation to the data to solve heteroscedasticity. The final model will be used to generate prediction intervals for winning times under each going conditions, allowing us to figure out which goings might be reasonable for a specific race. The results indicate that approximately 6-7% of races were outside of the calculated bounds, which may lead to errors in strategic decisions by trainers and bettors. By calculating the posterior probabilities of each going condition using Bayesian inference, we created a list of reasonable goings for each new race, giving trainers and bettors more accurate information so that they can better prepare for future races in the day.
Keywords
track conditions, linear regression, horse racing, actual going prediction
[1]. Janett Williams and Yan Li, “A case study using neural networks algorithms:horse racing predictions in Jamaica. In International Conference on Artificial Intelligence, Las Vegas, NV, 2008. , ” .
[2]. Ali Reza Khanteymoori Elnaz Davoodi, “Horse racing prediction using artificial neural networks. NN’10/EC’10/FS’10 Proceedings of the 11th WSEAS international conference on nural networks and 11th WSEAS international conference on evolutionary computing and 11th WSEAS international conference on fuzzy systems pages 155-160 . Iasi, Romania June, 2010,
[3]. Michael Baulch, “Using machine learning to predict the results of sporting matches Baulch, M. ”Using Machine Learning to Predict the Results of Sporting Matches”. Department of Computer Science and Electrical Engineering, University of Queensland, 2001. , ”.
[4]. Michael Pardee, “An artificial neural network approach to college football prediction and ranking, ” Technical Paper. Madison, WI: University of Wisconsin, Electrical and Computer Engineering Department, 1999.
[5]. Zimo Yang Guo, Fangjian and Tao Zhou, “Predicting link directions via a recursive subgraph-based ranking Web Sciences Center, University of Electronic Science and Technology of China, Chengdu 611731, PR China, 2013., ”.
[6]. Daniel Hunttenlocher Jon Kleinberg Leskovec, Jure, “Predicting positive and negative links in online social network. Proceedings of the 19th international conference on World wide web, ACM, ” Raleigh, North Carolina, USA April 26 - 30, 2010.
[7]. Robert P. Schumaker, “Using SVM Regression to predict harness races: A one year study of northfield park, ” Computer and Information Science’s Department Cleveland State University, Cleveland, Ohio 44115, USA 2011.
[8]. “Schumaker, Robert P. and Johnson, James W. ”An Investigation of SVM Regression to Predict Longshot Greyhound Races, ” Communications of the IIMA: Vol. 8 : Iss. 2 , Article 7. available at: http://scholarworks.lib.csusb.edu/ciima/vol8/iss2/7, ” 2008.
[9]. Plouffe, Dominic. (2017). “Predicting Horse Racing Results”. Retrieved from https://github.com/dominicplouffe/HorseRacingPrediction
[10]. Wayne Lu Suvrat Bhooshan, Josh King, “Edge direction prediction of sporting tournaments graphs, Stanford University, Final Project, ” 2016.graphs, Stanford University, Final Project, ” 2016.
[11]. Linlin She Siunie Sutjahjo Chris Sommer Daryl Neely Hsinchun Chen, Peter Buntin Rinde, “Expert prediction, symbolic learning, and neural networks: An experiment on greyhound racing, IEEE Intelligent Systems, vol. 9, no. 6, pp. 21-27, ” Journal, Dec 1994.
[12]. At The Races. (2023).Course Guides, At The Races. https://www.attheraces.com/course-guides (Accessed July 24, 2024).
[13]. R Core Team (2021). R: A language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria. URL https://www.R-project.org/.
[14]. Montgomery, D. C., Peck, E. A., & Vining, G. G. (2012). Introduction to Linear Regression Analysis (5th ed.). John Wiley & Sons. (upper lower bound function)
[15]. Statlect. (n.d.). Bayesian inference. Retrieved July 28, 2024, from https://www.statlect.com/fundamentals-of-statistics/Bayesian-inference
[16]. Gulum, Mehmet Akif, “Horse racing prediction using graph-based features.” (2018). Electronic Theses and Dissertations. Paper 2953.
[17]. Taiyun Wei and Viliam Simko (2021). R package ‘corrplot’: Visualization of a Correlation Matrix (Version 0.92).
[18]. Iannone R, Cheng J, Schloerke B, Hughes E, Lauer A, Seo J, Brevoort K, Roy O (2024). gt: Easily Create Presentation-Ready Display Tables. R package version 0.11.0
[19]. Wickham H (2016). ggplot2: Elegant Graphics for Data Analysis. Springer-Verlag New York. ISBN 978-3-319-24277-4, https://ggplot2.tidyverse.org.
Cite this article
Li,M.;Yang,D.;Zhao,T. (2025). Analysis of the Accuracy of Reported Track Conditions Utilizing Predictive Modeling. Applied and Computational Engineering,131,18-33.
Data availability
The datasets used and/or analyzed during the current study will be available from the authors upon reasonable request.
Disclaimer/Publisher's Note
The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of EWA Publishing and/or the editor(s). EWA Publishing and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
About volume
Volume title: Proceedings of the 2nd International Conference on Machine Learning and Automation
© 2024 by the author(s). Licensee EWA Publishing, Oxford, UK. This article is an open access article distributed under the terms and
conditions of the Creative Commons Attribution (CC BY) license. Authors who
publish this series agree to the following terms:
1. Authors retain copyright and grant the series right of first publication with the work simultaneously licensed under a Creative Commons
Attribution License that allows others to share the work with an acknowledgment of the work's authorship and initial publication in this
series.
2. Authors are able to enter into separate, additional contractual arrangements for the non-exclusive distribution of the series's published
version of the work (e.g., post it to an institutional repository or publish it in a book), with an acknowledgment of its initial
publication in this series.
3. Authors are permitted and encouraged to post their work online (e.g., in institutional repositories or on their website) prior to and
during the submission process, as it can lead to productive exchanges, as well as earlier and greater citation of published work (See
Open access policy for details).