How AI is opening the playbook on sports analytics
Waterloo researchers create simulated soccer datasets to unlock insights once reserved for pro teams
Waterloo researchers create simulated soccer datasets to unlock insights once reserved for pro teams
By Media RelationsProfessional sports teams pour millions of dollars into data analytics, using advanced tracking systems to study every sprint, pass, and decision on the field. The results of that analysis, however, are industry secrets, making many sports difficult for researchers to study.
Now, two University of Waterloo researchers, Dr. David Radke and Kyle Tilbury, are using AI to level the playing field.
By tapping into Google Research Football’s reinforcement learning environment, the researchers developed a system that can simulate and record unlimited soccer matches. To get things started, they generated and saved data from 3,000 simulated soccer games, resulting in a rich and complex dataset of passes, goals, and player movements for researchers to study.
“While researchers have access to a lot of data about episodic sports like baseball, continuous invasion-game sports like soccer and hockey are much more difficult to analyze,” said Radke, a recent Waterloo PhD graduate in computer science and currently a senior research scientist for the NHL’s Chicago Blackhawks.
“While the AI-generated players might not exactly play like Lionel Messi, the simulated datasets they generate are still useful for developing sports analysis tools.”
Datasets like the ones generated by the team are particularly useful for researchers, enthusiastic fans, and smaller research teams that may not have extensive access to proprietary sports data.
“Enabling researchers to have this data will open up all kinds of opportunities,” said Tilbury, a Waterloo PhD student in computer science who equally co-authored the research. “It’s a democratization of access to this kind of sports analytics data.
While datasets like the one generated by the team are particularly interesting for sports enthusiasts, they have larger implications for AI research as well.
“At its core, invasion-game sports analytics is about understanding complex multiagent systems,” Radke said. “The better we are at modelling the complexity of human behaviour in a sporting situation, the more useful that is for AI research. In turn, more advanced multiagent systems will help us better understand invasion-game sports.”
Radke and the team believe the future of sports analytics relies on progress in the space of tracking data. They therefore hope researchers interested in sports without access to tracking data will utilize their datasets and repository to gain experience working with this type of data.
The study, “Simulating tracking data to advance sports analytics research,” appeared in the proceedings of the 24th International Conference on Autonomous Agents and Multiagent Systems.

Dr. Chris Bauch, a professor of Applied Mathematics at the University of Waterloo, is part of a team that has developed a new approach to help public health officials predict where outbreaks might occur. (Elisabetta Paiano/University of Waterloo)
Read more
New research demonstrates that vaccine skepticism on social media can predict public health crises

Read more
Waterloo researcher Cameron Seth is breaking down the world’s hardest computer science problem piece by piece

Read more
New study shows updated 2024–2025 vaccines remain effective against severe outcomes
The University of Waterloo acknowledges that much of our work takes place on the traditional territory of the Neutral, Anishinaabeg, and Haudenosaunee peoples. Our main campus is situated on the Haldimand Tract, the land granted to the Six Nations that includes six miles on each side of the Grand River. Our active work toward reconciliation takes place across our campuses through research, learning, teaching, and community building, and is co-ordinated within the Office of Indigenous Relations.