Reinforcement Learning

All papers, news and topics related to Reinforcement Learning.
Subramanian, S.G. et al., 2021. Partially Observable Mean Field Reinforcement Learning. In Proc. of the 20th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2021). 3–7 May. London, United Kingdom: International Foundation for Autonomous Agents and Multiagent Systems, pp. 537–545.
Traditional multi-agent reinforcement learning algorithms are not scalable to environments with more than a few agents, since these algorithms are exponential in the number of agents. Recent research has introduced successful methods to scale multi-agent reinforcement learning algorithms to many-agent scenarios using mean field theory. Previous work in this field assumes that an agent has access to exact cumulative metrics regarding the mean field behaviour of the system, which it can then use to take its actions. In this paper, we relax this assumption and maintain a distribution to model the uncertainty regarding the mean field of the system. We consider two different settings for this problem. In the first setting, only agents in a fixed neighbourhood are visible, while in the second setting, the visibility of agents is determined at random based on distances. For each of these settings, we introduce a Q-learning-based algorithm that can learn effectively. We prove that this Q-learning estimate stays very close to the Nash Q-value (under a common set of assumptions) for the first setting. We also empirically show that our algorithms outperform multiple baselines in three different games in the MAgent framework, which supports large environments with many agents learning simultaneously to achieve possibly distinct goals.
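The core idea of the abstract above — replacing exact mean field statistics with a distribution updated from visible neighbours only — can be sketched as follows. This is a minimal illustration, not the paper's exact algorithm: the Dirichlet posterior over the mean action, the state/action sizes, and the Q-update are all simplifying assumptions.

```python
import numpy as np

N_ACTIONS = 3

class PartiallyObservableMFQ:
    """Sketch of mean-field Q-learning when only neighbours are visible."""

    def __init__(self, n_states, alpha=0.1, gamma=0.95):
        self.q = np.zeros((n_states, N_ACTIONS))  # Q(s, a)
        self.alpha, self.gamma = alpha, gamma
        # Dirichlet pseudo-counts modelling uncertainty over the mean action
        self.counts = np.ones(N_ACTIONS)

    def observe_neighbours(self, neighbour_actions):
        # Update the posterior using only the actions of visible neighbours,
        # rather than exact cumulative metrics over the whole population.
        for a in neighbour_actions:
            self.counts[a] += 1

    def mean_action(self):
        # Posterior mean of the population's action distribution.
        return self.counts / self.counts.sum()

    def update(self, s, a, r, s_next):
        # Standard Q-learning update; the full algorithm would also
        # condition the target on the estimated mean action.
        target = r + self.gamma * self.q[s_next].max()
        self.q[s, a] += self.alpha * (target - self.q[s, a])

agent = PartiallyObservableMFQ(n_states=5)
agent.observe_neighbours([0, 0, 2])  # three visible neighbours
mu = agent.mean_action()             # uncertain estimate of the mean field
agent.update(s=0, a=1, r=1.0, s_next=2)
```

In the fixed-neighbourhood setting, `observe_neighbours` would be called with the same set of agents each step; in the random-visibility setting, the set would be resampled based on distance.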
Artificial intelligence has been applied in wildfire science and management since the 1990s, with early applications including neural networks and expert systems. Since then the field has rapidly progressed congruently with the wide adoption of machine learning (ML) methods in the environmental sciences. Here, we present a scoping review of ML applications in wildfire science and management. Our overall objective is to improve awareness of ML methods among wildfire researchers and managers, as well as illustrate the diverse and challenging range of problems in wildfire science available to ML data scientists. To that end, we first present an overview of popular ML approaches used in wildfire science to date, and then review the use of ML in wildfire science as broadly categorized into six problem domains: 1) fuels characterization, fire detection, and mapping; 2) fire weather and climate change; 3) fire occurrence, susceptibility, and risk; 4) fire behavior prediction; 5) fire effects; and 6) fire management. Furthermore, we discuss the advantages and limitations of various ML approaches relating to data size, computational requirements, generalizability, and interpretability, as well as identify opportunities for future advances in the science and management of wildfires within a data science context. In total, we identified 300 relevant publications up to the end of 2019, where the most frequently used ML methods across problem domains included random forests, MaxEnt, artificial neural networks, decision trees, support vector machines, and genetic algorithms. As such, there exist opportunities to apply more current ML methods, including deep learning and agent-based learning, in the wildfire sciences, especially in instances involving very large multivariate datasets.
We must recognize, however, that despite the ability of ML methods to learn on their own, expertise in wildfire science is necessary to ensure realistic modelling of fire processes across multiple scales, while the complexity of some ML methods, such as deep learning, requires dedicated and sophisticated knowledge of their application. Finally, we stress that the wildfire research and management communities must play an active role in providing relevant, high-quality, and freely available wildfire data for use by practitioners of ML methods.

Good News in the UWECEML Lab

May 15, 2019

As spring continues to tease us with cold and rain, we could all use some good news to cheer us up. It seems we've been storing it up recently, and there are several exciting achievements to highlight:

Read more about Good News in the UWECEML Lab
Bhalla, S., Subramanian, S.G. & Crowley, M., 2019. Training Cooperative Agents for Multi-Agent Reinforcement Learning. In Proc. of the 18th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2019). Montreal, Canada.
Deep Learning and back-propagation have been successfully used to perform centralized training with communication protocols among multiple agents in a cooperative environment. In this paper we present techniques for centralized training of Multi-Agent (Deep) Reinforcement Learning (MARL) using the model-free Deep Q-Network as the baseline model and message sharing between agents. We present a novel, scalable, centralized MARL training technique which separates the message learning module from the policy module. The separation of these modules helps in faster convergence in complex domains like autonomous driving simulators. A second contribution uses the centrally trained model to bootstrap training of distributed, independent, cooperative agent policies for execution, and thus addresses the challenges of noise and communication bottlenecks in real-time communication channels. This paper theoretically and empirically compares our centralized training algorithms to current research in the field of MARL. We also present and release a new OpenAI Gym environment which can be used for multi-agent research, as it simulates multiple autonomous cars driving cooperatively on a highway.
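The separation of the message-learning module from the policy module described above can be sketched in a few lines. This is a simplified illustration under assumed shapes, not the paper's architecture: the linear maps stand in for learned networks, and mean aggregation of neighbour messages is an illustrative choice.

```python
import numpy as np

rng = np.random.default_rng(0)
OBS_DIM, MSG_DIM, N_ACTIONS = 4, 2, 3

# Message module: maps an agent's observation to a message vector.
W_msg = rng.normal(size=(MSG_DIM, OBS_DIM))
# Policy module: maps [own observation; aggregated messages] to Q-values.
# Keeping these as separate parameter sets is the point of the separation.
W_pol = rng.normal(size=(N_ACTIONS, OBS_DIM + MSG_DIM))

def message(obs):
    return np.tanh(W_msg @ obs)

def q_values(obs, neighbour_msgs):
    agg = np.mean(neighbour_msgs, axis=0)  # aggregate incoming messages
    return W_pol @ np.concatenate([obs, agg])

# Three agents each produce a message from their own observation.
obs = [rng.normal(size=OBS_DIM) for _ in range(3)]
msgs = [message(o) for o in obs]

# Agent 0 acts on its own observation plus the others' messages.
q0 = q_values(obs[0], [msgs[1], msgs[2]])
action0 = int(np.argmax(q0))
```

During centralized training both modules are updated jointly; for distributed execution, the policy module can then be bootstrapped to act even when the real-time communication channel is noisy or bottlenecked.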
Fighting Fire with AI: Using Artificial Intelligence to Improve Modelling and Decision Making in Wildfire Management, at Banff International Research Station, Banff, Alberta, Canada, Friday, November 17, 2017:
I was invited to speak at this week-long workshop at the fabulous BIRS facility in Banff, Alberta. The workshop, entitled "Forest and Wildland Fire Management: a Risk Management Perspective", brought together a wide range of experts and stakeholders from across Canada, as well as some researchers from around the world, to discuss the latest research on forest fire management. It was an incredibly productive week that built many new connections. Read more about Fighting Fire with AI: Using Artificial Intelligence to Improve Modelling and Decision Making in Wildfire Management
Using Deep Learning and Reinforcement Learning to Tame Spatially Spreading Processes, at University of Waterloo, Wednesday, October 25, 2017

This was an invited talk for the Waterloo Institute for Complexity and Innovation (WICI) seminar series. The talk was recorded and can be watched from WICI's website here.

Abstract:

Recent advances in Artificial Intelligence and Machine Learning (AI/ML) allow us to learn predictive models and control policies for larger, more complex systems than ever before. However, some important real world domains such as...

Read more about Using Deep Learning and Reinforcement Learning to Tame Spatially Spreading Processes
BIRC Workshop On Deep Learning In Medicine, at University Hospital, London, Ontario, Canada, Monday, August 28, 2017:

This all-day workshop brought together researchers, students, and medical professionals from medical imaging, image processing, and machine learning to discuss what the new class of machine learning algorithms known collectively as Deep Learning is, how it is and could be used for medicine, and what its impacts are for medicine as a whole. The workshop was hosted by the Biomedical Imaging Research Centre (BIRC) at the University of Western Ontario. I gave an introductory...

Read more about BIRC Workshop On Deep Learning In Medicine

Spatiotemporal planning involves making choices at multiple locations in space over some planning horizon to maximize utility and satisfy various constraints. In Forest Ecosystem Management, the problem is to choose actions for thousands of locations each year, including harvesting, treating trees for fire or pests, or doing nothing. The utility models could place value on sale of lumber, ecosystem sustainability, or employment levels, and incorporate legal and logistical constraints on actions, such as avoiding large contiguous areas of clearcutting. Simulators developed by forestry researchers provide detailed dynamics but are generally inaccessible black boxes. We model spatiotemporal planning as a factored Markov decision process and present a policy gradient planning algorithm to optimize a stochastic spatial policy using simulated dynamics. It is common in environmental and resource planning to have actions at different locations be spatially interrelated; this makes representation and planning challenging. We define a global spatial policy in terms of interacting local policies defining distributions over actions at each location conditioned on actions at nearby locations. Markov chain Monte Carlo simulation is used to sample landscape policies and estimate their gradients. Evaluation is carried out on a forestry planning problem with 1,880 locations using a variety of value models and constraints.
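The global spatial policy built from neighbour-conditioned local policies can be sketched with Gibbs-style MCMC sampling over a grid of locations. This is a toy illustration under assumed parameters, not the paper's algorithm: the grid size, two-action space, preference weights, and negative coupling (which discourages adjacent cells taking the same action, loosely analogous to a clearcut constraint) are all hypothetical.

```python
import numpy as np

rng = np.random.default_rng(1)
GRID, N_ACTIONS = 6, 2          # e.g. action 0 = do nothing, 1 = harvest
theta = np.array([0.0, -0.5])   # per-action preference parameters
coupling = -1.0                 # penalize matching neighbouring actions

def local_logits(actions, i, j):
    # Logits of each action at cell (i, j), conditioned on the current
    # actions of the four adjacent cells (the local policy).
    nbrs = [actions[x, y]
            for x, y in ((i - 1, j), (i + 1, j), (i, j - 1), (i, j + 1))
            if 0 <= x < GRID and 0 <= y < GRID]
    logits = theta.copy()
    for a in range(N_ACTIONS):
        logits[a] += coupling * sum(n == a for n in nbrs)
    return logits

def gibbs_sweep(actions):
    # One MCMC sweep: resample each cell from its local conditional
    # distribution, yielding samples from the global spatial policy.
    for i in range(GRID):
        for j in range(GRID):
            p = np.exp(local_logits(actions, i, j))
            p /= p.sum()
            actions[i, j] = rng.choice(N_ACTIONS, p=p)
    return actions

actions = rng.integers(0, N_ACTIONS, size=(GRID, GRID))
for _ in range(10):
    actions = gibbs_sweep(actions)
```

In a policy gradient planner, samples like these would be rolled through the (black-box) simulator to estimate returns, and the gradients with respect to the local policy parameters (here `theta` and `coupling`) would be estimated from the sampled configurations.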