<?xml version="1.0" encoding="UTF-8"?><xml><records><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Reza Godaz</style></author><author><style face="normal" font="default" size="100%">Benyamin Ghojogh</style></author><author><style face="normal" font="default" size="100%">Reshad Hosseini</style></author><author><style face="normal" font="default" size="100%">Reza Monsefi</style></author><author><style face="normal" font="default" size="100%">Fakhri Karray</style></author><author><style face="normal" font="default" size="100%">Mark Crowley</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Vector Transport Free Riemannian LBFGS for Optimization on Symmetric Positive Definite Matrix Manifolds</style></title><secondary-title><style face="normal" font="default" size="100%">Asian Conference on Machine Learning (ACML)</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">Accepted</style></year><pub-dates><date><style  face="normal" font="default" size="100%">November</style></date></pub-dates></dates><urls><web-urls><url><style face="normal" font="default" size="100%">http://www.acml-conf.org/2021/conference/accepted-papers/81/</style></url></web-urls></urls><pub-location><style face="normal" font="default" size="100%">Virtual</style></pub-location><pages><style face="normal" font="default" size="100%">8</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">This work concentrates on optimization on Riemannian manifolds. The Limited-memory Broyden–Fletcher–Goldfarb–Shanno (LBFGS) algorithm is a commonly used quasi-Newton method for numerical optimization in Euclidean spaces. Riemannian LBFGS (RLBFGS) is an extension of this method to Riemannian manifolds. RLBFGS involves computationally expensive vector transports as well as unfolding recursions using adjoint vector transports. In this article, we propose two mappings in the tangent space using the inverse second root and Cholesky decomposition. These mappings make both vector transport and adjoint vector transport identity and therefore isometric. Identity vector transport makes RLBFGS less computationally expensive and its isometry is also very useful in convergence analysis of RLBFGS. Moreover, under the proposed mappings, the Riemannian metric reduces to Euclidean inner product, which is much less computationally expensive. We focus on the Symmetric Positive Definite (SPD) manifolds which are beneficial in various fields such as data science and statistics. This work opens a research opportunity for extension of the proposed mappings to other well-known manifolds.</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Poorheravi*, Parisa</style></author><author><style face="normal" font="default" size="100%">Ghojogh*, Benyamin</style></author><author><style face="normal" font="default" size="100%">Gaudet, Vincent</style></author><author><style face="normal" font="default" size="100%">Karray, Fakhri</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Acceleration of Large Margin Metric Learning for Nearest Neighbor Classification Using Triplet Mining and Stratified Sampling</style></title><secondary-title><style face="normal" font="default" size="100%">Journal of Computational Vision and Imaging Systems</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2021</style></year><pub-dates><date><style  face="normal" font="default" size="100%">Jan.</style></date></pub-dates></dates><urls><web-urls><url><style face="normal" font="default" size="100%">https://openjournals.uwaterloo.ca/index.php/vsl/article/view/3534</style></url></web-urls></urls><number><style face="normal" font="default" size="100%">1</style></number><volume><style face="normal" font="default" size="100%">6</style></volume><pages><style face="normal" font="default" size="100%">5</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">Metric learning is a technique in manifold learning to find a projection subspace for increasing and decreasing the inter- and intra-class variances, respectively. Some metric learning methods are based on triplet learning with anchor-positive-negative triplets. Large margin metric learning for nearest neighbor classification is one of the fundamental methods to do this. Recently, Siamese networks have been introduced with the triplet loss. Many triplet mining methods have been developed for Siamese nets; however, these techniques have not been applied on the triplets of large margin metric learning. In this work, inspired by the mining methods for Siamese nets, we propose several triplet mining techniques for large margin metric learning. Moreover, a hierarchical approach is proposed, for acceleration and scalability of optimization, where triplets are selected by stratified sampling in hierarchical hyper-spheres. We analyze the proposed methods on three publicly available datasets.</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Colin Bellinger</style></author><author><style face="normal" font="default" size="100%">Rory Coles</style></author><author><style face="normal" font="default" size="100%">Mark Crowley</style></author><author><style face="normal" font="default" size="100%">Isaac Tamblyn</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Active Measure Reinforcement Learning for Observation Cost Minimization: A framework for minimizing measurement costs in reinforcement learning</style></title><secondary-title><style face="normal" font="default" size="100%">Canadian Conference on Artificial Intelligence</style></secondary-title><tertiary-title><style face="normal" font="default" size="100%">Lecture Notes in Computer Science (LNCS)</style></tertiary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">chemistry</style></keyword><keyword><style  face="normal" font="default" size="100%">Machine-Learning</style></keyword><keyword><style  face="normal" font="default" size="100%">proj-deepchemrl</style></keyword><keyword><style  face="normal" font="default" size="100%">reinforcement-learning</style></keyword><keyword><style  face="normal" font="default" size="100%">showcase</style></keyword><keyword><style  face="normal" font="default" size="100%">year-in-review-2021</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2021</style></year></dates><publisher><style face="normal" font="default" size="100%">Springer</style></publisher><pages><style face="normal" font="default" size="100%">12</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">Markov Decision Processes (MDP) with explicit measurement cost are a class of en- vironments in which the agent learns to maximize the costed return. Here, we define the costed return as the discounted sum of rewards minus the sum of the explicit cost of measuring the next state. The RL agent can freely explore the relationship between actions and rewards but is charged each time it measures the next state. Thus, an op- timal agent must learn a policy without making a large number of measurements. We propose the active measure RL framework (Amrl) as a solution to this novel class of problem, and contrast it with standard reinforcement learning under full observability and planning under partially observability. We demonstrate that Amrl-Q agents learn to shift from a reliance on costly measurements to exploiting a learned transition model in order to reduce the number of real-world measurements and achieve a higher costed return. Our results demonstrate the superiority of Amrl-Q over standard RL methods, Q-learning and Dyna-Q, and POMCP for planning under a POMDP in environments with explicit measurement costs.</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Aishwarya Krishna Allada</style></author><author><style face="normal" font="default" size="100%">Yuanxin Wang</style></author><author><style face="normal" font="default" size="100%">Veni Jindal</style></author><author><style face="normal" font="default" size="100%">Morteza Babee</style></author><author><style face="normal" font="default" size="100%">H.R. Tizhoosh</style></author><author><style face="normal" font="default" size="100%">Mark Crowley</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Analysis of Language Embeddings for Classification of Unstructured Pathology Reports</style></title><secondary-title><style face="normal" font="default" size="100%">International Conference of the IEEE Engineering in Medicine and Biology Society</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">Deep Neural Networks</style></keyword><keyword><style  face="normal" font="default" size="100%">digital pathology</style></keyword><keyword><style  face="normal" font="default" size="100%">natural language processing</style></keyword><keyword><style  face="normal" font="default" size="100%">proj-digipath</style></keyword><keyword><style  face="normal" font="default" size="100%">year-in-review-2021</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2021</style></year><pub-dates><date><style  face="normal" font="default" size="100%">November</style></date></pub-dates></dates><publisher><style face="normal" font="default" size="100%">IEEE</style></publisher><pages><style face="normal" font="default" size="100%">4</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">A pathology report is one of the most significant medical documents providing interpretive insights into the visual appearance of the patient's biopsy sample. In digital pathology, high-resolution images of tissue samples are stored along with pathology reports. Despite the valuable information that pathology reports hold, they are not used in any systematic manner to promote computational pathology. In this work, we focus on analyzing the reports, which are generally unstructured documents written in English with sophisticated and highly specialized medical terminology. We provide a comparative analysis of various embedding models like BioBERT, Clinical BioBERT, BioMed-RoBERTa and Term Frequency-Inverse Document Frequency (TF-IDF), a traditional NLP technique, as well as the combination of embeddings from pre-trained models with TF-IDF. Our results demonstrate the effectiveness of various word embedding techniques for pathology reports.</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Milad Sikaroudi</style></author><author><style face="normal" font="default" size="100%">Benyamin Ghojogh</style></author><author><style face="normal" font="default" size="100%">Fakhri Karray</style></author><author><style face="normal" font="default" size="100%">Mark Crowley</style></author><author><style face="normal" font="default" size="100%">H. R. Tizhoosh</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Batch-Incremental Triplet Sampling for Training Triplet Networks Using Bayesian Updating Theorem</style></title><secondary-title><style face="normal" font="default" size="100%">25th International Conference on Pattern Recognition (ICPR)</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">digital-pathology</style></keyword><keyword><style  face="normal" font="default" size="100%">manifold-learning</style></keyword><keyword><style  face="normal" font="default" size="100%">proj-digipath</style></keyword><keyword><style  face="normal" font="default" size="100%">triplet-mining</style></keyword><keyword><style  face="normal" font="default" size="100%">year-in-review-2021</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2021</style></year><pub-dates><date><style  face="normal" font="default" size="100%">January</style></date></pub-dates></dates><urls><web-urls><url><style face="normal" font="default" size="100%">https://ieeexplore.ieee.org/document/9412478</style></url></web-urls></urls><publisher><style face="normal" font="default" size="100%">IEEE</style></publisher><pub-location><style face="normal" font="default" size="100%">Milan, Italy (virtual)</style></pub-location><pages><style face="normal" font="default" size="100%">7</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">Variants of Triplet networks are robust entities for learning a discriminative embedding subspace. There exist different triplet mining approaches for selecting the most suitable training triplets. Some of these mining methods rely on the extreme distances between instances, and some others make use of sampling. However, sampling from stochastic distributions of data rather than sampling merely from the existing embedding instances can provide more discriminative information. In this work, we sample triplets from distributions of data rather than from existing instances. We consider a multivariate normal distribution for the embedding of each class. Using Bayesian updating and conjugate priors, we update the distributions of classes dynamically by receiving the new mini-batches of training data. The proposed triplet mining with Bayesian updating can be used with any triplet-based loss function, e.g., triplet-loss or Neighborhood Component Analysis (NCA) loss. Accordingly, Our triplet mining approaches are called Bayesian Updating Triplet (BUT) and Bayesian Updating NCA (BUNCA), depending on which loss function is being used. Experimental results on two public datasets, namely MNIST and histopathology colorectal cancer (CRC), substantiate the effectiveness of the proposed triplet mining method.</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>34</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Benyamin Ghojogh</style></author><author><style face="normal" font="default" size="100%">Ali Ghodsi</style></author><author><style face="normal" font="default" size="100%">Fakhri Karray</style></author><author><style face="normal" font="default" size="100%">Mark Crowley</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Generative Locally Linear Embedding</style></title></titles><keywords><keyword><style  face="normal" font="default" size="100%">embeddings</style></keyword><keyword><style  face="normal" font="default" size="100%">manifold-learning</style></keyword><keyword><style  face="normal" font="default" size="100%">representation-learning</style></keyword><keyword><style  face="normal" font="default" size="100%">year-in-review-2021</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2021</style></year><pub-dates><date><style  face="normal" font="default" size="100%">October</style></date></pub-dates></dates><urls><web-urls><url><style face="normal" font="default" size="100%">https://arxiv.org/abs/2104.01525</style></url></web-urls></urls><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">Locally Linear Embedding (LLE) is a nonlin- ear spectral dimensionality reduction and manifold learning method. It has two main steps which are linear reconstruc- tion and linear embedding of points in the input space and embedding space, respectively. In this work, we propose two novel generative versions of LLE, named Generative LLE (GLLE), whose linear reconstruction steps are stochastic rather than deterministic. GLLE assumes that every data point is caused by its linear reconstruction weights as latent factors. The proposed GLLE algorithms can generate various LLE embeddings stochastically while all the generated embeddings relate to the original LLE embedding. We propose two versions for stochastic linear reconstruction, one using expectation maximization and another with direct sampling from a derived distribution by optimization. The proposed GLLE methods are closely related to and inspired by variational inference, factor analysis, and probabilistic principal component analysis. Our simulations show that the proposed GLLE methods work effectively in unfolding and generating submanifolds of data.</style></abstract><work-type><style face="normal" font="default" size="100%">Preprint</style></work-type></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Ghojogh, Benyamin</style></author><author><style face="normal" font="default" size="100%">Ghodsi, Ali</style></author><author><style face="normal" font="default" size="100%">Karray, Fakhri</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Generative locally linear embedding: A module for manifold unfolding and visualization</style></title><secondary-title><style face="normal" font="default" size="100%">Software Impacts</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2021</style></year></dates><number><style face="normal" font="default" size="100%">100105</style></number><publisher><style face="normal" font="default" size="100%">Elsevier</style></publisher><volume><style face="normal" font="default" size="100%">9</style></volume><pages><style face="normal" font="default" size="100%">3</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">Data often have nonlinear patterns in machine learning. One can unfold the nonlinear manifold of a dataset for low-dimensional visualization and feature extraction. Locally Linear Embedding (LLE) is a nonlinear spectral method for dimensionality reduction and manifold unfolding. It embeds data using the same linear reconstruction weights as in the input space. In this paper, we propose an open source module which not only implements LLE, but also includes implementations of two generative LLE algorithms whose linear reconstruction phases are stochastic. Using this module, one can generate as many manifold unfoldings as desired for data visualization or feature extraction.</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Akgun, Sami Alperen</style></author><author><style face="normal" font="default" size="100%">Ghafurian, Moojan</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author><author><style face="normal" font="default" size="100%">Dautenhahn, Kerstin</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Integrating Affective Expressions into the Search and Rescue Context in order to Improve Non-Verbal Human-Robot Interaction</style></title><secondary-title><style face="normal" font="default" size="100%">Workshop on Exploring Applications for Autonomous Non-Verbal Human-Robot Interactions (HRI)</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">affective communication</style></keyword><keyword><style  face="normal" font="default" size="100%">emotion-modeling</style></keyword><keyword><style  face="normal" font="default" size="100%">human-robot-interaction</style></keyword><keyword><style  face="normal" font="default" size="100%">search-and-rescue</style></keyword><keyword><style  face="normal" font="default" size="100%">year-in-review-2021</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2021</style></year><pub-dates><date><style  face="normal" font="default" size="100%">March</style></date></pub-dates></dates><urls><web-urls><url><style face="normal" font="default" size="100%">https://sites.google.com/view/non-verbal-hri-2021/home</style></url></web-urls></urls><publisher><style face="normal" font="default" size="100%">ACM</style></publisher><pub-location><style face="normal" font="default" size="100%">Virtual</style></pub-location><language><style face="normal" font="default" size="100%">eng</style></language></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Milad Sikaroudi</style></author><author><style face="normal" font="default" size="100%">Benyamin Ghojogh</style></author><author><style face="normal" font="default" size="100%">Fakhri Karray</style></author><author><style face="normal" font="default" size="100%">Mark Crowley</style></author><author><style face="normal" font="default" size="100%">H. R. Tizhoosh</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Magnification Generalization for Histopathology Image Embedding</style></title><secondary-title><style face="normal" font="default" size="100%">IEEE International Symposium on Biomedical Imaging (ISBI)</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">proj-digipath</style></keyword><keyword><style  face="normal" font="default" size="100%">year-in-review-2021</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2021</style></year><pub-dates><date><style  face="normal" font="default" size="100%">April</style></date></pub-dates></dates><pages><style face="normal" font="default" size="100%">5</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">Histopathology image embedding is an active research area in computer vision. Most of the embedding models exclusively concentrate on a specific magnification level. However, a useful task in histopathology embedding is to train an embedding space regardless of the magnification level. Two main approaches for tackling this goal are domain adaptation and domain generalization, where the target magnification levels may or may not be introduced to the model in training, respectively. Although magnification adaptation is a well-studied topic in the literature, this paper, to the best of our knowledge, is the first work on magnification generalization for histopathology image embedding. We use an episodic trainable domain generalization technique for magnification generalization, namely Model Agnostic Learning of Semantic Features (MASF), which works based on the Model Agnostic Meta-Learning (MAML) concept. Our experimental results on a breast cancer histopathology dataset with four different magnification levels show the proposed method's effectiveness for magnification generalization.</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Ganapathi Subramanian, Sriram</style></author><author><style face="normal" font="default" size="100%">Taylor, Matthew</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author><author><style face="normal" font="default" size="100%">Poupart, Pascal</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Partially Observable Mean Field Reinforcement Learning</style></title><secondary-title><style face="normal" font="default" size="100%">Proceedings of the 20th International Conference on Autonomous Agents and MultiAgent Systems (AAMAS)</style></secondary-title><tertiary-title><style face="normal" font="default" size="100%">AAMAS '21</style></tertiary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">mean field theory</style></keyword><keyword><style  face="normal" font="default" size="100%">Multi-Agent Reinforcement Learning</style></keyword><keyword><style  face="normal" font="default" size="100%">partial observation</style></keyword><keyword><style  face="normal" font="default" size="100%">reinforcement learning</style></keyword><keyword><style  face="normal" font="default" size="100%">showcase</style></keyword><keyword><style  face="normal" font="default" size="100%">year-in-review-2021</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2021</style></year><pub-dates><date><style  face="normal" font="default" size="100%">3–7 May</style></date></pub-dates></dates><publisher><style face="normal" font="default" size="100%">International Foundation for Autonomous Agents and Multiagent Systems</style></publisher><pub-location><style face="normal" font="default" size="100%">London, United Kingdom</style></pub-location><pages><style face="normal" font="default" size="100%">537-545</style></pages><isbn><style face="normal" font="default" size="100%">9781450383073</style></isbn><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">Traditional multi-agent reinforcement learning algorithms are not scalable to environments with more than a few agents, since these algorithms are exponential in the number of agents. Recent research has introduced successful methods to scale multi-agent reinforcement learning algorithms to many agent scenarios using mean field theory. Previous work in this field assumes that an agent has access to exact cumulative metrics regarding the mean field behaviour of the system, which it can then use to take its actions. In this paper, we relax this assumption and maintain a distribution to model the uncertainty regarding the mean field of the system. We consider two different settings for this problem. In the first setting, only agents in a fixed neighbourhood are visible, while in the second setting, the visibility of agents is determined at random based on distances. For each of these settings, we introduce a Q-learning based algorithm that can learn effectively. We prove that this Q-learning estimate stays very close to the Nash Q-value (under a common set of assumptions) for the first setting. We also empirically show our algorithms outperform multiple baselines in three different games in the MAgents framework, which supports large environments with many agents learning simultaneously to achieve possibly distinct goals.</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Mark Crowley</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Prediction and Causality: How Can Machine Learning be Used for COVID-19?</style></title><secondary-title><style face="normal" font="default" size="100%">&quot;What Needs to be done in order to Curb the Spread of Covid-19: Exposure Notification, Legal Considerations, and Statistical Modeling&quot;, a Conference on Data and Privacy during a Global Pandemic</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">Causality</style></keyword><keyword><style  face="normal" font="default" size="100%">covid-19</style></keyword><keyword><style  face="normal" font="default" size="100%">epidemiology</style></keyword><keyword><style  face="normal" font="default" size="100%">graphical-models</style></keyword><keyword><style  face="normal" font="default" size="100%">Machine-Learning</style></keyword><keyword><style  face="normal" font="default" size="100%">medical</style></keyword><keyword><style  face="normal" font="default" size="100%">reinforcement-learning</style></keyword><keyword><style  face="normal" font="default" size="100%">year-in-review-2021</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2021</style></year><pub-dates><date><style  face="normal" font="default" size="100%">July</style></date></pub-dates></dates><urls><web-urls><url><style face="normal" font="default" size="100%">https://uwaterloo.ca/master-of-public-service/events/data-and-privacy-during-global-pandemic-conference</style></url></web-urls></urls><publisher><style face="normal" font="default" size="100%">Master of Public Service (MPS) Policy and Data Lab, University of Waterloo</style></publisher><pub-location><style face="normal" font="default" size="100%">Waterloo, Canada</style></pub-location><pages><style face="normal" font="default" size="100%">6</style></pages><language><style face="normal" font="default" size="100%">eng</style></language></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Ghojogh, Benyamin</style></author><author><style face="normal" font="default" size="100%">Karray, Fakhri</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Quantile&amp;ndash;Quantile Embedding for Distribution Transformation and Manifold Embedding with Ability to Choose the Embedding Distribution</style></title><secondary-title><style face="normal" font="default" size="100%">Machine Learning with Applications (MLWA)</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">dimensionality-reduction</style></keyword><keyword><style  face="normal" font="default" size="100%">embeddings</style></keyword><keyword><style  face="normal" font="default" size="100%">image-processing</style></keyword><keyword><style  face="normal" font="default" size="100%">image-understanding</style></keyword><keyword><style  face="normal" font="default" size="100%">manifold-learning</style></keyword><keyword><style  face="normal" font="default" size="100%">proj-digipath</style></keyword><keyword><style  face="normal" font="default" size="100%">showcase</style></keyword><keyword><style  face="normal" font="default" size="100%">year-in-review-2021</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2021</style></year></dates><volume><style face="normal" font="default" size="100%">6</style></volume><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">We propose a new embedding method, named Quantile-Quantile Embedding (QQE), for distribution transformation and manifold embedding with the ability to choose the embedding distribution. QQE, which uses the concept of quantile-quantile plot from visual statistical tests, can transform the distribution of data to any theoretical desired distribution or empirical reference sample. Moreover, QQE gives the user a choice of embedding distribution in embedding the manifold of data into the low dimensional embedding space. It can also be used for modifying the embedding distribution of other dimensionality reduction methods, such as PCA, t-SNE, and deep metric learning, for better representation or visualization of data. We propose QQE in both unsupervised and supervised forms. QQE can also transform a distribution to either an exact reference distribution or its shape. We show that QQE allows for better discrimination of classes in some cases. Our experiments on different synthetic and image datasets show the effectiveness of the proposed embedding method.</style></abstract><notes><style face="normal" font="default" size="100%">1</style></notes></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Moojan Ghafurian</style></author><author><style face="normal" font="default" size="100%">Sami Alperen Akgun</style></author><author><style face="normal" font="default" size="100%">Mark Crowley</style></author><author><style face="normal" font="default" size="100%">Kerstin Dautenhahn</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Recognition of a Robot's Affective Expressions under Conditions with Limited Visibility</style></title><secondary-title><style face="normal" font="default" size="100%">18th International Conference promoted by the IFIP Technical Committee 13 on Human–Computer Interaction (INTERACT 2021)</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">human-robot-interaction</style></keyword><keyword><style  face="normal" font="default" size="100%">showcase</style></keyword><keyword><style  face="normal" font="default" size="100%">year-in-review-2021</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2021</style></year><pub-dates><date><style  face="normal" font="default" size="100%">September</style></date></pub-dates></dates><pub-location><style face="normal" font="default" size="100%">Bari, Italy</style></pub-location><pages><style face="normal" font="default" size="100%">22</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">The capability of showing affective expressions is important for the design of social robots in many contexts, especially where the robot is designed to communicate with humans. It is reasonable to expect that, similar to all other interaction modalities, communicating with affective expressions is not without limitations. In this paper, we present two online video studies (N=72 and N=50) and investigate if/how much the recognition of affective displays of a zoomorphic robot is affected under situations with different levels of visibility. Five affective expressions combined with five visibility effects were studied. The intensity of the effects was more pronounced in the second experiment. While visual constraints affected recognition of expressions, our results showed that affective displays of the robot conveyed through its head and body gestures can be robust and recognition rates can still be high even under severe constraints. This study supported the effectiveness of using affective displays as a complementary communication modality in human-robot interaction in situations with low visibility.</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Ghojogh, Benyamin</style></author><author><style face="normal" font="default" size="100%">Karray, Fakhri</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Anomaly Detection and Prototype Selection Using Polyhedron Curvature</style></title><secondary-title><style face="normal" font="default" size="100%">Canadian Conference on Artificial Intelligence</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">anomaly detection</style></keyword><keyword><style  face="normal" font="default" size="100%">representation</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2020</style></year></dates><publisher><style face="normal" font="default" size="100%">Springer</style></publisher><pub-location><style face="normal" font="default" size="100%">Ottawa, Canada</style></pub-location><pages><style face="normal" font="default" size="100%">10</style></pages><language><style face="normal" font="default" size="100%">eng</style></language></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Ghojogh, Benyamin</style></author><author><style face="normal" font="default" size="100%">Karray, Fakhri</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Backprojection for Training Feedforward Neural Networks in the Input and Feature Spaces</style></title><secondary-title><style face="normal" font="default" size="100%">International Conference on Image Analysis and Recognition</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">Deep Neural Networks</style></keyword><keyword><style  face="normal" font="default" size="100%">neural networks</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2020</style></year></dates><publisher><style face="normal" font="default" size="100%">Springer</style></publisher><pub-location><style face="normal" font="default" size="100%">Póvoa de Varzim, Portugal (virtual)</style></pub-location><language><style face="normal" font="default" size="100%">eng</style></language></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Sushrut Bhalla</style></author><author><style face="normal" font="default" size="100%">Sriram Ganapathi Subramanian</style></author><author><style face="normal" font="default" size="100%">Mark Crowley</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Deep Multi Agent Reinforcement Learning for Autonomous Driving</style></title><secondary-title><style face="normal" font="default" size="100%">Canadian Conference on Artificial Intelligence</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2020</style></year></dates><publisher><style face="normal" font="default" size="100%">Spring, Lecture Notes in Artificial Intelligence</style></publisher><volume><style face="normal" font="default" size="100%">LNAI 12109</style></volume><pages><style face="normal" font="default" size="100%">17</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">&lt;p&gt;
	Deep Learning and back-propagation have been successfully used to perform centralized training with communication protocols among multiple agents in a cooperative environment. In this work, we present techniques for centralized training of Multi-Agent Deep Reinforcement Learning (MARL) using the model-free Deep Q-Network (DQN) as the baseline model and communication between agents. We present two novel, scalable and centralized MARL training techniques (MA-MeSN, MA- BoN), which achieve faster convergence and higher cumulative reward in complex domains like autonomous driving simulators. Subsequently, we present a memory module to achieve a decentralized cooperative pol- icy for execution and thus addressing the challenges of noise and com- munication bottlenecks in real-time communication channels. This work theoretically and empirically compares our centralized and decentralized training algorithms to current research in the field of MARL. We also present and release a new OpenAI-Gym environment which can be used for multi-agent research as it simulates multiple autonomous cars driving on a highway. We compare the performance of our centralized algorithms to existing state-of-the-art algorithms, DIAL and IMS based on cumu- lative reward achieved per episode. MA-MeSN and MA-BoN achieve a cumulative reward of at least 263% of the reward achieved by the DIAL and IMS. We also present an ablation study of the scalability of MA- BoN showing that it has a linear time and space complexity compared to quadratic for DIAL in the number of agents.
&lt;/p&gt;
</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>10</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Basiri, Mohammad Hossein</style></author><author><style face="normal" font="default" size="100%">Ghojogh, Benyamin</style></author><author><style face="normal" font="default" size="100%">Azad, Nasser L</style></author><author><style face="normal" font="default" size="100%">Fischmeister, Sebastian</style></author><author><style face="normal" font="default" size="100%">Karray, Fakhri</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Distributed Nonlinear Model Predictive Control and Metric Learning for Heterogeneous Vehicle Platooning with Cut-in/Cut-out Maneuvers</style></title><secondary-title><style face="normal" font="default" size="100%">59th IEEE Conference on Decision and Control (CDC) 2020</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2020</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">arXiv preprint arXiv:2004.00417</style></url></web-urls></urls><language><style face="normal" font="default" size="100%">eng</style></language></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Ghojogh, Benyamin</style></author><author><style face="normal" font="default" size="100%">Sikaroudi, Milad</style></author><author><style face="normal" font="default" size="100%">Shafiei, Sobhan</style></author><author><style face="normal" font="default" size="100%">Tizhoosh, H.R.</style></author><author><style face="normal" font="default" size="100%">Karray, Fakhri</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Fisher Discriminant Triplet and Contrastive Losses for Training Siamese Networks</style></title><secondary-title><style face="normal" font="default" size="100%">IEEE International Joint Conference on Neural Networks (IJCNN)</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2020</style></year></dates><publisher><style face="normal" font="default" size="100%">IEEE</style></publisher><pub-location><style face="normal" font="default" size="100%">Glasgow, UK</style></pub-location><language><style face="normal" font="default" size="100%">eng</style></language></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Ghojogh, Benyamin</style></author><author><style face="normal" font="default" size="100%">Karray, Fakhri</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Generalized Subspace Learning by Roweis Discriminant Analysis</style></title><secondary-title><style face="normal" font="default" size="100%">International Conference on Image Analysis and Recognition</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">Data Reduction</style></keyword><keyword><style  face="normal" font="default" size="100%">Manifold learning</style></keyword><keyword><style  face="normal" font="default" size="100%">Numerosity Reduction</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2020</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">http://arxiv.org/abs/1910.05437</style></url></web-urls></urls><publisher><style face="normal" font="default" size="100%">Springer</style></publisher><pub-location><style face="normal" font="default" size="100%">Póvoa de Varzim, Portugal (virtual)</style></pub-location><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">We present a new method which generalizes subspace learning based on eigenvalue and generalized eigenvalue problems. This method, Roweis Discriminant Analysis (RDA), is named after Sam Roweis to whom the field of subspace learning owes significantly. RDA is a family of infinite number of algorithms where Principal Component Analysis (PCA), Supervised PCA (SPCA), and Fisher Discriminant Analysis (FDA) are special cases. One of the extreme special cases, which we name Double Supervised Discriminant Analysis (DSDA), uses the labels twice; it is novel and has not appeared elsewhere. We propose a dual for RDA for some special cases. We also propose kernel RDA, generalizing kernel PCA, kernel SPCA, and kernel FDA, using both dual RDA and representation theory. Our theoretical analysis explains previously known facts such as why SPCA can use regression but FDA cannot, why PCA and SPCA have duals but FDA does not, why kernel PCA and kernel SPCA use kernel trick but kernel FDA does not, and why PCA is the best linear method for reconstruction. Roweisfaces and kernel Roweisfaces are also proposed generalizing eigenfaces, Fisherfaces, supervised eigenfaces, and their kernel variants. We also report experiments showing the effectiveness of RDA and kernel RDA on some benchmark datasets.</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>10</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Ma, Haoran</style></author><author><style face="normal" font="default" size="100%">Ghojogh, Benyamin</style></author><author><style face="normal" font="default" size="100%">Samad, Maria N</style></author><author><style face="normal" font="default" size="100%">Zheng, Dongyu</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Isolation Mondrian Forest for Batch and Online Anomaly Detection</style></title><secondary-title><style face="normal" font="default" size="100%">IEEE International Conference on Systems, Man, and Cybernetics (SMC) 2020</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2020</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">arXiv preprint arXiv:2003.03692</style></url></web-urls></urls><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">&lt;p&gt;
	We propose a new method, named isolation Mon- drian forest (iMondrian forest), for batch and online anomaly detection. The proposed method is a novel hybrid of isolation forest and Mondrian forest which are existing methods for batch anomaly detection and online random forest, respectively. iMondrian forest takes the idea of isolation, using the depth of a node in a tree, and implements it in the Mondrian forest structure. The result is a new data structure which can accept streaming data in an online manner while being used for anomaly detection. Our experiments show that iMondrian forest mostly performs better than isolation forest in batch settings and has better or comparable performance against other batch and online anomaly detection methods.
&lt;/p&gt;
</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Sriram Ganapathi Subramanian</style></author><author><style face="normal" font="default" size="100%">Pascal Poupart</style></author><author><style face="normal" font="default" size="100%">Matthew E. Taylor</style></author><author><style face="normal" font="default" size="100%">Nidhi Hegde</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Multi Type Mean Field Reinforcement Learning</style></title><secondary-title><style face="normal" font="default" size="100%">AAMAS 2020</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">mean-field-reinforcement-learning</style></keyword><keyword><style  face="normal" font="default" size="100%">proj-meanfield</style></keyword><keyword><style  face="normal" font="default" size="100%">reinforcement-learning</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2020</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">https://www.semanticscholar.org/paper/Multi-Type-Mean-Field-Reinforcement-Learning-Subramanian-Poupart/9a32f1527193f0cc753acc0606886a28ede95c8e</style></url></web-urls></urls><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">Mean field theory provides an effective way of scaling multiagent reinforcement learning algorithms to environments with many agents that can be abstracted by a virtual mean agent. In this paper, we extend mean field multiagent algorithms to multiple types. The types enable the relaxation of a core assumption in mean field games, which is that all agents in the environment are playing almost similar strategies and have the same goal. We conduct experiments on three different testbeds for the field of many agent reinforcement learning, based on the standard MAgents framework. We consider two different kinds of mean field games: a) Games where agents belong to predefined types that are known a priori and b) Games where the type of each agent is unknown and therefore must be learned based on observations. We introduce new algorithms for each type of game and demonstrate their superior performance over state of the art algorithms that assume that all agents belong to the same type and other baseline algorithms in the MAgent framework</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>25</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Du, Zhiyuan</style></author><author><style face="normal" font="default" size="100%">Lull, Joseph</style></author><author><style face="normal" font="default" size="100%">Malhan, Rajesh</style></author><author><style face="normal" font="default" size="100%">Ganapathi Subramanian, Sriram</style></author><author><style face="normal" font="default" size="100%">Bhalla, Sushrut</style></author><author><style face="normal" font="default" size="100%">Sambee, Jaspreet</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author><author><style face="normal" font="default" size="100%">Fischmeister, Sebastian</style></author><author><style face="normal" font="default" size="100%">Shin, Donghyun</style></author><author><style face="normal" font="default" size="100%">Melek, William</style></author><author><style face="normal" font="default" size="100%">Fidan, Baris</style></author><author><style face="normal" font="default" size="100%">Woo, Ami</style></author><author><style face="normal" font="default" size="100%">Sahoo, Bismaya</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Multi-Level Collaborative Control System With Dual Neural Network Planning For Autonomous Vehicle Control In A Noisy Environment</style></title></titles><dates><year><style  face="normal" font="default" size="100%">2020</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">https://patents.google.com/patent/US11131992B2/en</style></url></web-urls></urls><edition><style face="normal" font="default" size="100%">United States of America</style></edition><volume><style face="normal" font="default" size="100%">US11131992B2</style></volume><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">&lt;p&gt;
	[[{&quot;fid&quot;:&quot;8001&quot;,&quot;view_mode&quot;:&quot;default&quot;,&quot;type&quot;:&quot;media&quot;,&quot;attributes&quot;:{&quot;height&quot;:&quot;1346&quot;,&quot;width&quot;:&quot;1875&quot;,&quot;style&quot;:&quot;width: 600px; height: 431px;&quot;,&quot;alt&quot;:&quot;Design schematic for patent&quot;,&quot;title&quot;:&quot;Design schematic for patent&quot;,&quot;class&quot;:&quot;media-element file-default&quot;}}]]
&lt;/p&gt;

&lt;p&gt;
	&amp;nbsp;
&lt;/p&gt;

&lt;p&gt;
	A RLP system for a host vehicle includes a memory and levels. The memory stores a RLP algorithm, which is a multi-agent collaborative DQN with PER algorithm. A first level includes a data processing module that provides sensor data, object location data, and state information of the host vehicle and other vehicles. A second level includes a coordinate location module that, based on the sensor data, the object location data, the state information, and a refined policy provided by the third level, generates an updated policy and a set of future coordinate locations implemented via the first level. A third level includes evaluation and target neural networks and a processor that executes instructions of the RLP algorithm for collaborative action planning between the host and other vehicles based on outputs of the evaluation and target networks and to generate the refined policy based on reward values associated with events.
&lt;/p&gt;
</style></abstract><issue><style face="normal" font="default" size="100%">US Patent Office</style></issue><section><style face="normal" font="default" size="100%">US</style></section></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Milad Sikaroudi</style></author><author><style face="normal" font="default" size="100%">Benyamin Ghojogh</style></author><author><style face="normal" font="default" size="100%">Amir Safarpoor</style></author><author><style face="normal" font="default" size="100%">Fakhri Karray</style></author><author><style face="normal" font="default" size="100%">Mark Crowley</style></author><author><style face="normal" font="default" size="100%">H. R. Tizhoosh</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Offline versus Online Triplet Mining based on Extreme Distances of Histopathology Patches</style></title><secondary-title><style face="normal" font="default" size="100%">International Conference on Intelligent Systems and Computer Vision (ISCV 2020) </style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">digital pathology</style></keyword><keyword><style  face="normal" font="default" size="100%">machine learning</style></keyword><keyword><style  face="normal" font="default" size="100%">triplet mining</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2020</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">https://arxiv.org/abs/2007.02200</style></url></web-urls></urls><publisher><style face="normal" font="default" size="100%">IEEE</style></publisher><pub-location><style face="normal" font="default" size="100%">Fez-Morrocco (virtual)</style></pub-location><pages><style face="normal" font="default" size="100%">8</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">We analyze the effect of offline and online triplet mining for colorectal cancer (CRC) histopathology dataset containing 100,000 patches. We consider the extreme, i.e., farthest and nearest patches with respect to a given anchor, both in online and offline mining. While many works focus solely on how to select the triplets online (batch-wise), we also study the effect of extreme distances and neighbor patches before training in an offline fashion. We analyze the impacts of extreme cases for offline versus online mining, including easy positive, batch semi-hard, and batch hard triplet mining as well as the neighborhood component analysis loss, its proxy version, and distance weighted sampling. We also investigate online approaches based on extreme distance and comprehensively compare the performance of offline and online mining based on the data patterns and explain offline mining as a tractable generalization of the online mining with large mini-batch size. As well, we discuss the relations of different colorectal tissue types in terms of extreme distances. We found that offline mining can generate a better statistical representation of the population by working on the whole dataset.</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Coogan, Sean CP</style></author><author><style face="normal" font="default" size="100%">Jain, Piyush</style></author><author><style face="normal" font="default" size="100%">Ganapathi Subramanian, Sriram</style></author><author><style face="normal" font="default" size="100%">Taylor, Steve</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author><author><style face="normal" font="default" size="100%">Flannigan, Mike D</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">A review of machine learning applications in wildfire science and management.</style></title><secondary-title><style face="normal" font="default" size="100%">Environmental Reviews</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2020</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">https://www.nrcresearchpress.com/doi/10.1139/er-2020-0019#.X1jbKtNKhTY</style></url></web-urls></urls><volume><style face="normal" font="default" size="100%">28</style></volume><pages><style face="normal" font="default" size="100%">73</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">Artiﬁcial intelligence has been applied in wildﬁre science and management since the 1990s, with early applications including neural networks and expert systems. Since then the ﬁeld has rapidly progressed congruently with the wide adoption of machine learning (ML) methods in the environmental sciences. Here, we present a scoping review of ML applications in wildﬁre science and management. Our overall objective is to improve awareness of ML methods among wildﬁre researchers and managers, as well as illustrate the diverse and challenging range of problems in wildﬁre science available to ML data scientists. To that end, we ﬁrst present an overview of popular ML approaches used in wildﬁre science to date, and then review the use of ML in wildﬁre science as broadly categorized into six problem domains, including: 1) fuels characterization, ﬁre detection, and mapping; 2) ﬁre weather and climate change; 3) ﬁre occurrence, susceptibility, and risk; 4) ﬁre behavior prediction; 5) ﬁre eﬀects; and 6) ﬁre management. Furthermore, we discuss the advantages and limitations of various ML approaches relating to data size, computational requirements, generalizability, and interpretability, as well as identify opportunities for future advances in the science and management of wildﬁres within a data science context. In total, we identfied 300 relevant publications up to the end of 2019, where the most frequently used ML methods across problem domains included random forests, MaxEnt, artiﬁcial neural networks, decision trees, support vector machines, and genetic algorithms. As such, there exists opportunities to apply more current ML methods — including deep learning and agent based learning — in the wildﬁre sciences, especially in instances involving very large multivariate datasets. We must recognize, however, that despite the ability of ML methods to learn on their own, expertise in wildﬁre science is necessary to ensure realistic modelling of ﬁre processes across multiple scales, while the complexity of some ML methods, such as deep learning, requires a dedicated and sophisticated knowledge of their application. Finally, we stress that the wildﬁre research and management communities play an active role in providing relevant, high quality, and freely available wildﬁre data for use by practitioners of ML methods.</style></abstract><issue><style face="normal" font="default" size="100%">3</style></issue></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Sikaroudi, Milad</style></author><author><style face="normal" font="default" size="100%">Safarpoor, Amir</style></author><author><style face="normal" font="default" size="100%">Ghojogh, Benyamin</style></author><author><style face="normal" font="default" size="100%">Shafiei, Sobhan</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author><author><style face="normal" font="default" size="100%">Tizhoosh, HR</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Supervision and Source Domain Impact on Representation Learning: A Histopathology Case Study</style></title><secondary-title><style face="normal" font="default" size="100%">International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC'20)</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">digipath</style></keyword><keyword><style  face="normal" font="default" size="100%">digital pathology</style></keyword><keyword><style  face="normal" font="default" size="100%">histopathology</style></keyword><keyword><style  face="normal" font="default" size="100%">medical</style></keyword><keyword><style  face="normal" font="default" size="100%">representation learning</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2020</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">https://embs.papercept.net/conferences/scripts/rtf/EMBC20_ContentListWeb_1.html#moat2-15_02</style></url></web-urls></urls><publisher><style face="normal" font="default" size="100%">IEEE Engineering in Medicine and Biology Society</style></publisher><pub-location><style face="normal" font="default" size="100%">42nd Annual International Conference of the IEEE Engineering in Medicine and Biology Society (EMBC'20)</style></pub-location><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">&lt;p&gt;
	As many algorithms depend on a suitable representation of data, learning unique features is considered a crucial task. Although supervised techniques using deep neural networks have boosted the performance of representation learning, the need for a large set of labeled data limits the application of such methods. As an example, high-quality delineations of regions of interest in the field of pathology is a tedious and time-consuming task due to the large image dimensions. In this work, we explored the performance of a deep neural network and triplet loss in the area of representation learning. We investigated the notion of similarity and dissimilarity in pathology whole-slide images and compared different setups from unsupervised and semi-supervised to supervised learning in our experiments. Additionally, different approaches were tested, applying few-shot learning on two publicly available pathology image datasets. We achieved high accuracy and generalization when the learned representations were applied to two different pathology datasets.&lt;br&gt;&amp;nbsp;
&lt;/p&gt;

&lt;p&gt;
	&amp;nbsp;
&lt;/p&gt;
</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Ghojogh, Benyamin</style></author><author><style face="normal" font="default" size="100%">Karray, Fakhri</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Theoretical Insights into the Use of Structural Similarity Index In Generative Models and Inferential Autoencoders</style></title><secondary-title><style face="normal" font="default" size="100%">International Conference on Image Analysis and Recognition</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">image understanding</style></keyword><keyword><style  face="normal" font="default" size="100%">Manifold learning</style></keyword><keyword><style  face="normal" font="default" size="100%">representation</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2020</style></year></dates><publisher><style face="normal" font="default" size="100%">Springer</style></publisher><pub-location><style face="normal" font="default" size="100%">Póvoa de Varzim, Portugal (virtual)</style></pub-location><language><style face="normal" font="default" size="100%">eng</style></language></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Akgun, Sami Alperen</style></author><author><style face="normal" font="default" size="100%">Ghafurian, Moojan</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author><author><style face="normal" font="default" size="100%">Dautenhahn, Kerstin</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Using Emotions to Complement Multi-Modal Human-Robot Interaction in Urban Search and Rescue Scenarios</style></title><secondary-title><style face="normal" font="default" size="100%">22nd International Conference on Multimodal Interaction (ICMI-2020)</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">disaster-response</style></keyword><keyword><style  face="normal" font="default" size="100%">multimodel interaction</style></keyword><keyword><style  face="normal" font="default" size="100%">robotics</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2020</style></year><pub-dates><date><style  face="normal" font="default" size="100%">October</style></date></pub-dates></dates><pub-location><style face="normal" font="default" size="100%">Utrecht, the Netherlands</style></pub-location><pages><style face="normal" font="default" size="100%">9</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">In this study, we investigated if there is consensus in using emotions as an additional communication channel to support multi-modal Human-Robot Interactions (HRI) in urban search and rescue scenarios, in which effectiveness of interactions is the key to success. The natural mappings between specific situations that might happen during missions and the proper emotions for the robots to show in that situation were obtained through a Mturk study with 78 participants. Suggested mappings for a set of 10 common situations were provided, which can be used to improve HRI in urban search and rescue situations.</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Ghojogh, Benyamin</style></author><author><style face="normal" font="default" size="100%">Sikaroudi, Milad</style></author><author><style face="normal" font="default" size="100%">Tizhoosh, H.R.</style></author><author><style face="normal" font="default" size="100%">Karray, Fakhri</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Weighted Fisher Discriminant Analysisin the Input and Feature Spaces</style></title><secondary-title><style face="normal" font="default" size="100%">International Conference on Image Analysis and Recognition (ICIAR)</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">Manifold learning</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2020</style></year></dates><publisher><style face="normal" font="default" size="100%">Springer</style></publisher><pub-location><style face="normal" font="default" size="100%">Póvoa de Varzim, Portugal (virtual)</style></pub-location><language><style face="normal" font="default" size="100%">eng</style></language></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>34</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Ghojogh, Benyamin</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author><author><style face="normal" font="default" size="100%">Karray, Fakhri</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Addressing the Mystery of Population Decline of the Rose-Crested Blue Pipit in a Nature Preserve using Data Visualization</style></title><secondary-title><style face="normal" font="default" size="100%">ArXiv Preprint. ArXiv: 1903.06671</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2019</style></year></dates><language><style face="normal" font="default" size="100%">eng</style></language></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Hadi NekoeiQachkanloo</style></author><author><style face="normal" font="default" size="100%">Benyamin Ghojogh</style></author><author><style face="normal" font="default" size="100%">Ali Saheb Pasand</style></author><author><style face="normal" font="default" size="100%">Mark Crowley</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Artificial Counselor System For Stock Investment</style></title><secondary-title><style face="normal" font="default" size="100%">Innovative Applications of Artificial Intelligence (IAAI-19)</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2019</style></year><pub-dates><date><style  face="normal" font="default" size="100%">27 January </style></date></pub-dates></dates><urls><web-urls><url><style face="normal" font="default" size="100%">https://aaai.org/ojs/index.php/AAAI/article/view/5016</style></url></web-urls></urls><publisher><style face="normal" font="default" size="100%">AAAI Press.</style></publisher><pub-location><style face="normal" font="default" size="100%">IAAI-19 Conference, Honolulu, Hawaii, USA, 2019.</style></pub-location><pages><style face="normal" font="default" size="100%">8</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">This paper proposes a novel trading system which plays the role of an artificial counselor for stock investment. In this paper, the stock future prices (technical features) are predicted using Support Vector Regression from the previous stock prices, average directional index, and parabolic stop and reverse index. Thereafter, the predicted prices are used to recommend which portions of the budget an investor should invest in different existing stocks to have an optimum expected profit considering their level of risk tolerance. Two different methods are used for suggesting best portions, which are Markowitz portfolio theory and fuzzy investment counselor. The first approach is an optimization-based method which considers merely technical features, while the second approach is based on Fuzzy Logic taking into account both technical and fundamental features of the stock market. The utilized dataset in this work is the New York Stock Exchange (NYSE). The experiments are performed on price prediction as well as optimum investment in various well-known stocks for a period of 30 days, and the results show the effectiveness of the proposed system.</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Sushrut Bhalla</style></author><author><style face="normal" font="default" size="100%">Matthew Yao</style></author><author><style face="normal" font="default" size="100%">Jean-Pierre Hickey</style></author><author><style face="normal" font="default" size="100%">Mark Crowley</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Compact Representation of a Multi-dimensional Combustion Manifold Using Deep Neural Networks</style></title><secondary-title><style face="normal" font="default" size="100%">European Conference on Machine Learning</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2019</style></year></dates><pub-location><style face="normal" font="default" size="100%">Wurzburg, Germany</style></pub-location><pages><style face="normal" font="default" size="100%">8</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">&lt;p&gt;
	[[{&quot;fid&quot;:&quot;4045&quot;,&quot;view_mode&quot;:&quot;default&quot;,&quot;type&quot;:&quot;media&quot;,&quot;attributes&quot;:{&quot;height&quot;:&quot;465&quot;,&quot;width&quot;:&quot;538&quot;,&quot;style&quot;:&quot;float: right; width: 250px; height: 216px; border-width: 1px; border-style: solid; margin-left: 10px; margin-right: 10px;&quot;,&quot;alt&quot;:&quot;Example of Flamelet model&quot;,&quot;title&quot;:&quot;Example of Flamelet model&quot;,&quot;class&quot;:&quot;media-element file-default&quot;}}]]The computational challenges in turbulent combustion simulations stem from the physical complexities and multi-scale nature of the problem which make it intractable to compute scale-resolving simulations. For most engineering applications, the large scale separation between the flame (typically sub-millimeter scale) and the characteristic turbulent flow (typically centimeter or meter scale) &amp;nbsp;allows us to evoke simplifying assumptions--such as done for the flamelet model--to pre-compute all the chemical reactions and map them to a low-order manifold. The resulting manifold is then tabulated and looked-up at run-time. As the physical complexity of combustion simulations increases (including radiation, soot formation, pressure variations etc.) the dimensionality of the resulting manifold grows which impedes an efficient tabulation and look-up. In this paper we present a novel approach to model the multi-dimensional combustion manifold. We approximate the combustion manifold using a neural network function approximator and use it to predict the temperature and composition of the reaction. We present a novel training procedure which is developed to generate a smooth output curve for temperature over the course of a reaction. We then evaluate our work against the current approach of tabulation with linear interpolation in combustion simulations. We also provide an ablation study of our training procedure in the context of over-fitting in our model. The combustion dataset used for the modeling of combustion of H2 and O2 in this work is&lt;br&gt;released alongside this paper. See the &lt;a href=&quot;https://www.uwaterloo.ca/scholar/sites/ca.scholar/files/mcrowley/files/ecml_poster.pdf&quot;&gt;poster version&lt;/a&gt; here.
&lt;/p&gt;

&lt;p&gt;
	&amp;nbsp;
&lt;/p&gt;

&lt;p&gt;
	[[{&quot;fid&quot;:&quot;4044&quot;,&quot;view_mode&quot;:&quot;default&quot;,&quot;type&quot;:&quot;media&quot;,&quot;attributes&quot;:{&quot;style&quot;:&quot;margin-left: 10px; margin-right: 10px; width: 830px; height: 266px;&quot;,&quot;alt&quot;:&quot;Prediction Results&quot;,&quot;title&quot;:&quot;Prediction Results&quot;,&quot;class&quot;:&quot;media-element file-default &quot;}}]]
&lt;/p&gt;


</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Carrillo, Juan</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author><author><style face="normal" font="default" size="100%">Pan, Guangyuan</style></author><author><style face="normal" font="default" size="100%">Fu, Liping</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Comparison of Deep Learning models for Determining Road Surface Condition from Roadside Camera Images and Weather Data</style></title><secondary-title><style face="normal" font="default" size="100%">The Transportation Association of Canada and Intelligent Transportation Systems Canada Joint Conference (TAC-ITS)</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2019</style></year></dates><pub-location><style face="normal" font="default" size="100%">Halifax, Canada</style></pub-location><pages><style face="normal" font="default" size="100%">16</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">&lt;p&gt;
	[[{&quot;fid&quot;:&quot;4043&quot;,&quot;view_mode&quot;:&quot;default&quot;,&quot;type&quot;:&quot;media&quot;,&quot;attributes&quot;:{&quot;style&quot;:&quot;float: right; width: 700px; height: 269px;&quot;,&quot;alt&quot;:&quot;Example Image of Levels of Snow Road Cover&quot;,&quot;title&quot;:&quot;Example Image of Levels of Snow Road Cover&quot;,&quot;class&quot;:&quot;media-element file-default &quot;}}]]Road maintenance during the Winter season is a safety critical and resource demanding operation. One of its key activities is determining road surface condition (RSC) in order to prioritize roads and allocate cleaning efforts such as plowing or salting. Two conventional approaches for determining RSC are: visual examination of roadside camera images by trained personnel and patrolling the roads to perform on-site inspections. However, with more than 500 cameras collecting images across Ontario, visual examination becomes a resource-intensive activity, difficult to scale especially during periods of snowstorms. This paper presents the results of a study focused on improving the efficiency of road maintenance operations. We use multiple Deep Learning models to automatically determine RSC from roadside camera images and weather variables, extending previous research where similar methods have been used to deal with the problem. The dataset we use was collected during the 2017-2018 Winter season from 40 stations connected to the Ontario Road Weather Information System (RWIS), it includes 14.000 labeled images and 70.000 weather measurements. We train and evaluate the performance of seven state-of-the-art models from the Computer Vision literature, including the recent DenseNet, NASNet, and MobileNet. Also, by integrating observations from weather variables, the models are able to better ascertain RSC under poor visibility conditions.
&lt;/p&gt;


</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Carrillo, J.</style></author><author><style face="normal" font="default" size="100%">Crowley, M.</style></author><author><style face="normal" font="default" size="100%">Pan, G.</style></author><author><style face="normal" font="default" size="100%">Fu, L.</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Comparison of Deep Learning models for Determining Road Surface Condition from Roadside Camera Images and Weather Data</style></title><secondary-title><style face="normal" font="default" size="100%">TAC-ITS Canada Joint Conference</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">machine learning</style></keyword><keyword><style  face="normal" font="default" size="100%">smart cities</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2019</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">https://tac-its.ca/conference-papers/comparison-deep-learning-models-determining-road-surface-condition-roadside-camera</style></url></web-urls></urls><pub-location><style face="normal" font="default" size="100%">Halifax, Canada</style></pub-location><pages><style face="normal" font="default" size="100%">17</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">Road maintenance during the Winter season is a safety critical and resource demanding operation. One of its key activities is determining road surface condition (RSC) in order to prioritize roads and allocate cleaning efforts such as plowing or salting. Two conventional approaches for determining RSC are: visual examination of roadside camera images by trained personnel and patrolling the roads to perform on-site inspections. However, with more than 500 cameras collecting images across Ontario, visual examination becomes a resource-intensive activity, difficult to scale especially during periods of snow storms. This paper presents the preliminary results of an ongoing study focused on improving the efficiency of road maintenance operations. We use multiple Deep Learning models to automatically determine RSC from roadside camera images and weather variables, extending previous research where similar methods have been used to deal with the problem. The dataset we use was collected during the 2017-2018 Winter season from 40 stations connected to the Ontario Road Weather Information System (RWIS), it includes 14.000 labelled images and 70.000 weather measurements. In particular, we train and evaluate the performance of seven state-of-the-art models from the Computer Vision literature, including the recent DenseNet, NASNet, and MobileNet. Also, by integrating observations from weather variables, the models are able to better ascertain RSC under poor visibility conditions.</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>34</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Ghojogh, Benyamin</style></author><author><style face="normal" font="default" size="100%">Salehkaleybar, Saber</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Distributed Voting in Beep Model</style></title><secondary-title><style face="normal" font="default" size="100%">arXiv preprint arXiv:1910.09882</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2019</style></year></dates><language><style face="normal" font="default" size="100%">eng</style></language></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>34</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Ghojogh, Benyamin</style></author><author><style face="normal" font="default" size="100%">Karray, Fakhri</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Eigenvalue and Generalized Eigenvalue Problems: Tutorial</style></title><secondary-title><style face="normal" font="default" size="100%">ArXiv Preprint arXiv:1903.11240</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2019</style></year></dates><language><style face="normal" font="default" size="100%">eng</style></language></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>34</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Ghojogh, Benyamin</style></author><author><style face="normal" font="default" size="100%">Samad, Maria N</style></author><author><style face="normal" font="default" size="100%">Mashhadi, Sayema Asif</style></author><author><style face="normal" font="default" size="100%">Kapoor, Tania</style></author><author><style face="normal" font="default" size="100%">Ali, Wahab</style></author><author><style face="normal" font="default" size="100%">Karray, Fakhri</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Feature selection and feature extraction in pattern analysis: A literature review</style></title><secondary-title><style face="normal" font="default" size="100%">arXiv preprint arXiv:1905.02845</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2019</style></year></dates><language><style face="normal" font="default" size="100%">eng</style></language></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>34</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Ghojogh, Benyamin</style></author><author><style face="normal" font="default" size="100%">Karray, Fakhri</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Fisher and kernel fisher discriminant analysis: Tutorial</style></title><secondary-title><style face="normal" font="default" size="100%">arXiv preprint arXiv:1906.09436</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2019</style></year></dates><language><style face="normal" font="default" size="100%">eng</style></language></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>34</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Ghojogh, Benyamin</style></author><author><style face="normal" font="default" size="100%">Ghojogh, Aydin</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author><author><style face="normal" font="default" size="100%">Karray, Fakhri</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Fitting A Mixture Distribution to Data: Tutorial</style></title><secondary-title><style face="normal" font="default" size="100%">ArXiv preprint. arXiv:1901.06708</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">parameter estimation</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2019</style></year></dates><language><style face="normal" font="default" size="100%">eng</style></language></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>34</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Ghojogh, Benyamin</style></author><author><style face="normal" font="default" size="100%">Karray, Fakhri</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Hidden Markov Model: Tutorial</style></title></titles><dates><year><style  face="normal" font="default" size="100%">2019</style></year></dates><publisher><style face="normal" font="default" size="100%">engrXiv</style></publisher><language><style face="normal" font="default" size="100%">eng</style></language></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Ghojogh, Benyamin</style></author><author><style face="normal" font="default" size="100%">Karray, Fakhri</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Image Structure Subspace Learning Using Structural Similarity Index</style></title><secondary-title><style face="normal" font="default" size="100%">International Conference on Image Analysis and Recognition (ICIAR-19)</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">Data Reduction</style></keyword><keyword><style  face="normal" font="default" size="100%">Manifold learning</style></keyword><keyword><style  face="normal" font="default" size="100%">Numerosity Reduction</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2019</style></year></dates><publisher><style face="normal" font="default" size="100%">Springer, Cham</style></publisher><pub-location><style face="normal" font="default" size="100%">Waterloo, Canada</style></pub-location><pages><style face="normal" font="default" size="100%">33–44</style></pages><language><style face="normal" font="default" size="100%">eng</style></language></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Benyamin Ghojogh</style></author><author><style face="normal" font="default" size="100%">Mark Crowley</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Instance Ranking and Numerosity Reduction Using Matrix Decompositionand Subspace Learning</style></title><secondary-title><style face="normal" font="default" size="100%">Canadian Conference on Artificial Intelligence</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2019</style></year></dates><publisher><style face="normal" font="default" size="100%">Springer’s Lecture Notes in Artificial Intelligence.</style></publisher><pub-location><style face="normal" font="default" size="100%">Kingston, ON, Canada</style></pub-location><pages><style face="normal" font="default" size="100%">12</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">&lt;p&gt;
	One way to deal with the ever increasing amount of available data for processing is to rank data instances by usefulness and reduce the dataset size. In this work, we introduce a framework to achieve this using matrix decomposition and subspace learning. Our central contribution is a novel similarity measure for data instances that uses the basis obtained from matrix decomposition of the dataset. Using this similarity measure, we propose several related algorithms for ranking data instances and performing numerosity reduction. We then validate the effectiveness of these algorithms for data reduction on several datasets for classification, regression, and clustering tasks.
&lt;/p&gt;
</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Juan Carrillo</style></author><author><style face="normal" font="default" size="100%">Mark Crowley</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Integration of Roadside Camera Images and Weather Data for monitoring Winter Road Surface Conditions</style></title><secondary-title><style face="normal" font="default" size="100%">Canadian Association of Road Safety Professionals CARSP Conference</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2019</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">http://www.carsp.ca/research/research-papers/research-papers-search/download-info/integration-of-roadside-camera-images-and-weather-data-for-monitoring-winter-road-surface-conditions/</style></url></web-urls></urls><pub-location><style face="normal" font="default" size="100%">CARSP Conference, Calgary, Alberta. </style></pub-location><pages><style face="normal" font="default" size="100%">4 (Won best paper award!)</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">&lt;p&gt;
	&lt;strong&gt;This won the best paper award!&lt;/strong&gt;
&lt;/p&gt;

&lt;p&gt;
	&amp;nbsp;
&lt;/p&gt;

&lt;p&gt;
	Background/Context: During the Winter season, real-time monitoring of road surface conditions is critical for the safety of drivers and road maintenance operations. Previous research has evaluated the potential of image classification methods for detecting road snow coverage by processing images from roadside cameras installed in RWIS (Road Weather Information System) stations. However, it is a challenging task due to limitations such as image resolution, camera angle, and illumination. Two common approaches to improve the accuracy of image classification methods are: adding more input features to the model and increasing the number of samples in the training dataset. Additional input features can be weather variables and more sample images can be added by including other roadside cameras. Although RWIS stations are equipped with both cameras and weather measurement instruments, they are only a subset of the total number of roadside cameras installed across transportation networks, most of which do not have weather measurement instruments. Thus, improvements in use of image data could benefit from additional data sources.
&lt;/p&gt;

&lt;p&gt;
	Aims/Objectives: The first objective of this study is to complete an exploratory data analysis over three data sources in Ontario: RWIS stations, all the other MTO (Ministry of Transportation of Ontario) roadside cameras, and Environment Canada weather stations. The second objective is to determine the feasibility of integrating these three datasets into a more extensive and richer dataset with weather variables as additional features and other MTO roadside cameras as additional sources of images.
&lt;/p&gt;

&lt;p&gt;
	Methods/Targets: First, we quantify the advantage of adding other MTO roadside cameras using spatial statistics, the number of monitored roads, and the coverage of ecoregions with different climate regimes. We then analyze experimental variograms from the literature and determine the feasibility of using Environment Canada stations and RWIS stations to interpolate weather variables for all the other MTO roadside cameras without weather instruments.
&lt;/p&gt;

&lt;p&gt;
	Results/Activities: By adding all other MTO cameras as image data sources, the total number of cameras in the dataset increases from 139 to 578 across Ontario. The average distance to the nearest camera decreases from 38.4km to 9.4km, and the number of monitored roads increases approximately four times. Additionally, six times more cameras are available in the four most populated ecoregions in Ontario. The experimental variograms show that it is feasible to interpolate weather variables with reasonable accuracy. Moreover, observations in the three datasets are collected with similar frequency, which facilitates our data integration approach.
&lt;/p&gt;

&lt;p&gt;
	Discussion/Deliverables: Integrating these three datasets is feasible and can benefit the design and development of automated image classification methods for monitoring road snow coverage. We do not consider data from pavement-embedded sensors, an additional line of research may explore the integration of this data. Our approach can provide actionable insights which can be used to more selectively perform manual patrolling to better identify road surface conditions.
&lt;/p&gt;

&lt;p&gt;
	Conclusions: Our initial results are promising and demonstrate that additional, image only datasets can be added to road monitoring data by using existing multimodal sensors as ground truth, which will lead to greater performance on the future image classification tasks.
&lt;/p&gt;

&lt;p&gt;
	&amp;nbsp;
&lt;/p&gt;
</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Ganapthi Subramanian, Sriram</style></author><author><style face="normal" font="default" size="100%">Bhalla, Sushrut</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Learning Multi-Agent Communication with Reinforcement Learning</style></title><secondary-title><style face="normal" font="default" size="100%">Conference on Reinforcement Learning and Decision Making (RLDM-19)</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2019</style></year></dates><pub-location><style face="normal" font="default" size="100%">Montreal, Canada.</style></pub-location><pages><style face="normal" font="default" size="100%">4</style></pages><language><style face="normal" font="default" size="100%">eng</style></language></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>34</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Ghojogh, Benyamin</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Linear and Quadratic Discriminant Analysis: Tutorial</style></title><secondary-title><style face="normal" font="default" size="100%">arXiv preprint arXiv:1906.02590</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2019</style></year></dates><language><style face="normal" font="default" size="100%">eng</style></language></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Ghojogh, Benyamin</style></author><author><style face="normal" font="default" size="100%">Karray, Fakhri</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Locally Linear Image Structural Embedding for Image Structure Manifold Learning</style></title><secondary-title><style face="normal" font="default" size="100%">International Conference on Image Analysis and Recognition (ICIAR-19)</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">Data Reduction</style></keyword><keyword><style  face="normal" font="default" size="100%">Manifold learning</style></keyword><keyword><style  face="normal" font="default" size="100%">Numerosity Reduction</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2019</style></year></dates><publisher><style face="normal" font="default" size="100%">Springer, Cham</style></publisher><pub-location><style face="normal" font="default" size="100%">Waterloo, Canada</style></pub-location><pages><style face="normal" font="default" size="100%">126–138</style></pages><language><style face="normal" font="default" size="100%">eng</style></language></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Ghojogh, Benyamin</style></author><author><style face="normal" font="default" size="100%">Karray, Fakhri</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Principal Component Analysis Using Structural Similarity Index for Images</style></title><secondary-title><style face="normal" font="default" size="100%">International Conference on Image Analysis and Recognition (ICIAR-19)</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">Data Reduction</style></keyword><keyword><style  face="normal" font="default" size="100%">Manifold learning</style></keyword><keyword><style  face="normal" font="default" size="100%">Numerosity Reduction</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2019</style></year></dates><publisher><style face="normal" font="default" size="100%">Springer, Cham</style></publisher><pub-location><style face="normal" font="default" size="100%">Waterloo, Canada</style></pub-location><pages><style face="normal" font="default" size="100%">77–88</style></pages><language><style face="normal" font="default" size="100%">eng</style></language></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>34</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Ghojogh, Benyamin</style></author><author><style face="normal" font="default" size="100%">Pasand, Ali Saheb</style></author><author><style face="normal" font="default" size="100%">Karray, Fakhri</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Quantized Fisher Discriminant Analysis</style></title><secondary-title><style face="normal" font="default" size="100%">arXiv preprint arXiv:1909.03037</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2019</style></year></dates><language><style face="normal" font="default" size="100%">eng</style></language></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Juan Manuel Carrillo Garcia</style></author><author><style face="normal" font="default" size="100%">Daniel Garijo</style></author><author><style face="normal" font="default" size="100%">Mark Crowley</style></author><author><style face="normal" font="default" size="100%">Rober Carrillo</style></author><author><style face="normal" font="default" size="100%">Yolanda Gil</style></author><author><style face="normal" font="default" size="100%">Katherine Borda</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Semantic Workflows and Machine Learning for the Assessment of Carbon Storage by Urban Trees</style></title><secondary-title><style face="normal" font="default" size="100%">Third International Workshop on Capturing Scientific Knowledge (SciKnow19)</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">computational sustainability</style></keyword><keyword><style  face="normal" font="default" size="100%">machine learning</style></keyword><keyword><style  face="normal" font="default" size="100%">Semantics</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2019</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">https://www.semanticscholar.org/paper/Semantic-Workflows-and-Machine-Learning-for-the-of-Garcia-Garijo/cb059bd7de50122a9a7b5a778e04e21f2c02b2c6</style></url></web-urls></urls><pub-location><style face="normal" font="default" size="100%">Los Angeles, California, USA</style></pub-location><pages><style face="normal" font="default" size="100%">1–6</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">Climate science is critical for understanding both the causes and consequences of changes in global temperatures and has become imperative for decisive policy-making. However, climate science studies commonly require addressing complex interoperability issues between data, software, and experimental approaches from multiple fields. Scientific workflow systems provide unparalleled advantages to address these issues, including reproducibility of experiments, provenance capture, software reusability and knowledge sharing. In this paper, we introduce a novel workflow with a series of connected components to perform spatial data preparation, classification of satellite imagery with machine learning algorithms, and assessment of carbon stored by urban trees. To the best of our knowledge, this is the first study that estimates carbon storage for a region in Africa following the guidelines from the Intergovernmental Panel on Climate Change (IPCC).</style></abstract><notes><style face="normal" font="default" size="100%">1</style></notes></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>34</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Ghojogh, Benyamin</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">The theory behind overfitting, cross validation, regularization, bagging, and boosting: tutorial</style></title><secondary-title><style face="normal" font="default" size="100%">arXiv preprint arXiv:1905.12787</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2019</style></year></dates><language><style face="normal" font="default" size="100%">eng</style></language></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Bhalla, Sushrut</style></author><author><style face="normal" font="default" size="100%">Subramanian, Sriram Ganapathi</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author></authors><secondary-authors><author><style face="normal" font="default" size="100%">Agmon, N.</style></author><author><style face="normal" font="default" size="100%">Taylor, M.E.</style></author><author><style face="normal" font="default" size="100%">Elkind, E.</style></author><author><style face="normal" font="default" size="100%">Veloso, M.</style></author></secondary-authors></contributors><titles><title><style face="normal" font="default" size="100%">Training Cooperative Agents for Multi-Agent Reinforcement Learning</style></title><secondary-title><style face="normal" font="default" size="100%">Proc. of the 18th International Conference on Autonomous Agents and Multiagent Systems (AAMAS 2019)</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">Autonomous Driving</style></keyword><keyword><style  face="normal" font="default" size="100%">MARL</style></keyword><keyword><style  face="normal" font="default" size="100%">Multi-Agent Reinforcement Learning</style></keyword><keyword><style  face="normal" font="default" size="100%">MultiAgent Systems</style></keyword><keyword><style  face="normal" font="default" size="100%">reinforcement learning</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2019</style></year></dates><pub-location><style face="normal" font="default" size="100%">Montreal, Canada</style></pub-location><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">Deep Learning and back-propagation has been successfully used to perform centralized training with communication protocols among multiple agents in a cooperative environment. In this paper we present techniques for centralized training of Multi-Agent (Deep) Reinforcement Learning (MARL) using the model-free Deep Q-Network as the baseline model and message sharing between agents. We present a novel, scalable, centralized MARL training technique, which separates the message learning module from the policy module. The separation of these modules helps in faster convergence in complex domains like autonomous driving simulators. A second contribution uses the centrally trained model to bootstrap training of distributed, independent, cooperative agent policies for execution and thus addresses the challenges of noise and communication bottlenecks in real-time communication channels. This paper theoretically and empirically compares our centralized training algorithms to current research in the field of MARL. We also present and release a new OpenAI-Gym environment which can be used for multi-agent research as it simulates multiple autonomous cars driving cooperatively on a highway.</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>34</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Ghojogh, Benyamin</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Unsupervised and supervised principal component analysis: Tutorial</style></title><secondary-title><style face="normal" font="default" size="100%">arXiv preprint arXiv:1906.03148</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2019</style></year></dates><language><style face="normal" font="default" size="100%">eng</style></language></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Eaton, Eric</style></author><author><style face="normal" font="default" size="100%">Koenig, Sven</style></author><author><style face="normal" font="default" size="100%">Schulz, Claudia</style></author><author><style face="normal" font="default" size="100%">Maurelli, Francesco</style></author><author><style face="normal" font="default" size="100%">Lee, John</style></author><author><style face="normal" font="default" size="100%">Eckroth, Joshua</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author><author><style face="normal" font="default" size="100%">Freedman, Richard G</style></author><author><style face="normal" font="default" size="100%">Cardona-Rivera, Rogelio E</style></author><author><style face="normal" font="default" size="100%">Machado, Tiago</style></author><author><style face="normal" font="default" size="100%">Williams, Tom</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Blue Sky Ideas in Artificial Intelligence Education from the EAAI 2017 New and Future AI Educator Program</style></title><secondary-title><style face="normal" font="default" size="100%">AI Matters</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2018</style></year><pub-dates><date><style  face="normal" font="default" size="100%">feb</style></date></pub-dates></dates><urls><web-urls><url><style face="normal" font="default" size="100%">http://doi.acm.org/10.1145/3175502.3175509</style></url></web-urls></urls><number><style face="normal" font="default" size="100%">4</style></number><publisher><style face="normal" font="default" size="100%">ACM</style></publisher><pub-location><style face="normal" font="default" size="100%">New York, NY, USA</style></pub-location><volume><style face="normal" font="default" size="100%">3</style></volume><pages><style face="normal" font="default" size="100%">23–31</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">&lt;p&gt;
	The 7th Symposium on Educational Advances in Artificial Intelligence (EAAI’17, co-chaired by Sven Koenig and Eric Eaton) launched the EAAI New and Future AI Educator Program to support the training of early-career university faculty, secondary school faculty, and future educators (PhD candidates or postdocs who intend a career in academia).
&lt;/p&gt;

&lt;p&gt;
	As part of the program, awardees were asked to address one of the following “blue sky” questions:
&lt;/p&gt;

&lt;ol&gt;
	&lt;li&gt;
		How could/should Artificial Intelligence (AI) courses incorporate ethics into the curriculum?
	&lt;/li&gt;
	&lt;li&gt;
		How could we teach AI topics at an early undergraduate or a secondary school level?
	&lt;/li&gt;
	&lt;li&gt;
		AI has the potential for broad impact to numerous disciplines. How could we make AI education more interdisciplinary, specifically to benefit non-engineering fields.
	&lt;/li&gt;
&lt;/ol&gt;

&lt;p&gt;
	This paper is a collection of their responses, intended to help motivate discussion around these issues in AI education.
&lt;/p&gt;
</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Ganapathi Subramanian, Sriram</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Combining MCTS and A3C for prediction of spatially spreading processes in forest wildfire settings</style></title><secondary-title><style face="normal" font="default" size="100%">Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">a3c</style></keyword><keyword><style  face="normal" font="default" size="100%">computational sustainability</style></keyword><keyword><style  face="normal" font="default" size="100%">monte-carlo tree search</style></keyword><keyword><style  face="normal" font="default" size="100%">reinforcement-learning</style></keyword><keyword><style  face="normal" font="default" size="100%">Wildfire-Management</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2018</style></year></dates><pub-location><style face="normal" font="default" size="100%">Toronto, Ontario, Canada</style></pub-location><volume><style face="normal" font="default" size="100%">10832 LNAI</style></volume><pages><style face="normal" font="default" size="100%">285–291</style></pages><isbn><style face="normal" font="default" size="100%">9783319896557</style></isbn><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">In recent years, Deep Reinforcement Learning (RL) algorithms have shown super-human performance in a variety Atari and classic board games like chess and GO. Research into applications of RL in other domains with spatial considerations like environmental planning are still in their nascent stages. In this paper, we introduce a novel combination of Monte-Carlo Tree Search (MCTS) and A3C algorithms on an online simulator of a wildfire, on a pair of forest fires in Northern Alberta (Fort McMurray and Richardson fires) and on historical Saskatchewan fires previously compared by others to a physics-based simulator. We conduct several experiments to predict fire spread for several days before and after the given spatial information of fire spread and ignition points. Our results show that the advancements in Deep RL applications in the gaming world have advantages in spatially spreading real-world problems like forest fires. © Springer International Publishing AG, part of Springer Nature 2018.</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Subramanian, Sriram Ganapathi</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Combining MCTS and A3C for Prediction of Spatially Spreading Processes in Forest Wildfire Settings</style></title><secondary-title><style face="normal" font="default" size="100%">Canadian Conference on Artificial Intelligence</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">a3c</style></keyword><keyword><style  face="normal" font="default" size="100%">computational sustainability</style></keyword><keyword><style  face="normal" font="default" size="100%">fire</style></keyword><keyword><style  face="normal" font="default" size="100%">forest wild-</style></keyword><keyword><style  face="normal" font="default" size="100%">monte-carlo tree search</style></keyword><keyword><style  face="normal" font="default" size="100%">reinforcement learning</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2018</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">https://link.springer.com/chapter/10.1007/978-3-319-89656-4_28</style></url></web-urls></urls><edition><style face="normal" font="default" size="100%">Lecture Notes in Artificial Intelligence</style></edition><publisher><style face="normal" font="default" size="100%">Springer</style></publisher><pub-location><style face="normal" font="default" size="100%">Toronto, Ontario, Canada</style></pub-location><volume><style face="normal" font="default" size="100%">10832</style></volume><pages><style face="normal" font="default" size="100%">285-291</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">&lt;p&gt;
	&amp;nbsp;In recent years, Deep Reinforcement Learning (RL) algorithms have shown super-human performance in a variety Atari and classic board games like chess and GO. Research into applications of RL in other domains with spatial considerations like environmental planning are still in their nascent stages. In this paper, we introduce a novel combination of Monte-Carlo Tree Search (MCTS) and A3C algorithms on an online simulator of a wildfire, on a pair of forest fires in Northern Alberta (Fort McMurray and Richardson fires) and on historical Saskatchewan fires previously compared by others to a physics-based simulator. We conduct several experiments to predict fire spread for several days before and after the given spatial information of fire spread and ignition points. Our results show that the advancements in Deep RL applications in the gaming world have advantages in spatially spreading real-world problems like forest fires.&amp;nbsp;
&lt;/p&gt;
</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Subramanian, Sriram Ganapathi</style></author><author><style face="normal" font="default" size="100%">Sambee, Jaspreet Singh</style></author><author><style face="normal" font="default" size="100%">Ghojogh, Benyamin</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Decision Assist For Self-Driving Cars</style></title><secondary-title><style face="normal" font="default" size="100%">31st Canadian Conference on Artificial Intelligence, Candian AI 2018</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">autonomous cars</style></keyword><keyword><style  face="normal" font="default" size="100%">forcement learning</style></keyword><keyword><style  face="normal" font="default" size="100%">machine learning</style></keyword><keyword><style  face="normal" font="default" size="100%">obstacle avoidance</style></keyword><keyword><style  face="normal" font="default" size="100%">path planning</style></keyword><keyword><style  face="normal" font="default" size="100%">rein-</style></keyword><keyword><style  face="normal" font="default" size="100%">weather estimation</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2018</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">https://link.springer.com/chapter/10.1007%2F978-3-319-89656-4_44</style></url></web-urls></urls><edition><style face="normal" font="default" size="100%">Lecture Notes in Artificial Intelligence</style></edition><publisher><style face="normal" font="default" size="100%">Springer</style></publisher><pub-location><style face="normal" font="default" size="100%">Toronto, Ontario, Canada</style></pub-location><volume><style face="normal" font="default" size="100%">10832</style></volume><pages><style face="normal" font="default" size="100%">381-387</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">Research into self-driving cars has grown enormously in the last decade primarily due to the advances in the fields of machine intelligence and image processing. An under-appreciated aspect of self-driving cars is actively avoiding high traffic zones, low visibility zones, and routes with rough weather conditions by learning different conditions and making decisions based on trained experiences.&amp;nbsp;&lt;br&gt;This paper addresses this challenge by introducing a novel hierarchical structure for dynamic path planning and experiential learning for vehicles.&amp;nbsp;&lt;br&gt;A multistage system is proposed for detecting and compensating for weather, lighting, and traffic conditions as well as a novel adaptive path planning algorithm named \textbf{Checked State A3C}. This algorithm improves upon the existing A3C Reinforcement Learning (RL) algorithm by adding state memory which provides the ability to learn an adaptive model of the best decisions to take from experience.</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Benyamin Ghojogh</style></author><author><style face="normal" font="default" size="100%">Mark Crowley</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Principal Sample Analysis for Data Reduction</style></title><secondary-title><style face="normal" font="default" size="100%">International Conference on Big Knowledge (ICBK) </style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2018</style></year></dates><publisher><style face="normal" font="default" size="100%">IEEE, 2018.</style></publisher><pub-location><style face="normal" font="default" size="100%">Singapore</style></pub-location><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">Data reduction is an essential technique used for purifying data, training discriminative models more efficiently, encouraging generalizability, and for using less storage space for memory-limited systems.&amp;nbsp;&lt;br&gt;The literature on data reduction focuses mostly on dimensionality reduction, however, data sample reduction (i.e. removal of data points from a dataset) has its own benefits and is no less important given growing sizes of datasets and the growing need for usable data analysis methods on the network edge.&lt;br&gt;This paper proposes a new data sample reduction method, Principal Sample Analysis (PSA), which reduces the number (population) of data samples as a preprocessing step for classification. PSA ranks the samples of each class considering how well they represent it and enables better discriminative learning by using the sparsity and similarity of samples at the same time. Data sample reduction then occurs by cutting off the lowest ranked samples. The PSA method can work alongside any other data reduction/expansion and classification method. Experiments are carried out on three datasets (WDBC, AT&amp;amp;T, and MNIST) with contrasting characteristics and show the state-of-the-art effectiveness of the proposed method.&amp;nbsp;</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Subramanian, Sriram Ganapathi</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Using Spatial Reinforcement Learning to Build Forest Wildfire Dynamics Models from Satellite Images</style></title><secondary-title><style face="normal" font="default" size="100%">Frontiers in ICT: Environmental Informatics</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2018</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">https://www.frontiersin.org/articles/10.3389/fict.2018.00006/abstract</style></url></web-urls></urls><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">Machine learning algorithms have increased tremendously in power in recent years but have yet to be fully utilized in many ecology and sustainable resource management domains such as wildlife reserve design, forest fire management and invasive species spread. One thing these domains have in common is that they contain dynamics that can be characterized as a Spatially Spreading Process (SSP) which requires many parameters to be set precisely to model the dynamics, spread rates and directional biases of the elements which are spreading.&amp;nbsp;We present related work in Artificial Intelligence and Machine Learning for SSP sustainability domains including forest wildfire prediction.&amp;nbsp;We then introduce a novel approach for learning in SSP domains using Reinforcement Learning (RL) where fire is the agent at any cell in the landscape and the set of actions the fire can take from a location at any point in time includes spreading North, South, East, West or not spreading.&amp;nbsp;This approach inverts the usual RL setup since the dynamics of the corresponding Markov Decision Process (MDP) is a known function for immediate wildfire spread.&amp;nbsp;Meanwhile, we learn an agent policy for a predictive model of the dynamics of a complex spatially-spreading process.&amp;nbsp;Rewards are provided for correctly classifying which cells are on fire or not compared to satellite and other related data.&amp;nbsp;We examine the behaviour of five RL algorithms on this problem: Value Iteration, Policy Iteration, Q-Learning, Monte Carlo Tree Search and Asynchronous Advantage Actor-Critic (A3C).&amp;nbsp;We compare to a Gaussian process based supervised learning approach and discuss the relation of our approach to manually constructed, state-of-the-art methods from forest wildfire modelling.&amp;nbsp;We also discuss the relation of our approach to manually constructed, state-of-the-art methods from forest wildfire modelling.&amp;nbsp;We validate our approach with satellite image data of two massive wildfire events in Northern Alberta, Canada; the Fort McMurray fire of 2016 and the Richardson fire of 2011. The results show that we can learn predictive, agent-based policies as models of spatial dynamics using RL on readily available satellite images that other methods and have many additional advantages in terms of generalizability and interpretability.</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Maryam, Syeda</style></author><author><style face="normal" font="default" size="100%">McCrackin, Laura</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author><author><style face="normal" font="default" size="100%">Rathi, Yogesh</style></author><author><style face="normal" font="default" size="100%">Michailovich, Oleg</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Application of Probabilistically-Weighted Graphs to Image-Based Diagnosis of Alzheimer&amp;#39;s Disease using Diffusion MRI.</style></title><secondary-title><style face="normal" font="default" size="100%">SPIE Medical Imaging Conference on Computer-Aided Diagnosis</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2017</style></year><pub-dates><date><style  face="normal" font="default" size="100%">March 3</style></date></pub-dates></dates><urls><web-urls><url><style face="normal" font="default" size="100%">http://dx.doi.org/10.1117/12.2254164</style></url></web-urls></urls><publisher><style face="normal" font="default" size="100%">International Society for Optics and Photonics</style></publisher><pub-location><style face="normal" font="default" size="100%">Orlando, FL, United States</style></pub-location><volume><style face="normal" font="default" size="100%">10134</style></volume><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">&lt;p&gt;
	The world’s aging population has given rise to an increasing awareness towards neurodegenerative disorders, including Alzheimers Disease (AD). Treatment options for AD are currently limited, but it is believed that future success depends on our ability to detect the onset of the disease in its early stages. The most frequently used tools for this include neuropsychological assessments, along with genetic, proteomic, and image-based diagnosis. Recently, the applicability of Diffusion Magnetic Resonance Imaging (dMRI) analysis for early diagnosis of AD has also been reported. The sensitivity of dMRI to the microstructural organization of cerebral tissue makes it particularly well-suited to detecting changes which are known to occur in the early stages of AD. Existing dMRI approaches can be divided into two broad categories: region-based and tract-based. In this work, we propose a new approach, which extends region-based approaches to the simultaneous characterization of multiple brain regions. Given a predefined set of features derived from dMRI data, we compute the probabilistic distances between different brain regions and treat the resulting connectivity pattern as an undirected, fully-connected graph. The characteristics of this graph are then used as markers to discriminate between AD subjects and normal controls (NC). Although in this preliminary work we omit subjects in the prodromal stage of AD, mild cognitive impairment (MCI), our method demonstrates perfect separability between AD and NC subject groups with substantial margin, and thus holds promise for fine-grained stratification of NC, MCI and AD populations. © (2017) COPYRIGHT Society of Photo-Optical Instrumentation Engineers (SPIE). Downloading of the abstract is permitted for personal use only.
&lt;/p&gt;
</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Ganapathi Subramanian, Sriram</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Learning Forest Wildfire Dynamics from Satellite Images Using Reinforcement Learning</style></title><secondary-title><style face="normal" font="default" size="100%">Conference on Reinforcement Learning and Decision Making</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">asynchronous advantage actor-critic</style></keyword><keyword><style  face="normal" font="default" size="100%">deep reinforcement learning</style></keyword><keyword><style  face="normal" font="default" size="100%">forest wildfires</style></keyword><keyword><style  face="normal" font="default" size="100%">images</style></keyword><keyword><style  face="normal" font="default" size="100%">satellite</style></keyword><keyword><style  face="normal" font="default" size="100%">spatial informa-</style></keyword><keyword><style  face="normal" font="default" size="100%">spatiotemporal model</style></keyword><keyword><style  face="normal" font="default" size="100%">tion</style></keyword><keyword><style  face="normal" font="default" size="100%">value iteration</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2017</style></year></dates><pub-location><style face="normal" font="default" size="100%">Ann Arbor, MI, USA.</style></pub-location><language><style face="normal" font="default" size="100%">eng</style></language></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Salem, Mahmoud</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author><author><style face="normal" font="default" size="100%">Fischmeister, Sebastian</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Anomaly Detection Using Inter-Arrival Curves for Real-time Systems</style></title><secondary-title><style face="normal" font="default" size="100%">2016 28th Euromicro Conference on Real-Time Systems</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">anomaly detection</style></keyword><keyword><style  face="normal" font="default" size="100%">arrival curves</style></keyword><keyword><style  face="normal" font="default" size="100%">embedded systems</style></keyword><keyword><style  face="normal" font="default" size="100%">proj-MassiveTraceAnomalyDetection</style></keyword><keyword><style  face="normal" font="default" size="100%">trace analysis</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2016</style></year><pub-dates><date><style  face="normal" font="default" size="100%">jul</style></date></pub-dates></dates><pub-location><style face="normal" font="default" size="100%">Toulouse, France</style></pub-location><pages><style face="normal" font="default" size="100%">97–106</style></pages><isbn><style face="normal" font="default" size="100%">978-1-5090-2811-5</style></isbn><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">–-Real-time embedded systems are a significant class of applications, poised to grow even further as automated vehicles and the Internet of Things become a reality. An important problem for these systems is to detect anomalies during operation. Anomaly detection is a form of classification, which can be driven by data collected from the system at execution time. We propose inter-arrival curves as a novel analytic modelling technique for discrete event traces. Our approach relates to the existing technique of arrival curves and expands the technique to anomaly detection. Inter-arrival curves analyze the behaviour of events within a trace by providing upper and lower bounds to their inter-arrival occurrence. We exploit inter-arrival curves in a classification framework that detects deviations within these bounds for anomaly detection. Also, we show how inter-arrival curves act as good features to extract recurrent behaviour that these systems often exhibit. We demonstrate the feasibility and viability of the fully implemented approach with an industrial automotive case study (CAN traces) as well as a deployed aerospace case study (RTOS kernel traces).</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Salem, Mahmoud</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author><author><style face="normal" font="default" size="100%">Fischmeister, Sebastian</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Inter-Arrival Curves for Multi-Mode and Online Anomaly Detection</style></title><secondary-title><style face="normal" font="default" size="100%">Euromicro Conference on Real-Time Systems 2016 - Work-in-Progress Proceedings</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">anomaly detection</style></keyword><keyword><style  face="normal" font="default" size="100%">proj-MassiveTraceAnomalyDetection</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2016</style></year></dates><pub-location><style face="normal" font="default" size="100%">Toulouse, France</style></pub-location><language><style face="normal" font="default" size="100%">eng</style></language></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Salem, Mahmoud</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author><author><style face="normal" font="default" size="100%">Fischmeister, Sebastian</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Inter-Arrival Curves for Multi-Mode and Online Anomaly Detection</style></title><secondary-title><style face="normal" font="default" size="100%">Euromicro Conference on Real-Time Systems 2016 - Work-in-Progress Proceedings</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">anomaly detection</style></keyword><keyword><style  face="normal" font="default" size="100%">proj-MassiveTraceAnomalyDetection</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2016</style></year></dates><pub-location><style face="normal" font="default" size="100%">Toulouse, France</style></pub-location><language><style face="normal" font="default" size="100%">eng</style></language></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Answering Simple Questions About Spatially Spreading Systems</style></title><secondary-title><style face="normal" font="default" size="100%">2015 Summer Solstice: 7th International Conference on Discrete Models of Complex Systems</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2015</style></year></dates><language><style face="normal" font="default" size="100%">eng</style></language></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Taleghan, Majid Alkaee</style></author><author><style face="normal" font="default" size="100%">Dietterich, Thomas G.</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author><author><style face="normal" font="default" size="100%">Hall, Kim</style></author><author><style face="normal" font="default" size="100%">Albers, H. Jo</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">PAC Optimal MDP Planning with Application to Invasive Species Management</style></title><secondary-title><style face="normal" font="default" size="100%">Journal of Machine Learning Research</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">computational sustainability</style></keyword><keyword><style  face="normal" font="default" size="100%">Good- Turing estimate</style></keyword><keyword><style  face="normal" font="default" size="100%">grant-wici16</style></keyword><keyword><style  face="normal" font="default" size="100%">invasive species management</style></keyword><keyword><style  face="normal" font="default" size="100%">machine learning</style></keyword><keyword><style  face="normal" font="default" size="100%">Markov decision processes</style></keyword><keyword><style  face="normal" font="default" size="100%">mdp</style></keyword><keyword><style  face="normal" font="default" size="100%">optimization</style></keyword><keyword><style  face="normal" font="default" size="100%">reinforcement-learning</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2015</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">http://jmlr.org/papers/v16/taleghan15a.html</style></url></web-urls></urls><volume><style face="normal" font="default" size="100%">16</style></volume><pages><style face="normal" font="default" size="100%">3877–3903</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">In a simulator-defined MDP, the Markovian dynamics and rewards are provided in the form of a simulator from which samples can be drawn. This paper studies MDP planning algorithms that attempt to minimize the number of simulator calls before terminating and outputting a policy that is approximately optimal with high probability. The paper introduces two heuristics for efficient exploration and an improved confidence interval that enables earlier termination with probabilis- tic guarantees. We prove that the heuristics and the confidence interval are sound and produce with high probability an approximately optimal policy in polynomial time. Experiments on two benchmark problems and two instances of an invasive species management problem show that the improved confidence intervals and the new search heuristics yield reductions of between 8% and 47% in the number of simulator calls required to reach near-optimal policies.</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Using equilibrium policy gradients for spatiotemporal planning in forest ecosystem management</style></title><secondary-title><style face="normal" font="default" size="100%">IEEE Transactions on Computers</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">computational sustainability</style></keyword><keyword><style  face="normal" font="default" size="100%">Ecosystem management</style></keyword><keyword><style  face="normal" font="default" size="100%">Forestry planning</style></keyword><keyword><style  face="normal" font="default" size="100%">grant-wici16</style></keyword><keyword><style  face="normal" font="default" size="100%">machine learning</style></keyword><keyword><style  face="normal" font="default" size="100%">Markov decision processes</style></keyword><keyword><style  face="normal" font="default" size="100%">optimization</style></keyword><keyword><style  face="normal" font="default" size="100%">Policy gradient planning</style></keyword><keyword><style  face="normal" font="default" size="100%">proj-spatiallyspreadingprocess</style></keyword><keyword><style  face="normal" font="default" size="100%">reinforcement learning</style></keyword><keyword><style  face="normal" font="default" size="100%">spatiotemporal planning</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2014</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">http://doi.ieeecomputersociety.org/10.1109/TC.2013.113</style></url></web-urls></urls><publisher><style face="normal" font="default" size="100%">IEEE computer Society Digital Library. IEEE Computer Society.</style></publisher><volume><style face="normal" font="default" size="100%">63</style></volume><pages><style face="normal" font="default" size="100%">142–154</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">&lt;p&gt;
	Spatiotemporal planning involves making choices at multiple locations in space over some planning horizon to maximize utility and satisfy various constraints. In Forest Ecosystem Management, the problem is to choose actions for thousands of locations each year including harvesting, treating trees for fire or pests, or doing nothing. The utility models could place value on sale of lumber, ecosystem sustainability or employment levels and incorporate legal and logistical constraints on actions such as avoiding large contiguous areas of clearcutting. Simulators developed by forestry researchers provide detailed dynamics but are generally inaccesible black boxes. We model spatiotemporal planning as a factored Markov decision process and present a policy gradient planning algorithm to optimize a stochastic spatial policy using simulated dynamics. It is common in environmental and resource planning to have actions at different locations be spatially interelated; this makes representation and planning challenging. We define a global spatial policy in terms of interacting local policies defining distributions over actions at each location conditioned on actions at nearby locations. Markov chain Monte Carlo simulation is used to sample landscape policies and estimate their gradients. Evaluation is carried out on a forestry planning problem with 1,880 locations using a variety of value models and constraints. Index
&lt;/p&gt;
</style></abstract><issue><style face="normal" font="default" size="100%">1</style></issue></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>17</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Houtman, Rachel M.</style></author><author><style face="normal" font="default" size="100%">Montgomery, Claire A.</style></author><author><style face="normal" font="default" size="100%">Gagnon, Aaron R.</style></author><author><style face="normal" font="default" size="100%">Calkin, David E.</style></author><author><style face="normal" font="default" size="100%">Dietterich, Thomas G.</style></author><author><style face="normal" font="default" size="100%">McGregor, Sean</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Allowing a wildfire to burn: Estimating the effect on future fire suppression costs</style></title><secondary-title><style face="normal" font="default" size="100%">International Journal of Wildland Fire</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">bio-economic modelling</style></keyword><keyword><style  face="normal" font="default" size="100%">forest economics</style></keyword><keyword><style  face="normal" font="default" size="100%">forest fire policy</style></keyword><keyword><style  face="normal" font="default" size="100%">grant-wici16</style></keyword><keyword><style  face="normal" font="default" size="100%">proj-spatiallyspreadingprocess</style></keyword><keyword><style  face="normal" font="default" size="100%">spatial simulation</style></keyword><keyword><style  face="normal" font="default" size="100%">wildland fire management</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2013</style></year></dates><volume><style face="normal" font="default" size="100%">22</style></volume><pages><style face="normal" font="default" size="100%">871–882</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">Where a legacy of aggressive wildland fire suppression has left forests in need of fuel reduction, allowing wildland fire to burn may provide fuel treatment benefits, thereby reducing suppression costs from subsequent fires. The least-cost-plus-net-value-change model of wildland fire economics includes benefits of wildfire in a framework for evaluating suppression options. In this study, we estimated one component of that benefit – the expected present value of the reduction in suppression costs for subsequent fires arising from the fuel treatment effect of a current fire. To that end, we employed Monte Carlo methods to generate a set of scenarios for subsequent fire ignition and weather events, which are referred to as sample paths, for a study area in central Oregon. We simulated fire on the landscape over a 100-year time horizon using existing models of fire behaviour, vegetation and fuels development, and suppression effectiveness, and we estimated suppression costs using an existing suppression cost model. Our estimates suggest that the potential cost savings may be substantial. Further research is needed to estimate the full least-cost-plus-net-value-change model. This line of research will extend the set of tools available for developing wildfire management plans for forested landscapes.</style></abstract><issue><style face="normal" font="default" size="100%">7</style></issue></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Poole, David</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Cyclic causal models with discrete variables: Markov chain equilibrium semantics and sample ordering</style></title><secondary-title><style face="normal" font="default" size="100%">IJCAI International Joint Conference on Artificial Intelligence</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">cyclic causality</style></keyword><keyword><style  face="normal" font="default" size="100%">probabilistic inference</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2013</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">http://dl.acm.org/citation.cfm?id=2540281</style></url></web-urls></urls><pub-location><style face="normal" font="default" size="100%">Beijing, China</style></pub-location><pages><style face="normal" font="default" size="100%">1060–1068</style></pages><isbn><style face="normal" font="default" size="100%">9781577356332</style></isbn><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">We analyze the foundations of cyclic causal models for discrete variables, and compare structural equation models (SEMs) to an alternative semantics as the equilibrium (stationary) distribution of a Markov chain. We show under general conditions, discrete cyclic SEMs cannot have independent noise; even in the simplest case, cyclic structural equation models imply constraints on the noise. We give a formalization of an alternative Markov chain equilibrium semantics which requires not only the causal graph, but also a sample order. We show how the resulting equilibrium is a function of the sample ordering, both theoretically and empirically.</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Poole, David</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Cyclic causal models with discrete variables: Markov chain equilibrium semantics and sample ordering</style></title><secondary-title><style face="normal" font="default" size="100%">IJCAI International Joint Conference on Artificial Intelligence</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">Causality</style></keyword><keyword><style  face="normal" font="default" size="100%">probabilistic inference</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2013</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">http://dl.acm.org/citation.cfm?id=2540281</style></url></web-urls></urls><pub-location><style face="normal" font="default" size="100%">Beijing, China</style></pub-location><pages><style face="normal" font="default" size="100%">1060–1068</style></pages><isbn><style face="normal" font="default" size="100%">9781577356332</style></isbn><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">We analyze the foundations of cyclic causal models for discrete variables, and compare structural equation models (SEMs) to an alternative semantics as the equilibrium (stationary) distribution of a Markov chain. We show under general conditions, discrete cyclic SEMs cannot have independent noise; even in the simplest case, cyclic structural equation models imply constraints on the noise. We give a formalization of an alternative Markov chain equilibrium semantics which requires not only the causal graph, but also a sample order. We show how the resulting equilibrium is a function of the sample ordering, both theoretically and empirically.</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Dietterich, Thomas G</style></author><author><style face="normal" font="default" size="100%">Taleghan, Majid Alkaee</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">PAC Optimal Planning for Invasive Species Management: Improved Exploration for Reinforcement Learning from Simulator-Defined MDPs</style></title><secondary-title><style face="normal" font="default" size="100%">Proceedings of the AAAI Conference on Artificial Intelligence (AAAI-2013)</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">computational sustainability</style></keyword><keyword><style  face="normal" font="default" size="100%">invasive species</style></keyword><keyword><style  face="normal" font="default" size="100%">machine learning</style></keyword><keyword><style  face="normal" font="default" size="100%">mdp</style></keyword><keyword><style  face="normal" font="default" size="100%">PAC learning</style></keyword><keyword><style  face="normal" font="default" size="100%">planning</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2013</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">http://www.aaai.org/ocs/index.php/AAAI/AAAI13/paper/view/6478</style></url></web-urls></urls><pub-location><style face="normal" font="default" size="100%">Bellevue, WA, USA</style></pub-location><pages><style face="normal" font="default" size="100%">7</style></pages><isbn><style face="normal" font="default" size="100%">9781577356158</style></isbn><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">Often the most practical way to define a Markov Decision Process (MDP) is as a simulator that, given a state and an action, produces a resulting state and immediate reward sam- pled from the corresponding distributions. Simulators in nat- ural resource management can be very expensive to execute, so that the time required to solve such MDPs is dominated by the number of calls to the simulator. This paper presents an algorithm, DDV, that combines improved confidence inter- vals on the Q values (as in interval estimation) with a novel upper bound on the discounted state occupancy probabilities to intelligently choose state-action pairs to explore. We prove that this algorithm terminates with a policy whose value is within $ε$ of the optimal policy (with probability 1−$δ$) after making only polynomially-many calls to the simulator. Ex- periments on one benchmark MDP and on an MDP for inva- sive species management show very large reductions in the number of simulator calls required.</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Policy Gradient Optimization Using Equilibrium Policies for Spatial Planning Domains</style></title><secondary-title><style face="normal" font="default" size="100%">13th INFORMS Computing Society Conference</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2013</style></year></dates><pub-location><style face="normal" font="default" size="100%">Santa Fe, NM, United States</style></pub-location><language><style face="normal" font="default" size="100%">eng</style></language></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Hall, Kim</style></author><author><style face="normal" font="default" size="100%">Taleghan, Majid Alkaee</style></author><author><style face="normal" font="default" size="100%">Albers, Heidi J.</style></author><author><style face="normal" font="default" size="100%">Dietterich, Thomas</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Managing Invasive Species in a River Network</style></title><secondary-title><style face="normal" font="default" size="100%">Third International Conference on Computational Sustainability</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">computational sustainability</style></keyword><keyword><style  face="normal" font="default" size="100%">ecology</style></keyword><keyword><style  face="normal" font="default" size="100%">invasive species</style></keyword><keyword><style  face="normal" font="default" size="100%">reinforcement learning</style></keyword><keyword><style  face="normal" font="default" size="100%">value iteration</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2012</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">http://www.cs.ubc.ca/ crowley/papers/compsust2012.pdf</style></url></web-urls></urls><pub-location><style face="normal" font="default" size="100%">Copenhagen, Denmark</style></pub-location><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">There are many pest species that invade valuable ecosystems and damage ecosystem services. An important management goal is to limit the spread of these species or even eradicate them. From the computational perspective, these problems can be formulated as Markov Decision Processes. However, solving them is difficult, because of their spatial nature. Viewed abstractly, the current state of the ecosystem can be formalized as a graph. Each node of the graph may be invaded or not, and at each time step, a “local” action must be selected at each node. Consequently, at the level of the whole MDP, each global action is a vector of local actions of length N, the number of nodes in the graph. The exponentially large action space raises major computational problems. During the state transitions, the invasive species can spread along the edges of the graph. We study a specific instance of this general problem based on the spread of Tamarisk in river networks. In this formulation, we work with the dual of the standard river network graph. Each node corresponds to a stretch of the river between two confluences (also called a “reach”). We divide each reach into H “slots”, where each slot can either be occupied by a single native tree, occupied by a single Tamarisk tree, or empty. In each time step, the plants (both native and invader) first must survive through winter (with some probability). Then, the surviving trees produce seeds that disperse through the river network according to a dispersal kernel that prefers downstream movement with high probability. Each dispersing seed lands at some slot in some node of the network. All seeds arriving at a slot compete to establish themselves in that slot (if the slot is empty). In our model, there are three local actions: simple eradication of invasive plants, eradication followed by planting of native plants, or doing nothing at all. Doing nothing costs nothing, while eradication is more expensive, and eradication plus planting is the most expensive. One local action must be chosen for each node (reach) at each time step, and it applies to all H slots at that node. The actions only succeed stochastically. The planning goal is to minimize the expected discounted sum of the cost of invasive damage and the cost of management actions. To solve this planning problem, we apply the standard value iteration algorithm. However, standard value iteration requires a fully-specified state transition matrix. In our problem, it is easy to write a simulator for drawing samples of the dispersal and competition processes, but it is computationally intractable to compute the exact transition probabilities. Hence, we estimate those probabilities by drawing a large number of samples from the simulator. Once we have an optimal policy on the approximated MDP, we explore its behaviour both directly and via simulated rollouts. The optimal policy can be visualized with one picture per state annotated by the types of plants in each reach and the optimal action that should be taken in each reach. We also run the optimal policy forward in time on many simulated trajectories in order to gather statistics about its behaviour such as: how long it takes to reach a steady state; average cost of a policy over time; or the frequency with which completely invaded or uninvaded states are reached. Comparisons of these statistics show that in many situations our optimal policies greatly outperform established rules of thumb from the literature. An example of a rule of thumb is to always prioritize treatment of upstream over downstream reaches. Another rule targets the most invaded reaches (those with the most slots occupied by Tamarisk) first. We have found that the optimality of these rules of thumb depends strongly on the dispersal and competition parameters. Some contributions this work makes to bioeconomics and ecology are showing evidence that the stochastic spread of the invasion process needs to be modelled and that the spatial characteristics of the system under invasion are relevant to optimal management. The computational challenges raised by this problem have led us into research on methods for planning that can provide bounds on the optimality of the policy while minimizing the number of calls to expensive simulators. We are also studying algorithms that can scale to large graphs with thousands of nodes and large out-degrees. Finally, we see potential in these problems for intelligently reducing the size of the state space and the complexity of the policy description by utilizing domain knowledge.</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>32</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Equilibrium Policy Gradients for Spatiotemporal Planning</style></title></titles><keywords><keyword><style  face="normal" font="default" size="100%">computational sustainability</style></keyword><keyword><style  face="normal" font="default" size="100%">graphical models</style></keyword><keyword><style  face="normal" font="default" size="100%">machine learning</style></keyword><keyword><style  face="normal" font="default" size="100%">mdp</style></keyword><keyword><style  face="normal" font="default" size="100%">reinforcement learning</style></keyword><keyword><style  face="normal" font="default" size="100%">spatial planning</style></keyword><keyword><style  face="normal" font="default" size="100%">spatiotemporal planning</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2011</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">http://hdl.handle.net/2429/38971</style></url></web-urls></urls><publisher><style face="normal" font="default" size="100%">University of British Columbia</style></publisher><pages><style face="normal" font="default" size="100%">187 pages</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">In spatiotemporal planning, agents choose actions at multiple locations in space over some planning horizon to maximize their utility and satisfy various constraints. In forestry planning, for example, the problem is to choose actions for thousands of locations in the forest each year. The actions at each location could include harvesting trees, treating trees against disease and pests, or doing nothing. A utility model could place value on sale of forest products, ecosystem sustainability or employment levels, and could incorporate legal and logistical constraints such as avoiding large contiguous areas of clearcutting and managing road access. Planning requires a model of the dynamics. Existing simulators developed by forestry researchers can provide detailed models of the dynamics of a forest over time, but these simulators are often not designed for use in automated planning. This thesis presents spatiotemoral planning in terms of factored Markov decision processes. A policy gradient planning algorithm optimizes a stochastic spatial policy using existing simulators for dynamics. When a planning problem includes spatial interaction between locations, deciding on an action to carry out at one location requires considering the actions performed at other locations. This spatial interdependence is common in forestry and other environmental planning problems and makes policy representation and planning challenging. We define a spatial policy in terms of local policies defined as distributions over actions at one location conditioned upon actions at other locations. A policy gradient planning algorithm using this spatial policy is presented which uses Markov Chain Monte Carlo simulation to sample the landscape policy, estimate its gradient and use this gradient to guide policy improvement. Evaluation is carried out on a forestry planning problem with 1880 locations using a variety of value models and constraints. The distribution over joint actions at all locations can be seen as the equilibrium of a cyclic causal model. This equilibrium semantics is compared to Structural Equation Models. We also define an algorithm for approximating the equilibrium distribution for cyclic causal networks which exploits graphical structure and analyse when the algorithm is exact.</style></abstract><work-type><style face="normal" font="default" size="100%">phdPh.D.</style></work-type></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author><author><style face="normal" font="default" size="100%">Poole, David</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Policy gradient planning for environmental decision making with existing simulators</style></title><secondary-title><style face="normal" font="default" size="100%">25th AAAI Conference on Artificial Intelligence (AAAI-11)</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">mdp</style></keyword><keyword><style  face="normal" font="default" size="100%">reinforcement learning</style></keyword><keyword><style  face="normal" font="default" size="100%">spatiotemporal planning</style></keyword><keyword><style  face="normal" font="default" size="100%">thesis research</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2011</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">https://www.scopus.com/record/display.uri?eid=2-s2.0-80055051332&amp;origin=inward&amp;txGid=de2006c39235aac9ba20cf0e76073dd9</style></url></web-urls></urls><pub-location><style face="normal" font="default" size="100%">San Francisco</style></pub-location><volume><style face="normal" font="default" size="100%">2</style></volume><pages><style face="normal" font="default" size="100%">1323–1330</style></pages><isbn><style face="normal" font="default" size="100%">9781577355090</style></isbn><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">In environmental and natural resource planning domains actions are taken at a large number of locations over multiple time periods. These problems have enormous state and action spaces, spatial correlation between actions, uncertainty and complex utility models. We present an approach for modeling these planning problems as factored Markov decision processes. The reward model can contain local and global components as well as spatial constraints between locations. The transition dynamics can be provided by existing simulators developed by domain experts. We propose a landscape policy defined as the equilibrium distribution of a Markov chain built from many locally-parameterized policies. This policy is optimized using a policy gradient algorithm. Experiments using a forestry simulator demonstrate the algorithm's ability to devise policies for sustainable harvest planning of a forest. Copyright © 2011, Association for the Advancement of Artificial Intelligence. All rights reserved.</style></abstract><issue><style face="normal" font="default" size="100%">1</style></issue></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Patitsas, Elizabeth</style></author><author><style face="normal" font="default" size="100%">Voll, Kimberly</style></author><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author><author><style face="normal" font="default" size="100%">Wolfman, Steven</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Circuits and logic in the lab : Toward a coherent picture of computation</style></title><secondary-title><style face="normal" font="default" size="100%">15th Western Canadian Conference on Computing Education</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">curriculum</style></keyword><keyword><style  face="normal" font="default" size="100%">education research</style></keyword><keyword><style  face="normal" font="default" size="100%">Pedagogy</style></keyword><keyword><style  face="normal" font="default" size="100%">survey</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2010</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">http://www.cs.ubc.ca/ crowley/papers/wccce2010.pdf</style></url></web-urls></urls><pub-location><style face="normal" font="default" size="100%">Kelowna, BC, Canada</style></pub-location><isbn><style face="normal" font="default" size="100%">9781450300988</style></isbn><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">We describe extensive modifications made over time to a first year computer science course at the University of British Columbia covering logic and digital circuits (among other topics). Smoothly integrating the hardware-based labs with the more theory-based lectures into a cohesive picture of computation has always been a challenge in this course. The seeming disconnect between implementation and abstraction has historically led to frustration and dissatisfaction among students. We describe changes to the lab curriculum, equipment logistics, the style of in-lab activities and evaluation. We have also made logistical changes to the management and ongoing training of teaching assistants, allowing us to better anchor our larger course story into the lab curriculum. These changes have greatly improved student and TA opinions of the lab experience, as well as the overall course.</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author><author><style face="normal" font="default" size="100%">Nelson, John</style></author><author><style face="normal" font="default" size="100%">Poole, David</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Seeing the Forest Despite the Trees : Large Scale Spatial-Temporal Decision Making</style></title><secondary-title><style face="normal" font="default" size="100%">Conference on Uncertainty in Artificial Intelligence (UAI09)</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">computational sustainability</style></keyword><keyword><style  face="normal" font="default" size="100%">Forest Management</style></keyword><keyword><style  face="normal" font="default" size="100%">inference</style></keyword><keyword><style  face="normal" font="default" size="100%">mdp</style></keyword><keyword><style  face="normal" font="default" size="100%">planning</style></keyword><keyword><style  face="normal" font="default" size="100%">spatiotemporal planning</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2009</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">http://www.cs.ubc.ca/ crowley/papers/uai09-mark-crowley.pdf</style></url></web-urls></urls><pub-location><style face="normal" font="default" size="100%">Montreal, Canada</style></pub-location><pages><style face="normal" font="default" size="100%">126–134</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">We introduce a challenging real world planning problem where actions must be taken at each location in a spatial area at each point in time. We use forestry planning as the motivating application. In Large Scale Spatial-Temporal (LSST) planning problems, the state and action spaces are defined as the cross-products of many local state and action spaces spread over a large spatial area such as a city or forest. These problems possess state uncertainty, have complex utility functions involving spatial constraints and we generally must rely on simulations rather than an explicit transition model. We define LSST problems as reinforcement learning prob- lems and present a solution using policy gradients. We compare two different policy formulations: an explicit policy that identifies each location in space and the action to take there; and an abstract policy that defines the proportion of actions to take across all locations in space. We show that the abstract policy is more robust and achieves higher rewards with far fewer parameters than the elementary policy. This abstract policy is also a better fit to the properties that practitioners in LSST problem domains require for such methods to be widely useful</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author><author><style face="normal" font="default" size="100%">Nelson, John</style></author><author><style face="normal" font="default" size="100%">Poole, David</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Seeing the Forest Despite the Trees : Large Scale Spatial-Temporal Decision Making</style></title><secondary-title><style face="normal" font="default" size="100%">Conference on Uncertainty in Artificial Intelligence (UAI09)</style></secondary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">computational sustainability</style></keyword><keyword><style  face="normal" font="default" size="100%">forestry</style></keyword><keyword><style  face="normal" font="default" size="100%">inference</style></keyword><keyword><style  face="normal" font="default" size="100%">mdp</style></keyword><keyword><style  face="normal" font="default" size="100%">planning</style></keyword><keyword><style  face="normal" font="default" size="100%">spatiotemporal planning</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2009</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">http://www.cs.ubc.ca/ crowley/papers/uai09-mark-crowley.pdf</style></url></web-urls></urls><pub-location><style face="normal" font="default" size="100%">Montreal, Canada</style></pub-location><pages><style face="normal" font="default" size="100%">126–134</style></pages><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">We introduce a challenging real world planning problem where actions must be taken at each location in a spatial area at each point in time. We use forestry planning as the motivating application. In Large Scale Spatial-Temporal (LSST) planning problems, the state and action spaces are defined as the cross-products of many local state and action spaces spread over a large spatial area such as a city or forest. These problems possess state uncertainty, have complex utility functions involving spatial constraints and we generally must rely on simulations rather than an explicit transition model. We define LSST problems as reinforcement learning prob- lems and present a solution using policy gradients. We compare two different policy formulations: an explicit policy that identifies each location in space and the action to take there; and an abstract policy that defines the proportion of actions to take across all locations in space. We show that the abstract policy is more robust and achieves higher rewards with far fewer parameters than the elementary policy. This abstract policy is also a better fit to the properties that practitioners in LSST problem domains require for such methods to be widely useful</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>47</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author><author><style face="normal" font="default" size="100%">Boerlage, Brent</style></author><author><style face="normal" font="default" size="100%">Poole, David</style></author><author><style face="normal" font="default" size="100%">Kobti, Ziad</style></author><author><style face="normal" font="default" size="100%">Wu, Dan</style></author></authors><secondary-authors><author><style face="normal" font="default" size="100%">Kobti, Ziad</style></author><author><style face="normal" font="default" size="100%">Wu, Dan</style></author></secondary-authors></contributors><titles><title><style face="normal" font="default" size="100%">Adding Local Constraints to Bayesian Networks</style></title><secondary-title><style face="normal" font="default" size="100%">Advances in Artificial Intelligence</style></secondary-title><tertiary-title><style face="normal" font="default" size="100%">Lecture Notes in Computer Science</style></tertiary-title></titles><keywords><keyword><style  face="normal" font="default" size="100%">bayesian networks</style></keyword><keyword><style  face="normal" font="default" size="100%">Computer Science</style></keyword><keyword><style  face="normal" font="default" size="100%">constraints</style></keyword><keyword><style  face="normal" font="default" size="100%">probabilistic inference</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2007</style></year></dates><urls><web-urls><url><style face="normal" font="default" size="100%">http://www.springerlink.com/content/u1j205nhr750m717/</style></url></web-urls></urls><edition><style face="normal" font="default" size="100%">4509</style></edition><publisher><style face="normal" font="default" size="100%">Springer Berlin Heidelberg</style></publisher><pub-location><style face="normal" font="default" size="100%">Canadian AI Conference, Montreal, Quebec, Canada, 2007.</style></pub-location><volume><style face="normal" font="default" size="100%">4509</style></volume><pages><style face="normal" font="default" size="100%">344–355</style></pages><isbn><style face="normal" font="default" size="100%">978-3-540-72664-7</style></isbn><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">When using Bayesian networks, practitioners often express constraints among variables by conditioning a common child node to induce the desired distribution. For example, an ‘or' constraint can be easily expressed by a node modelling a logical ‘or' of its parents' values being conditioned to true. This has the desired effect that at least one parent must be true. However, conditioning also alters the distributions of further ancestors in the network. In this paper we argue that these side effects are undesirable when constraints are added during model design. We describe a method called shielding to remove these side effects while remaining within the directed language of Bayesian networks. This method is then compared to chain graphs which allow undirected and directed edges and which model equivalent distributions. Thus, in addition to solving this common modelling problem, shielded Bayesian networks provide a novel method for implementing chain graphs with existing Bayesian network tools.</style></abstract></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>32</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Shielding Against Conditioning Side-Effects in Graphical Models</style></title></titles><keywords><keyword><style  face="normal" font="default" size="100%">bayesian networks</style></keyword><keyword><style  face="normal" font="default" size="100%">constraints</style></keyword><keyword><style  face="normal" font="default" size="100%">inference</style></keyword><keyword><style  face="normal" font="default" size="100%">influence diagrams</style></keyword><keyword><style  face="normal" font="default" size="100%">side-effects</style></keyword></keywords><dates><year><style  face="normal" font="default" size="100%">2005</style></year></dates><publisher><style face="normal" font="default" size="100%">University of British Columbia</style></publisher><language><style face="normal" font="default" size="100%">eng</style></language><issue><style face="normal" font="default" size="100%">October</style></issue><work-type><style face="normal" font="default" size="100%">PhD Thesis</style></work-type></record><record><source-app name="Biblio" version="7.x">Drupal-Biblio</source-app><ref-type>34</ref-type><contributors><authors><author><style face="normal" font="default" size="100%">Crowley, Mark</style></author></authors></contributors><titles><title><style face="normal" font="default" size="100%">Evaluating Influence Diagrams</style></title><secondary-title><style face="normal" font="default" size="100%">Unpublished Working Paper</style></secondary-title></titles><dates><year><style  face="normal" font="default" size="100%">2004</style></year></dates><language><style face="normal" font="default" size="100%">eng</style></language><abstract><style face="normal" font="default" size="100%">In this unpublished literature review we will survey the history of Influence Diagrams from their origin in Decision Theory to modern AI uses. We will compare the various methods that have been used to find an optimal strategy for a given influence diagram. These methods have various advantages under different assumptions and there is a pro- gression to the present of richer, more general solutions. We also look at an abstract algorithm used as an informal tool and determine whether it is equivalent to other formal methods in the literature.</style></abstract></record></records></xml>