Lee, S., Lee, J., & Hasuo, I. (2020). Predictive PER: Balancing Priority and Diversity towards Stable Deep Reinforcement Learning. Conference paper. Retrieved from https://sites.google.com/view/deep-rl-workshop-neurips2020/home
References
2020
Bouchard, F. (2020). Expert System and a Rule Set Development Method for Urban Behaviour Planning. Retrieved from http://hdl.handle.net/10012/15864
Salay, R., Czarnecki, K., Alvarez, I., Elli, M. S., Sedwards, S., & Weast, J. (2020). PURSS: Towards Perceptual Uncertainty Aware Responsibility Sensitive Safety with ML. Conference paper. New York: CEUR.
Gaurav, A. (2020). Safety-Oriented Stability Biases for Continual Learning. Waterloo. Retrieved from https://uwspace.uwaterloo.ca/handle/10012/15579
Chen, W. T. (2020). Accelerating the Training of Convolutional Neural Networks for Image Segmentation with Deep Active Learning. Waterloo. Retrieved from https://uwspace.uwaterloo.ca/handle/10012/15537
Gaurav, A., Vernekar, S., Lee, J., Sedwards, S., Abdelzad, V., & Czarnecki, K. (2020). Simple Continual Learning Strategies for Safer Classifers. Conference paper. CEUR. Retrieved from http://ceur-ws.org/Vol-2560/paper6.pdf
Chen, H., Cohen, R., Dautenhahn, K., Law, E., & Czarnecki, K. (2020). Autonomous Vehicle Visual Signals for Pedestrians: Experiments and Design Recommendations. Conference paper. Retrieved from https://arxiv.org/abs/2010.05115
Vernekar, S. (2020). Training Reject-Classifiers for Out-of-distribution Detection via Explicit Boundary Sample Generation. Waterloo. Retrieved from http://hdl.handle.net/10012/15582
Denouden, T. (2020). An Application of Out-of-Distribution Detection for Two-Stage Object Detection Networks. Waterloo. Retrieved from https://uwspace.uwaterloo.ca/handle/10012/15646
Jhunjhunwala, A., Lee, J., Sedwards, S., Abdelzad, V., & Czarnecki, K. (2020). Improved Policy Extraction via Online Q-Value Distillation. Conference paper. Glasgow: IEEE.