AI seminar: Integrating value-directed compression and belief compression for POMDPs
Speaker: Xin Li (Hong Kong Baptist University)
Partially observable Markov decision process (POMDP) is a commonly adopted framework to model planning problems in a stochastic environment. However high dimensionality of POMDP's belief space is still one major cause for making the underlying optimal policy computation intractable.