Automatic Recognition and Generation of Affective Movements
Robert Gorbet and Dana Kulic
Body movements are an important non-verbal communication medium through which affective states of the demonstrator can be discerned. For machines, the capability to recognize affective expressions of their users and generate appropriate actuated responses with recognizable affective content has the potential to improve their life-like attributes and to create an engaging, entertaining, and empathic human-machine interaction.
This thesis develops approaches to systematically identify movement features most salient to affective expressions and to exploit these features to design computational models for automatic recognition and generation of affective movements. The proposed approaches enable 1) identifying which features of movement convey affective expressions, 2) the automatic recognition of affective expressions from movements, 3) understanding the impact of kinematic embodiment on the perception of affective movements, and 4) adapting pre-defined motion paths in order to “overlay”' specific affective content.
Statistical learning and stochastic modeling approaches are leveraged, extended, and adapted to derive a concise representation of the movements that isolates movement features salient to affective expressions and enables efficient and accurate affective movement recognition and generation. In particular, salient movement feature identification is achieved via discriminative movement modeling in terms of 1) functional features represented by a weighted linear combination of a fixed number of basis functions, and 2) stochastically-transformed movement features (Fisher scores). For functional representation, dimensionality reduction techniques (namely, principal component analysis (PCA), Fisher discriminant analysis, Isomap) are adapted for functional datasets and applied in the basis function space to extract a minimal set of features along which affect-specific movements are best separable. Furthermore, the centroids of affect-specific clusters of movements in the resulting functional PCA subspace along with the inverse mapping of functional PCA are used to generate prototypical movements for each affective expression.
The functional discriminative modeling is however limited to cases where affect-specific movements also have similar kinematic trajectories and does not address the interpersonal and stochastic variations inherent to bodily expression of affect. This thesis presents the Fisher score representation as an alternative movement representation approach to account for kinematic, interpersonal, and stochastic variations. The Fisher scores are derived from affect-specific hidden Markov model encoding of the movements and exploited to discriminate between different affective expressions using a support vector machine (SVM) classification. Furthermore, a minimal set of movement features most salient to discriminating between different affective expressions is identified by mapping the Fisher scores to a low-dimensional space using supervised PCA (sPCA) driven by the Hilbert Schmidt independence criterion. The resulting subspace forms a suitable basis for affective movement recognition using nearest neighbour classification and retains the high recognition rates achieved by SVM classification in the Fisher score space. The dimensions of the sPCA subspace form a minimal set of salient features and are used to explore movement kinematic and dynamic cues that connote affective expressions.
Furthermore, the thesis explores the potential of movement notation systems from the dance community (specifically, the Laban system) for abstract coding and computational analysis of the movements. In particular, a quantification approach for Laban Effort and Shape is proposed and used in developing a computational model for affective movement generation. Using the Laban movement analysis, the proposed generation approach searches a labeled dataset for movements that are kinematically similar to a desired motion path and convey a target emotion. A hidden Markov model of the identified movements is obtained and used with the desired motion path in the Viterbi state estimation. The estimated state sequence is then used to generate a novel movement that is a version of the desired motion path, modulated to convey the target emotion. Various affective human movement corpora are used to evaluate and demonstrate the efficacy of the developed approaches for the automatic recognition and generation of affective expressions in movements.
Finally, the thesis assesses the human perception of affective movements and the impact of display embodiment and the observer's gender on the affective movement perception via user studies in which participants rate the expressivity of synthetically-generated and human-generated affective movements animated on anthropomorphic and non-anthropomorphic embodiments. The user studies show that the human perception of affective movements is mainly shaped by intended emotions, and that the display embodiment and the observer's gender can significantly impact the perception of affective movements.