Abstract: Artificial intelligence (AI) driven speech emotion recognition (SER) is bringing in more flexible and context-aware solutions in human-computer interaction (HCI). Conventional SER models ...
Abstract: Recently, audio-visual speech recognition has attracted increasing attention. However, most existing works only focused on scenarios with two speakers. In this work, we study the effect of ...