Abstract: The aim of the violent recognition task is to determine whether a video contains violent behaviors. Given that violent behavior often comes with visual and audio anomalies, multimodal ...
Abstract: Affective Video Facial Analysis (AVFA) is important for advancing emotion-aware AI, yet the persistent data scarcity in AVFA presents challenges. Recently, the self-supervised learning (SSL) ...
A collection of videos chosen for their oddly calming and satisfying visuals. Trump moves to dismantle major US climate research center in Colorado Border Patrol commander returns to Chicago as agents ...
Abstract: Weakly-supervised audio-visual video parsing (WS-AVVP) aims to localize the temporal extents of audio, visual and audio-visual event instances as well as identify the corresponding event ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results