Abstract: Human skeleton sequences are widely used for action recognition due to their robustness to background noise and computational efficiency. In this paper, we propose a transformer-based method ...
Abstract: Recent studies have integrated convolutions into transformers to introduce inductive bias and improve generalization performance. However, the static nature of conventional convolution ...