Abstract: Transformer-based object detection models usually adopt an encoding-decoding architecture that mainly combines self-attention (SA) and multilayer perceptron (MLP). Although this architecture ...
Abstract: Multi-object tracking (MOT) aims to estimate the bounding boxes and ID labels of objects in videos. The challenging issue in this task is to alleviate competitive learning between the ...
Note that Python version 3.9.2 might be required for compatilibity with the numba package (on March 2022). Note that the latest Python version (3.13) is currently not ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results