Describing Objects Worksheets

Triplet Network Template for Siamese Trackers

Abstract: Siamese network based trackers describe object tracking as a similarity matching problem and these trackers achieve state-of-the-art performance on multiple benchmarks. However, due to the ...

GitHub

Contextual Object Detection with Multimodal Large Language Models

Recent Multimodal Large Language Models (MLLMs) are remarkable in vision-language tasks, such as image captioning and question answering, but lack the essential perception ability, i.e., object ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Triplet Network Template for Siamese Trackers

Contextual Object Detection with Multimodal Large Language Models

Trending now