Visual Objects Programming Language

Visual-Linguistic Feature Alignment With Semantic and Kinematic Guidance for Referring Multi-Object Tracking

Abstract: Referring Multi-Object Tracking (RMOT) aims to dynamically track an arbitrary number of referred targets in a video sequence according to the language expression. Previous methods mainly ...

IEEE

Do Visual Imaginations Improve Vision-and-Language Navigation Agents?

Abstract: Vision-and-Language Navigation (VLN) agents are tasked with navigating an unseen environment using natural language instructions. In this work, we study if visual representations of ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Visual-Linguistic Feature Alignment With Semantic and Kinematic Guidance for Referring Multi-Object Tracking

Do Visual Imaginations Improve Vision-and-Language Navigation Agents?

Trending now