Abstract: Visual grounding, i.e., localizing objects in images according to natural language queries, is an important topic in visual language understanding. The most effective approaches for this ...