Abstract: Visual soft attention has been widely adopted in image captioning models. Traditional Soft Attention Mechanism (TSAM) assigns a weight to a certain region by using a multilayer perceptron ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results