Abstract: Underwater image captioning bridges the gap between visual perception and semantic understanding of underwater scenes, playing a crucial role in applications such as ocean geoscience and ...
Abstract: State-of-the-art audio captioning methods typically use the encoder-decoder structure with pretrained audio neural networks (PANNs) as encoders for feature extraction. However, the ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results