Generalized Decoding for Pixel, Image, and Language Image Captioning

Advancing Myopia To Holism: Fully Contrastive Language-Image Pre-training

Abstract: In the rapidly evolving field of vision-language models (VLMs), contrastive language-image pre-training (CLIP) has made significant strides, becoming a foundation for various downstream tasks.

IEEE

SAIST: Segment Any Infrared Small Target Model Guided by Contrastive Language-Image Pretraining

Abstract: Infrared Small Target Detection (IRSTD) aims to identify low signal-to-noise ratio small targets in infrared images with complex backgrounds, which is crucial for various applications.


