Encoder/Decoder Models Differences

Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders

Penguin-VL is a compact vision-language model family built to study how far multimodal efficiency can be pushed by redesigning the vision encoder, rather than only scaling data or model size.

IEEE

ERD: Encoder-Residual-Decoder Neural Network for Underwater Image Enhancement

Abstract: In underwater environments, the absorption and scattering of light often result in various types of degradation in captured images, including color cast, low contrast, low brightness, and ...

Some results have been hidden because they may be inaccessible to you

Show inaccessible results

Penguin-VL: Exploring the Efficiency Limits of VLM with LLM-based Vision Encoders

ERD: Encoder-Residual-Decoder Neural Network for Underwater Image Enhancement

Trending now