Abstract: The rotation of camera during scanning imaging induces the image degradation, which means both resident space objects (RSOs) and stars have the same morphological characteristics in the ...
InstructSAM is a training-free framework for Instruction-Oriented Object Counting, Detection, and Segmentation (InstructCDS). We construct EarthInstruct, an InstructCDS benchmark for remote sensing.
Abstract: Goal-Oriented Vision-and-Language Navigation (VLN) aims to enable agents to navigate to specified locations and identify designated target objects following natural language instruction.