We introduce FarSLIP, a vision-language foundation model for remote sensing (RS) that achieves fine-grained vision-language alignment. FarSLIP demonstrates state-of-the-art performance on both ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results