Abstract: Hyperspectral imagery (HSI) captured by uncrewed aerial vehicles (UAVs) is distinguished by superior spatial resolution and intricate spectral detail, with widespread applications in precise ...
We introduce OneThinker, an all-in-one multimodal reasoning generalist that is capable of thinking across a wide range of fundamental visual tasks within a single model. OneThinker demonstrates strong ...
Abstract: Foundation models have achieved remarkable breakthroughs across various domains, with the wide use of masked image modeling (MIM) and self-supervised learning (SSL). However, these models ...