TY - JOUR
T1 - MCE-ST
T2 - Classifying crop stress using hyperspectral data with a multiscale conformer encoder and spectral-based tokens
AU - Khotimah, Wijayanti Nurul
AU - Bennamoun, Mohammed
AU - Boussaid, Farid
AU - Xu, Lian
AU - Edwards, David
AU - Sohel, Ferdous
N1 - Publisher Copyright:
© 2023
PY - 2023/4
Y1 - 2023/4
N2 - Recently, transformers have achieved great success in a number of computer vision tasks due to their excellent ability to capture long-range feature dependencies. In contrast, convolutional neural networks (CNNs) are good at extracting local features. Given that the capture of short- and long-range band dependencies are both important for hyperspectral data classification, we propose MCE-ST, a convolution-transformer (conformer) based framework capable of exploiting the complementary strengths of transformers and CNNs. In contrast to the conventional transformer, which uses a linear projection for tokenization, the proposed MCE-ST uses a convolution-based tokenization method to extract local dependency between spectral bands. Moreover, since different hyperspectral samples may have different spans of local relationships, a multiscale conformer encoder (MCE) comprising two separate branches of depth-wise dilated convolution with different kernel sizes is used to extract the different spans of the local interactions between tokens. We conducted experiments on four salt stress datasets and one cassava disease dataset. The results show that the proposed MCE-ST outperforms the state-of-the-art techniques for crop stress classification using hyperspectral data. The code for MCE-ST is publicly available at https://github.com/Weejaa04/MCE-ST-GitHub.
AB - Recently, transformers have achieved great success in a number of computer vision tasks due to their excellent ability to capture long-range feature dependencies. In contrast, convolutional neural networks (CNNs) are good at extracting local features. Given that the capture of short- and long-range band dependencies are both important for hyperspectral data classification, we propose MCE-ST, a convolution-transformer (conformer) based framework capable of exploiting the complementary strengths of transformers and CNNs. In contrast to the conventional transformer, which uses a linear projection for tokenization, the proposed MCE-ST uses a convolution-based tokenization method to extract local dependency between spectral bands. Moreover, since different hyperspectral samples may have different spans of local relationships, a multiscale conformer encoder (MCE) comprising two separate branches of depth-wise dilated convolution with different kernel sizes is used to extract the different spans of the local interactions between tokens. We conducted experiments on four salt stress datasets and one cassava disease dataset. The results show that the proposed MCE-ST outperforms the state-of-the-art techniques for crop stress classification using hyperspectral data. The code for MCE-ST is publicly available at https://github.com/Weejaa04/MCE-ST-GitHub.
KW - Deep learning
KW - Disease classification
KW - Hyperspectral
KW - Multiscale conformer encoder
KW - Spectral-to-tokens
KW - Stress classification
KW - Transformer
UR - http://www.scopus.com/inward/record.url?scp=85151660356&partnerID=8YFLogxK
U2 - 10.1016/j.jag.2023.103286
DO - 10.1016/j.jag.2023.103286
M3 - Article
AN - SCOPUS:85151660356
SN - 1569-8432
VL - 118
JO - International Journal of Applied Earth Observation and Geoinformation
JF - International Journal of Applied Earth Observation and Geoinformation
M1 - 103286
ER -