Abstract
The transformer model encounters challenges with variable-length input sequences, leading to efficiency and scalability concerns. To overcome this, we propose a pyramid-based hierarchical spatial-spectral transformer (PyFormer). This innovative approach organizes input data hierarchically into pyramid segments, each representing distinct abstraction levels, thereby enhancing processing efficiency. At each level, a dedicated transformer encoder is applied, effectively capturing both local and global context. Integration of outputs from different levels culminates in the final input representation. In short, the pyramid excels at capturing spatial features and local patterns, while the transformer effectively models spatial-spectral correlations and long-range dependencies. Experimental results underscore the superiority of the proposed method over state-of-The-Art approaches, achieving overall accuracies of 96.28% for the Pavia University dataset and 97.36% for the University of Houston dataset. In addition, the incorporation of disjoint samples augments robustness and reliability, thereby highlighting the potential of PyFormer in advancing hyperspectral image classification (HSIC).
| Original language | English |
|---|---|
| Pages (from-to) | 17681-17689 |
| Number of pages | 9 |
| Journal | IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing |
| Volume | 17 |
| DOIs | |
| State | Published - 2024 |
| Externally published | Yes |
Bibliographical note
Publisher Copyright:© 2008-2012 IEEE.
Keywords
- Pyramid network
- hyperspectral image classification (HSIC)
- spatial-spectral transformer (SST)
ASJC Scopus subject areas
- Computers in Earth Sciences
- Atmospheric Science