Abstract
The accurate segmentation of land cover in high-resolution remote sensing imagery is crucial for applications such as urban planning, environmental monitoring, and disaster management. However, traditional convolutional neural networks (CNNs) struggle to balance fine-grained local detail with large-scale contextual information. To tackle these challenges, we combine large-kernel convolutions, attention mechanisms, and multi-scale feature fusion to form a novel LKAFFNet framework that introduces the following three key modules: LkResNet, which enhances feature extraction through parameterizable large-kernel convolutions; Large-Kernel Attention Aggregation (LKAA), integrating spatial and channel attention; and Channel Difference Features Shift Fusion (CDFSF), which enables efficient multi-scale feature fusion. Experimental comparisons demonstrate that LKAFFNet outperforms previous models on both the LandCover dataset and WHU Building dataset, particularly in cases with diverse scales. Specifically, it achieved a mIoU of 0.8155 on the LandCover dataset and 0.9326 on the WHU Building dataset. These findings suggest that LKAFFNet significantly improves land cover segmentation performance, offering a more effective tool for remote sensing applications.
Original language | English |
---|---|
Article number | 54 |
Journal | Sensors |
Volume | 25 |
Issue number | 1 |
DOIs | |
Publication status | Published - Jan 2025 |
Keywords
- CNN
- deep learning
- feature restoration
- smart city
- sustainable building
- urban land use